Using with Docker

To build docker images locally, run the following command via sbt:

sbt dockerize

NOTE: this depends on having OpenJDK 11 installed on your machine.

We also publish images to dockerhub (see below for information on our docker images).

Docker image for annotating text and indexing Odinson JSON documents

docker pull lumai/odinson-extras:latest

See our repository for other tags.

Annotating text using the docker image

docker run \
  --name="odinson-extras" \
  -it \
  --rm \
  -e "HOME=/app" \
  -e "JAVA_OPTS=-Dodinson.extra.processorType=CluProcessor" \
  -v "/path/to/data/odinson:/app/data/odinson" \
  --entrypoint "bin/annotate-text" \
  "lumai/odinson-extras:latest"

NOTE: Replace /path/to/data/odinson with the path to the directory containing a directory called text containing the .txt files you want to annotate. Compressed OdinsonDocument JSON will be written to a directory called docs under whatever you use for /path/to/data/odinson.

Indexing documents using the docker image

docker run \
  --name="odinson-extras" \
  -it \
  --rm \
  -e "HOME=/app" \
  -v "/path/to/data/odinson:/app/data/odinson" \
  --entrypoint "bin/index-documents" \
  "lumai/odinson-extras:latest"

NOTE: Replace /path/to/data/odinson with the path to the directory containing docs. The index will be written to a directory called index under whatever you use for /path/to/data/odinson.