A minimal Docker image for running tabula-java in a container. Based on openjdk:8u312-jre-slim-buster.
This repo uses a Makefile to automate build steps and output fragments. Run make help to view its capabilities.
Images can be found in multiple locations for this repo.
GitHub package images can be found at this repo's GitHub packages.
There is a CD GitHub Action that will automatically build these fragments based on release tags.
So far, they are for the following architectures:
- linux/amd64
- linux/arm64
...and the following versions of tabula-java:
- ghcr.io/othyn/docker-tabula-java:latest
- ghcr.io/othyn/docker-tabula-java:1.0.5
- ghcr.io/othyn/docker-tabula-java:1.0.4
- ghcr.io/othyn/docker-tabula-java:1.0.3
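To pick one of the tags above for a specific architecture, something like the following could be used. This is a minimal sketch: pull_tabula is a hypothetical helper name, not part of this repo, and the defaults are assumptions.

```shell
# Hypothetical helper (not part of this repo): pull a tagged image from GHCR
# for a given platform. Defaults to the latest tag on linux/amd64.
pull_tabula() {
  tag="${1:-latest}"
  platform="${2:-linux/amd64}"
  # --platform selects which architecture variant of the multi-arch image to pull.
  docker pull --platform "$platform" "ghcr.io/othyn/docker-tabula-java:$tag"
}
```

For example, pull_tabula 1.0.5 linux/arm64 would fetch the arm64 variant of the 1.0.5 image.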
Docker hub images can be found at othyn/tabula-java.
So far, they are for the following architectures:
- linux/amd64
- linux/arm64
...and the following versions of tabula-java:
Run make build.
Warning: this will push to Docker Hub.
I haven't found a way to run a separate push per architecture when using buildx, so the build and push for all architectures have to be run as one command.
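As a rough illustration of what a combined multi-architecture build and push looks like with buildx, here is a sketch. The function name, variable names, and tag handling are assumptions for illustration; the actual Makefile target may be written differently.

```shell
# Assumed values; the repo's Makefile may use different names or tags.
IMAGE="othyn/tabula-java"
PLATFORMS="linux/amd64,linux/arm64"

# Hypothetical helper: build every platform and push in a single buildx call.
build_and_push() {
  tag="${1:-latest}"
  # One invocation builds all listed architectures and pushes them
  # together under a single multi-arch tag.
  docker buildx build --platform "$PLATFORMS" --tag "$IMAGE:$tag" --push .
}
```

Calling build_and_push 1.0.5 from the directory containing the Dockerfile would build both architectures and push them under the 1.0.5 tag.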
Mount the PDF file you wish to convert as a volume against the container by using the -v option on the docker run command. As an example:
docker run --rm -v /full/path/to/my/file.pdf:/data.pdf othyn/tabula-java [tabula options] /data.pdf > output.csv
Further command-line arguments can be found in the tabulapdf/tabula-java documentation.
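Because the -v option needs the full host path, a small wrapper can save some typing. The following is a sketch only: tabula_pdf is a hypothetical helper name, and it assumes the image passes its arguments straight to tabula-java, as in the example above.

```shell
# Hypothetical helper (not part of this repo): run tabula-java from the
# image against a local PDF. Usage: tabula_pdf <pdf> [tabula options...]
tabula_pdf() {
  pdf="$1"
  shift
  # docker -v needs an absolute host path, so resolve the directory first.
  abs="$(cd "$(dirname "$pdf")" && pwd)/$(basename "$pdf")"
  # Remaining arguments are passed through as tabula options.
  docker run --rm -v "$abs:/data.pdf" othyn/tabula-java "$@" /data.pdf
}
```

For example, tabula_pdf ./report.pdf --pages all --format CSV > output.csv would extract tables from every page of report.pdf as CSV.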
Run make test to convert the test PDF at convert/test.pdf. This lets you check that the image is working correctly.
It should output the contents of the table to your terminal.