Convert tables in PDF documents to CSV files.
This script uses tabula-py, a Python library that wraps tabula-java.
To run it, you need to install tabula-py as described here.
On Windows, I first tried
pip install tabula-py
as suggested in the tabula-py link, but it didn't work. Then I tried
pip install tabula-py[jpype]
and it worked nicely.
Then, to run the script, use:
python pdf2csv --input_folder <path to pdf files>
It will process all PDF files in the input folder, generating for each one a new file with the same name and a .csv extension.
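
As a rough illustration of what this processing could look like (a minimal sketch, not necessarily how pdf2csv is implemented), the loop below walks a folder and calls tabula-py's convert_into for each PDF. The function names csv_path_for and convert_folder, the pages="all" setting, and the optional convert argument are assumptions made for this example.

```python
from pathlib import Path


def csv_path_for(pdf_path):
    """Return the output path: same folder and name, with a .csv extension."""
    return Path(pdf_path).with_suffix(".csv")


def convert_folder(input_folder, convert=None):
    """Convert every *.pdf file in input_folder and return the CSV paths written.

    `convert` is a hypothetical hook taking (source_path, destination_path);
    when omitted, tabula-py is used.
    """
    if convert is None:
        # Imported lazily so the path helpers work without tabula-py installed.
        import tabula

        def convert(src, dst):
            # pages="all" is an assumption; tabula-py reads only page 1 by default.
            tabula.convert_into(src, dst, output_format="csv", pages="all")

    outputs = []
    # Note: the "*.pdf" pattern is case-sensitive on most non-Windows systems.
    for pdf_path in sorted(Path(input_folder).glob("*.pdf")):
        out_path = csv_path_for(pdf_path)
        convert(str(pdf_path), str(out_path))
        outputs.append(out_path)
    return outputs
```

With tabula-py and Java available, calling convert_folder("<path to pdf files>") should write one CSV file next to each PDF in that folder.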