AdaptaBrasil/pdf2csv

pdf2csv

Convert table PDF docs to CSV files.

This script uses tabula-py, a Python library that wraps tabula-java.

To run it, you need to install tabula-py as described in the tabula-py documentation. On Windows, I first tried pip install tabula-py, as suggested there, but it didn't work. I then tried pip install tabula-py[jpype], which worked nicely.

Then run the script with:

python pdf2csv --input_folder <path to pdf files>

It processes every PDF file in the input folder and, for each one, generates a new file with the same name and a csv extension.
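The behaviour described above can be sketched roughly as follows. This is not the repository's actual code, only a minimal illustration of how a folder of PDFs might be converted with tabula-py's convert_into function. The helper names find_pdfs, csv_path_for and convert_folder are hypothetical, and writing each CSV next to its PDF is an assumption.

```python
from pathlib import Path


def find_pdfs(input_folder):
    """Return the PDF files directly inside input_folder, sorted by name."""
    return sorted(
        p for p in Path(input_folder).iterdir()
        if p.is_file() and p.suffix.lower() == ".pdf"
    )


def csv_path_for(pdf_path):
    """Same folder and base name as the PDF, with a .csv extension."""
    return Path(pdf_path).with_suffix(".csv")


def convert_folder(input_folder, convert=None):
    """Convert every PDF in input_folder and return the CSV paths written.

    `convert` is a hypothetical hook taking (pdf_path, csv_path); when it is
    omitted, tabula-py is used (assumed to be installed, with Java available).
    """
    if convert is None:
        import tabula  # imported lazily so the helpers work without tabula-py

        def convert(src, dst):
            # Extract tables from all pages and write them to one CSV file.
            tabula.convert_into(str(src), str(dst),
                                output_format="csv", pages="all")

    outputs = []
    for pdf in find_pdfs(input_folder):
        out = csv_path_for(pdf)
        convert(pdf, out)
        outputs.append(out)
    return outputs
```

Calling convert_folder("path/to/pdfs") would then leave one csv file beside each PDF. Table extraction quality depends on the PDF layout, so the output may need manual checking.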
