ocr-table

This project aims to extract tables from scanned (image-only) PDFs using Optical Character Recognition (OCR).

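Install Requirements

Install the Python dependencies listed in requirements.txt, for example with `pip install -r requirements.txt`.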

Clear the pdf/ folder and copy all the PDF files you want to scan into it.
Run the OCR:
```
python3 shellocr.py
```
The extracted text files will be available in the txt/ folder once the process completes.
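
To confirm that a run covered every input, a small check along the following lines may help. It is only a sketch: the helper name `find_missing_outputs` is hypothetical, and it assumes each PDF in pdf/ produces a text file with the same base name and a .txt extension in txt/, which this README does not state. Adjust the naming rule if shellocr.py writes its output differently.

```python
from pathlib import Path


def find_missing_outputs(pdf_dir="pdf", txt_dir="txt"):
    """Return the names of PDFs in pdf_dir with no matching .txt file in txt_dir."""
    pdf_dir, txt_dir = Path(pdf_dir), Path(txt_dir)
    missing = []
    for pdf_path in sorted(pdf_dir.glob("*.pdf")):
        # Assumption: the output for "report.pdf" is "report.txt".
        expected = txt_dir / (pdf_path.stem + ".txt")
        if not expected.is_file():
            missing.append(pdf_path.name)
    return missing


if __name__ == "__main__":
    missing = find_missing_outputs()
    if missing:
        print("No text output found for:", ", ".join(missing))
    else:
        print("Every PDF in pdf/ has a matching file in txt/.")
```

Run it from the repository root after the OCR step finishes. Any file it lists was either skipped or written under a different name.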
