GitHub - Techget/Text-Recognition

Optical character recognition (OCR) engine built on the stroke width transform (SWT), maximally stable extremal regions (MSER), convolutional neural networks (CNN), and various morphological operations.
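The description does not say which morphological operations are used or where they sit in the pipeline. As a general illustration of the idea only, the sketch below implements binary dilation, which grows foreground pixels so that nearby components (for example, adjacent character strokes) merge into one region. The function name `dilate` and the `radius` parameter are hypothetical and are not taken from this repository.

```python
def dilate(grid, radius=1):
    """Binary dilation with a square (2*radius+1) structuring element.

    `grid` is a list of rows of 0/1 values. Hypothetical helper for
    illustration; not part of this repository.
    """
    if not grid or not grid[0]:
        return []
    rows, cols = len(grid), len(grid[0])
    out = [[0] * cols for _ in range(rows)]
    for r in range(rows):
        for c in range(cols):
            if not grid[r][c]:
                continue
            # Switch on every pixel within `radius` of a foreground pixel,
            # clipping the window at the image border.
            for rr in range(max(0, r - radius), min(rows, r + radius + 1)):
                for cc in range(max(0, c - radius), min(cols, c + radius + 1)):
                    out[rr][cc] = 1
    return out


# Two foreground pixels separated by a one-pixel gap become one connected run.
merged = dilate([[0, 1, 0, 1, 0, 0, 0]])
```

With opencv-python installed, the same effect would normally come from `cv2.dilate` with a rectangular kernel rather than a hand-written loop.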

Dependencies required are

numpy==1.14.3, opencv-python==3.4.0.12, Pillow==5.1.0, pypillowfight==0.2.4, autocorrect, spellchecker, editdistance, tensorflow

Usage

To run the OCR engine, use the following command:

python3 main.py demo.png demoGroundTruthText.txt

Running this produces result.txt, which contains the extracted text along with the coordinates, width, and height of each text region. It also produces textBlockdemo.png, a copy of the input image with bounding boxes marking the regions that were extracted. To compare the results with pytesseract, run:

python3 comparePytesseract.py demo.png demoGroundTruthText.txt
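How comparePytesseract.py scores the two engines against the ground-truth file is not documented here. Since editdistance is listed among the dependencies, one plausible approach is a normalized edit-distance score, sketched below in plain Python. The function names `levenshtein` and `similarity` are hypothetical, and the scoring formula is an assumption rather than the repository's actual metric.

```python
def levenshtein(a: str, b: str) -> int:
    """Number of single-character insertions, deletions, or substitutions
    needed to turn `a` into `b` (row-by-row dynamic programming)."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # delete ca
                            curr[j - 1] + 1,      # insert cb
                            prev[j - 1] + cost))  # substitute or match
        prev = curr
    return prev[-1]


def similarity(predicted: str, truth: str) -> float:
    """Assumed score: 1.0 for an exact match, 0.0 when nothing matches."""
    longest = max(len(predicted), len(truth))
    if longest == 0:
        return 1.0
    return 1.0 - levenshtein(predicted, truth) / longest


# Hypothetical OCR output compared with a ground-truth string.
score = similarity("Text Recogmtion", "Text Recognition")
```

The same score can be computed for each engine's output against demoGroundTruthText.txt, so that the two numbers are directly comparable.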

MISC

For a better understanding of the approach, have a look at the report.

The CNN model is trained on the Chars74K dataset.

