Skip to content

In this project, text is extracted from an image and is converted into text format. This project only works where text is in Capital Letters.

License

Notifications You must be signed in to change notification settings

DarylFernandes99/Optical-Character-Recognition

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

18 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Optical-Character-Recognition

In this project, text is extracted from an image and is converted into text format. It works on both typed and handwritten text, but only works where text is in Block Letters.

Dataset Used:

Extended MNIST (Letters) :- emnist-byclass-train.csv, emnist-byclass-test.csv

Kaggle Link for Dataset:

https://www.kaggle.com/crawford/emnist

Link to Published Paper:

https://www.irjet.net/archives/V7/i5/IRJET-V7I5964.pdf

System Configuration

  • Processor: Intel i7 9th gen
  • GPU: Nvidia RTX 2060 Mobile
  • RAM: 16GB

Softwares used for this project

  1. Python 3.7
  2. Tensorflow 2.2.1

Steps to run the project

**Change the File paths in the codes to their respective paths before execution.**
  1. Run csv_to_image(emnist).py to convert from csv to image format.
  2. Run letters(emnist).py to train the CNN model on EMNIST dataset.
  3. Run bounding box.py to predict to extract text from image.

CNN Architecture used

CNN layers

About

In this project, text is extracted from an image and is converted into text format. This project only works where text is in Capital Letters.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages