Skip to content

Latest commit

 

History

History
32 lines (25 loc) · 1.15 KB

README.md

File metadata and controls

32 lines (25 loc) · 1.15 KB

Optical-Character-Recognition

In this project, text is extracted from an image and is converted into text format. It works on both typed and handwritten text, but only works where text is in Block Letters.

Dataset Used:

Extended MNIST (Letters) :- emnist-byclass-train.csv, emnist-byclass-test.csv

Kaggle Link for Dataset:

https://www.kaggle.com/crawford/emnist

Link to Published Paper:

https://www.irjet.net/archives/V7/i5/IRJET-V7I5964.pdf

System Configuration

  • Processor: Intel i7 9th gen
  • GPU: Nvidia RTX 2060 Mobile
  • RAM: 16GB

Softwares used for this project

  1. Python 3.7
  2. Tensorflow 2.2.1

Steps to run the project

**Change the File paths in the codes to their respective paths before execution.**
  1. Run csv_to_image(emnist).py to convert from csv to image format.
  2. Run letters(emnist).py to train the CNN model on EMNIST dataset.
  3. Run bounding box.py to predict to extract text from image.

CNN Architecture used

CNN layers