anugyaparashar/Machine-Learning-Project


"THIS PROJECT IS DONE UNDER TCS iON MACHINE LEARNING INTERNSHIP"

Machine-Learning-Project

"Automated extraction of handwritten text from images"

Handwriting recognition (HWR), also known as handwritten text recognition (HTR), is the ability of a computer to receive and interpret intelligible handwritten input from sources such as paper documents, photographs, touch screens, and other devices. This project seeks to classify individual handwritten words so that handwritten text can be converted to digital form. The main approach is to classify words directly, using convolutional neural networks (CNNs) with various architectures to train a model that can classify words accurately.


Handwritten text is a very general term, so we narrowed the scope of the project by specifying what it means for our purposes. In this project, we took on the challenge of classifying an image of any handwritten word, whether written in cursive or in block letters.


Description of Project

The objective of this project is to identify handwritten characters using neural networks. This requires constructing a suitable neural network and training it properly. The program should extract the characters one by one and map each to its target output for training. After the images are processed automatically, the training dataset is used to train a “classification engine” for recognition. The code is written in Python with TensorFlow, and the dataset used is the IAM handwriting dataset.
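As an illustration of what "mapping to a target output" can look like, the sketch below turns text labels into integer class ids and one-hot target vectors. It is a minimal example in plain Python, not the project's actual code: the helper names `build_label_index` and `one_hot` and the sample labels are hypothetical.

```python
def build_label_index(labels):
    """Assign each distinct label an integer id (sorted so ids are reproducible)."""
    return {label: i for i, label in enumerate(sorted(set(labels)))}


def one_hot(index, num_classes):
    """Return a list of zeros with a single 1.0 at position `index`."""
    vec = [0.0] * num_classes
    vec[index] = 1.0
    return vec


# Hypothetical labels; real labels would come from the dataset's annotations.
labels = ["the", "of", "the", "and"]
label_index = build_label_index(labels)
targets = [one_hot(label_index[word], len(label_index)) for word in labels]
```

A classifier trained on such targets outputs one score per class, and the predicted label is the class with the highest score.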

To solve this handwritten character recognition (classification) problem, we used Google Colab. The code is divided into the following steps:

  1. Collect handwritten images.
  2. Split the data into a training set (80%) and a test set (20%) for further use.
  3. Apply dimensionality reduction techniques.
  4. Train a classification model on the training set using a neural network.
  5. Validate the model on the test data.
  6. Fine-tune the parameters to increase classification accuracy on the training and test data.
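The 80/20 split in step 2 can be sketched as follows. This is a minimal illustration in plain Python under stated assumptions, not the project's own code: the function name `train_test_split`, the fixed seed, and the placeholder file names are all hypothetical.

```python
import random


def train_test_split(samples, test_fraction=0.2, seed=42):
    """Shuffle a copy of `samples` and split it into (train, test) lists."""
    shuffled = list(samples)
    # A seeded generator makes the split reproducible between runs.
    random.Random(seed).shuffle(shuffled)
    n_test = int(round(len(shuffled) * test_fraction))
    return shuffled[n_test:], shuffled[:n_test]


# Hypothetical (image path, label) pairs standing in for the real dataset.
samples = [(f"word_{i}.png", f"label_{i % 5}") for i in range(100)]
train_set, test_set = train_test_split(samples)
```

Shuffling before splitting avoids a test set made up only of the last files collected, which could otherwise come from a single writer or form.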

After training, the model reaches an accuracy of 0.2522335946559906 (about 25.2%).
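For reference, classification accuracy is the fraction of samples whose predicted label matches the true label, so a value near 0.25 means roughly one prediction in four is correct. The sketch below computes it in plain Python. The function name `accuracy` and the sample labels are hypothetical, and this is not necessarily how the reported figure was computed.

```python
def accuracy(y_true, y_pred):
    """Return the fraction of predictions that equal the true labels."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    if not y_true:
        raise ValueError("cannot compute accuracy of an empty set")
    correct = sum(t == p for t, p in zip(y_true, y_pred))
    return correct / len(y_true)


# Hypothetical labels: one of the four predictions is correct.
y_true = ["the", "of", "and", "to"]
y_pred = ["the", "in", "a", "is"]
score = accuracy(y_true, y_pred)
```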

The images generated while evaluating the accuracy of the model are:
