Skip to content

This is an implementation of YOLOv8 and CRNN network for Scene Text Recognition task

Notifications You must be signed in to change notification settings

minhnhat2001vt/Scence-Text-Recognition-With-YOLOv8-and-CRNN

Repository files navigation

Scence Text Recognition With YOLOv8 and CRNN

Scene Text Recognition (STR) is a problem that applies image processing and character recognition algorithms to identify text appearing in images. A Scene Text Recognition program usually includes two main stages: Text Detection (Detector) and Text Recognition (Recognizer).

The goal of this project is to build a STR model that can takes an image containing text as input, then returns Coordinate location and text in the image as outputs.

Project demo: https://scence-text-recognition-with-yolov8-and-crnn-jyuei7poccent4app.streamlit.app/

1. Dataset

2. Data Preparation

  • Extract data from XML and convert to YOLOv8 format.
  • Perform train test split for training.

3. Evaluation

  • YOLOv8: 0.956 mAP50 on validation set.
  • Plain CRNN: 3.85 Val Loss, 3.87 Test loss.
  • CRNN + Skip Connection: 3.1 Val loss, 3.40 Test loss.
  • CRNN + Resnet: 1.63 Val loss, 1.44 Test loss.

4. References

  • 7th International Conference on Document Analysis and Recognition (ICDAR 2003), 2-Volume Set, 3-6 August 2003, Edinburgh, Scotland, UK. IEEE Computer Society 2003, ISBN 0-7695-1960-1

About

This is an implementation of YOLOv8 and CRNN network for Scene Text Recognition task

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published