GitHub - AhmedImtiazPrio/ICDAR2021supplementary: Supplementary Material and Appendix

This repo contains code to extend/replicate the dataset present in the Kaggle Bengali.AI Handwritten Grapheme Classification. For the dataset, codes, discussions and leaderboards, visit the Kaggle competition page.

The paper with added appendix can be found here.

Common Handwritten Graphemes in Context

Project Structure

.
- data
   -- scanned
   -- extracted
   -- error
   -- packed
- codes
- collection
   -- A4
   -- Letter
- logs

Basic Usage

Run python ./data/extracted/purge.py to clear extraction folders
Download and extract batch of scanned file .jpgs to ./data/scanned/<batchname>
cd ./data/scanned and run python transcribeGui.py <batchname>
After Roll/ID are transcribed execute extract.m on MATLAB. Specify source folder before executing. Replace surfAlignGPU() with surfAlign in the absence of GPU support. Set disp=true for ocrForm(), surfAlign(), surfAlignGPU() to validate extraction performance. For surfAlign() set nonrigid=true.
cd ./data/error and check for extraction failures.
cd ./data/extracted and check for label errors in sub-folders.
Run python pack.py which will create separate folders for each extracted <batchname> inside ./data/packed.
cd ./data/packed/ and run python labelXGui.py <batchname>. Select overwriting and empty blobs to be discarded and Ctrl+S to save. After you are done going through all of the packets, click the transfer button to remove errors from the packaged folder.

Dependencies

MATLAB 2017b or higher
MATLAB Computer Vision Toolbox
Python 3.6.3 or higher
Pillow == 4.2.1

Documentation

Kaggle competition page www.kaggle.com/c/bengaliai-cv19
Dataset introduction COCO-Grapheme

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
codes		codes
collection		collection
data		data
logs		logs
ICDAR21+Appendix.pdf		ICDAR21+Appendix.pdf
LICENSE		LICENSE
README.md		README.md
favicon.ico		favicon.ico
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Common Handwritten Graphemes in Context

Project Structure

Basic Usage

Dependencies

Documentation

About

Releases

Packages

Languages

License

AhmedImtiazPrio/ICDAR2021supplementary

Folders and files

Latest commit

History

Repository files navigation

Common Handwritten Graphemes in Context

Project Structure

Basic Usage

Dependencies

Documentation

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages