This repository contains the IIIT5K dataset. The original dataset shared by IIIT is in matlab format. In this repository, we have converted the dataset to readable .csv
and coco
format for easy loading into python
codes.
- Cropped word images split into training and test sets
- Ground truth annotation, small and medium sized lexicons
- Lexicon with 0.5 million words (from Weinman et al. 2009)
- Character bounding box level annotations
The lexicon used to compute language priors is in the file sample/og_labels/lexicon.txt
. This lexicon was provided by Weinman et al. 2009. The cited article should be cited when using this lexicon.
img_1 | img_2 | img_3 | img_4 | |||
---|---|---|---|---|---|---|
img_1 | img_2 | |
---|---|---|
sample/
: contains sample dataset structure to help understand what you're downloadingimages/
: images folder with train/test splitlabels/
: labels folder with train/test split in coco formatog_labels/
: original label files shared by the authors in csv format.lexicon.txt
testCharBound.csv
testdata.csv
trainCharBound.csv
traindata.csv
test.txt
: list of test image files (coco format)train.txt
: list of train image files (coco format)
- Clone this repo:
git clone https://github.com/adumrewal/iiit-5k-word-coco-dataset.git
- Setup git-lfs
sudo apt-get install git-lfs
orbrew install git-lfs
git lfs install
(inside the cloned repo)
git lfs pull
(pulls the.zip
file onto your system)unzip IIIT5K_coco.zip -d .
- Thanks to IIIT5K for open-sourcing the dataset.
- Incase you need the script to convert from
csv
tococo
format, please feel free to reach out. - If you have any comments/suggestions, please feel free to drop an e-mail or raise an issue in this repo.
- If you like what I've provided here, it would be great if you could star this repo.
Please mention the following citation if you plan on using this dataset. More details can be found on original dataset webpage.
@InProceedings{MishraBMVC12,
author = "Mishra, A. and Alahari, K. and Jawahar, C.~V.",
title = "Scene Text Recognition using Higher Order Language Priors",
booktitle= "BMVC",
year = "2012"
}
@article{Weinman09,
author = {Jerod J. Weinman and Erik Learned-Miller and Allen Hanson},
title = {Scene Text Recognition using Similarity and a Lexicon with Sparse Belief Propagation},
journal= {IEEE Trans. Pattern Analysis and Machine Intelligence},
volume = {31},
number = {10},
pages = {1733--1746},
month = {Oct},
year = {2009}
}