-
Notifications
You must be signed in to change notification settings - Fork 8
Datasets
Christian Clausner edited this page Nov 21, 2017
·
2 revisions
A (probably incomplete) list of datasets / collections with ground truth in PAGE XML format:
- IMPACT Digitisation - Dataset part of the IMPACT Centre of Competence in Digitisation with more than half a million representative text-based images
- Layout Analysis Dataset - A realistic contemporary document dataset
- Europeana Newspapers Project Dataset - Point of reference for all activities related to evaluation within the scope of the Europeana Newspapers project
- REID2017 Competition - Competition on Recognition of Early Indian printed Documents
- More datasets