A curated list of awesome datasets for papers/experiments/validation.
Datasets for classification.
- KEEL - General - General classification datasets.
- KEEL - Missing-values - Missing values datasets.
- KEEL - Imbalanced datasets - Imbalanced datasets for classification.
- KEEL - Multi-label - Multi-label datasets.
- KEEL - Class noise - Datasets with class noise.
- KEEL - Attribute noise - Datasets with attribute noise.
Datasets for semi-supervised applications.
- KEEL - semi-supervised - Datasets for semi-supervised experiments.
- KEEL - semi-supervised - Datasets for semi-supervised experiments.
Datasets for regression applications.
- KEEL - regression - Datasets for regression experiments.
Datasets for time-series problems.
- KEEL - time-series - Datasets for time-series experiments.
Face Recognition datasets.
- JAFFE - The Japanese Female Facial Expression (JAFFE) Database.
- Carnegie Mellon - Datasets from theo-8 projects at Carnegie Mellon University.
- Yale Face Database - Datasets for facial expression (happy, sad, angry...) recognition.
- Cohn-Kanade - The Cohn-Kanade AU-Coded Facial Expression Database is for research in automatic facial image analysis and synthesis and for perceptual studies.
- AR face Database - Different facial expressions, illumination conditions and occlusions.
- Face Detection CBCL - Face Detection Data from MIT.
- Face Recognition LFW - Face Recognition from UMASS.
- Face Recognition ORL - Face Recognition from AT&T.
Image Processing.
- Microsoft - Salient Object Database - MSRA Salient Object Database.
- IVRG - Salient Object Database - Frequency-tuned Salient Region Detection.
- ICDAR - Robust Reading - Robust Reading Competition.
- Brodatz - Texture Recognition - Texture Recognition.
- Vistex - Texture Recognition - Texture Recognition.
- Caltech - Object Categorization - Object Categorization from Caltech101.
- Marcel - Gesture Recognition - Gesture Recognition from Marcel.
- RPPDI - Gesture Recognition - Gesture Recognition from RPPDI.
Handwriting Recognition
- MNIST - Database of Handwritten Digits - THE MNIST DATABASE of handwritten digits.
Text Classification
- 20 Newsgroups - The 20 newsgroups text dataset.
- Reuters-21578 - Reuters-21578 Text Categorization Collection Data Set