- 08/03/24 : A new abstract is available to the public, introducing a novel approach to glaucoma detection: Assessment of Retinal Vasculature for Glaucoma Detection: A Comparative Analysis of Human Expertise and Deep Learning Algorithms.
- 05/15/24 : The EyePACS-light-V2 code folder on Kaggle has a new PyTorch template for quick and easy glaucoma detection setup. This model uses the lightweight MobileNetV3 and achieved a test accuracy of 92.6%: https://www.kaggle.com/code/deathtrooper/pytorch-easy-setup-for-glaucoma-detection-92-6
- 03/09/24 : EyePACS-light-V2 now has a 10.0 Kaggle usability score: supplemental metadata.csv file added to dataset.
- 01/20/24 : EyePACS-light-V2 preprocess high-level overview is now available on the Kaggle dataset link in the about section (scroll all the way down) if you are curious on how the dataset was derived
- 12/28/23 : EyePACS-light-V2 94.94% test accuracy benchmark using ConvNeXtTiny: https://www.kaggle.com/code/deathtrooper/benchmark-94-94-convnexttiny
- 12/12/23 : EyePACS-light-V2 is HERE!!! Be the first to benchmark your model with this improved dataset! Download from kaggle: https://www.kaggle.com/datasets/deathtrooper/glaucoma-dataset-eyepacs-airogs-light-v2/data
If you found this catalog helpful, please consider citing the following:
- Riley Kiefer, Muhammad Abid, Jessica Steen, Mahsa Raeisi Ardali, and Ehsan Amjadian. 2023. A Catalog of Public Glaucoma Datasets for Machine Learning Applications: A detailed description and analysis of public glaucoma datasets available to machine learning engineers tackling glaucoma-related problems using retinal fundus images and OCT images. In Proceedings of the 2023 7th International Conference on Information System and Data Mining (ICISDM '23). Association for Computing Machinery, New York, NY, USA, 24–31. https://doi.org/10.1145/3603765.3603779
[Help expand this repository by providing links/publications to new glaucoma datasets!]
- README.md : Glaucoma overview, relevant research, and dataset access links
- benchmark-eyepacs-airogs-light.md : Leaderboard for the test set evaluation using the train/val sets of the EyePACS-AIROGS-light dataset
- summary.md : Dataset class breakdown, image types, and glaucoma types
- data-availability.md : Dataset image and segmentation availability.
- origin.md : Dataset collection origin and collection years.
According to AAO, "Glaucoma is a disease that damages your eye’s optic nerve. It usually happens when fluid builds up in the front part of your eye. That extra fluid increases the pressure in your eye, damaging the optic nerve". It is a leading cause of blindness and worsens over time if left untreated. Optometrists diagnose glaucoma through fundus images (a 2D image of the eye) or ocular coherence tomography (OCT) images (a 3D image of the eye). In fundus images, optometrists typically look for optic cup or optic disc damage. In OCT images, optometrists typically look for layer atrophy. To automate the detection of glaucoma, datasets are curated for machine learning. Fundus image datasets are typically designed for either glaucoma classification to distinguish healthy for glaucoma or optic nerve head segmentation to extract and analyze cup or disc damage. High-quality benchmark datasets like EyePACS-AIROGS-light have predetermined train, validation, and test sets for reproducibility.
- SMDG-19 [Dataset], "Standardized Multi-Channel Dataset for Glaucoma, Version 19", https://www.kaggle.com/datasets/deathtrooper/multichannel-glaucoma-benchmark-dataset
- EyePACS-AIROGS-light-v1 [Dataset], "EyePACS Artificial Intelligence for Robust Glaucoma Screening Challenge, Lightweight Version 1", https://www.kaggle.com/datasets/deathtrooper/eyepacs-airogs-light
- EyePACS-AIROGS-light-v2 [Dataset], "EyePACS Artificial Intelligence for Robust Glaucoma Screening Challenge, Lightweight Version 2", https://www.kaggle.com/datasets/deathtrooper/glaucoma-dataset-eyepacs-airogs-light-v2
- ARVO 2024 [Abstract], "Assessment of Retinal Vasculature for Glaucoma Detection: A Comparative Analysis of Human Expertise and Deep Learning Algorithms", https://iovs.arvojournals.org/article.aspx?articleid=2794846
- AAO 2022 [Abstract], "A Comprehensive Survey of Publicly Available Glaucoma Datasets for Automated Glaucoma Detection", https://aaopt.org/past-meeting-abstract-archives/?SortBy=ArticleYear&ArticleType=&ArticleYear=2022&Title=&Abstract=&Authors=&Affiliation=&PROGRAMNUMBER=225129
- AAO 2023 [Abstract], "The Predictive Power of Fundus Blood Vessels in Glaucoma Detection", accepted as a presentation for AAO 2023
- AAO 2023 [Abstract], "EyePACS-light: A Lightweight Balanced Dataset for Automated Glaucoma Classification Modeling", accepted as a poster for AAO 2023
- ARVO 2023 [Abstract], "Ground truth validation of publicly available datasets utilized in artificial intelligence models for glaucoma detection", https://iovs.arvojournals.org/article.aspx?articleid=2791017
- ARVO 2023 [Abstract], "Standardized and Open-Access Glaucoma Dataset for Artificial Intelligence Applications", https://iovs.arvojournals.org/article.aspx?articleid=2790420
- IEEE-IEMCOM 2022 [Full Paper], "A Survey of Glaucoma Detection Algorithms using Fundus and OCT Images", https://ieeexplore.ieee.org/abstract/document/9946629
- IEEE-ICIVC 2023 [Full Paper], "Automated Fundus Image Standardization Using a Dynamic Global Foreground Threshold Algorithm", https://ieeexplore.ieee.org/abstract/document/10270429
- ICISDM 2023 [Full Paper], "A Catalog of Public Glaucoma Datasets for Machine Learning Applications", https://dl.acm.org/doi/abs/10.1145/3603765.3603779
Drishti-GS | G1020 | ORIGA-light | REFUGE1-VAL | PAPILA |
---|---|---|---|---|
- Classification
- BGC = Binary Glaucoma Classification (healthy vs. glaucoma or non-glaucoma vs. glaucoma)
- MGC = Multi Glaucoma Classification (at least 2 glaucoma types, including suspect)
- Segmentation
- ODS = Optic Disc Segmentation
- OCS = Optic Cup Segmentation
- BVS = Blood Vessel Segmentation
- OLS = OCT Layer Segmentation
- RNFLS = Retinal Nerve Fiber Layer Segmentation
- Other
- LT = Localization Task
- IQA = Image Quality Assesment
- MIDI = Multi Image Domain Input
- CDR = Cup-to-Disc Ratio Estimation
- N = Notching
- VF = Visual Field Information/Segmentation