This is the official codebase for the implementation of Deep Ordinal Regression with Label Diversity, presented at ICPR2020.
Code author: Axel Berg
- Matlab 2020a with the Deep Learning Toolbox
- Cuda 10.1
Download the aligned and cropped images from https://susanqq.github.io/UTKFace/ and run age/data/createCroppedUtkDataset.m
after setting the correct path to the dataset in the script.
This will create datastore objects for reading the images.
The train/test split is the same as the one used in "Coral-CNN" and can be found here: https://github.com/Raschka-research-group/coral-cnn
We use protocol 2 as described in the FSA-Net paper, which uses a 70-30 train/test split. You can prepare the dataset yourself using the original code from https://github.com/shamangary/FSA-Net and store the train/test splits as .mat
-files.
Download the dataset from http://graphics.cs.cmu.edu/projects/historicalColor/ and add it set the datapath variable in date/dateParameters.m
to point to it.
For each dataset, run the iteration scripts to train the ResNet50 backbone for 10 iterations and output save the mean average error results on the test set in a text file. All the hyperparameters are the same for each method and are defined in a single function.
The following loss functions are supported:
- L2 - use the standard L2 (mean squares error) loss. This is the regression baseline
- CE - use the cross entropy loss. This is the RvC baseline.
- Equal Bins - use the equal bins label diversity method
- Random Bins - use the random bins label diversity methods.
- Multiple bins (for the HCI dataset) - uses the multiple bins labels diversity method specified in the paper
If you find this work useful, please cite our paper:
@inproceedings{berg2021deep,
title={Deep ordinal regression with label diversity},
author={Berg, Axel and Oskarsson, Magnus and O'Connor, Mark},
booktitle={2020 25th International Conference on Pattern Recognition (ICPR)},
pages={2740--2747},
year={2021},
organization={IEEE}
}
W. Cao, V. Mirjalili, and S. Raschka (2019). Consistent rank logits for ordinal regression with convolutional neural networks. arXiv preprint arXiv:1901.07884
T.-Y. Yang, Y.-T. Chen, Y.-Y. Lin, and Y.-Y. Chuang, (2019) Fsa-net: Learning fine-grained structure aggregation for head pose estimation from a single image