- Cleanlab: A Python library for machine learning with noisy labels. It supports finding label errors in datasets and training models on noisily labeled data.
- Knodle: A modular Python framework that allows combining different classifiers, weak supervision approaches and denoising methods.
- skweak: A Python library to support weak supervision and labeling functions for sequence and text classification.
- ANEA: A GUI and Python library to annotate NER tags via distant supervision for many languages and entity types.
- Albumentations: A Python library supporting more than 70 types of image augmentation and including support for bounding boxes and keypoints.
- nlpaug: A Python libraries for data augmentation for text and signal data.
- TextAttack: A Python library for adversarial attacks and data augmentation for NLP tasks.