Skip to content

Latest commit

 

History

History
15 lines (12 loc) · 1.2 KB

tools.md

File metadata and controls

15 lines (12 loc) · 1.2 KB

Tools

Data Programming & Weak Supervision

General ML

  • Cleanlab: A Python library for machine learning with noisy labels. It supports finding label errors in datasets and training models on noisily labeled data.
  • Knodle: A modular Python framework that allows combining different classifiers, weak supervision approaches and denoising methods.

NLP

  • skweak: A Python library to support weak supervision and labeling functions for sequence and text classification.
  • ANEA: A GUI and Python library to annotate NER tags via distant supervision for many languages and entity types.

Data Augmentation

  • Albumentations: A Python library supporting more than 70 types of image augmentation and including support for bounding boxes and keypoints.
  • nlpaug: A Python libraries for data augmentation for text and signal data.
  • TextAttack: A Python library for adversarial attacks and data augmentation for NLP tasks.