Skip to content

Latest commit

 

History

History

docs

MDTK Documentation

This directory contains documentation with examples for the toolkit. This includes:

  1. The ACME dataset
    • an introduction and
    • a description of the dataset used in our paper
  2. Dataset creation
    • how to create your own ACME datasets
  3. Degradation functions
    • an introduction to the available functions, and
    • their parameters
  4. Data parsers and the degrader class
    • How to parse data and provide it to, for example, pytorch models
    • How to augment a dataset with degradations on-the-fly
  5. Matching errors with your AMT system
    • How to generate data which matches the output of your AMT system
  6. Reproducing results from the paper
    • script to perform training & evaluation to reproduce paper results provided

Augmenting your AMT system

If you are interested in augmenting the data to train a model which cleans the output of your AMT system:

  1. Read how to match the errors of your AMT system, then either
    1. how to create a dataset with parameters to match your AMT system's errors, or
    2. degrade data on the fly
  2. Train a model on the degraded data to fix the errors