python run_experiment.py -p configs/lstmcrf_lemmas.yml
bash run_experiment_job.sh "./experiment_jobs/lstm" 3 3
python run_test_segmentation.py --model_name LstmTagger --train_type forms
Model | Lemmas | Lemmas Low Resource | Forms | Forms Low Resource | Forms Shuffled | Forms Shuffled Low Resource |
---|---|---|---|---|---|---|
Random* | 0 | 0 | 0 | 0 | 0 | 0 |
Baseline | 1 | 1 | 1 | 1 | 1 | 1 |
Baseline-CRF | 1 | 1 | 1 | 1 | 1 | 1 |
LSTM | 3 | 3 | 3 | 3 | 3 | 3 |
LSTM-CRF | 3 | 3 | 3 | 3 | 3 | 3 |
CNN | 3 | 3 | 3 | 3 | 3 | 3 |
CNN-CRF | 3 | 3 | 3 | 3 | 3 | 3 |
Transformer | 3 | 3 | 3 | 3 | 3 | 3 |
Transformer-CRF | 3 | 3 | 3 | 3 | 3 | 3 |
All models are trained for up to 100 epochs, with early stopping if the ratio of correctly predicted examples does not improve for 10 consecutive epochs.

\* The Random model is not trained.
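The stopping rule described above can be sketched roughly as follows. This is a minimal illustration, not the repository's training loop: the names `exact_match_ratio`, `train_with_early_stopping`, `train_epoch`, and `evaluate` are hypothetical, and the sketch assumes that "ratio of correctly predicted examples" means the share of examples whose whole predicted sequence matches the gold one.

```python
from typing import Callable, Sequence, Tuple


def exact_match_ratio(
    predicted: Sequence[Sequence[str]], gold: Sequence[Sequence[str]]
) -> float:
    """Fraction of examples whose full predicted sequence equals the gold one.

    Assumption: this is what the README calls the
    "ratio of correctly predicted examples".
    """
    if not gold:
        return 0.0
    correct = sum(1 for p, g in zip(predicted, gold) if list(p) == list(g))
    return correct / len(gold)


def train_with_early_stopping(
    train_epoch: Callable[[int], None],  # hypothetical: runs one training epoch
    evaluate: Callable[[], float],       # hypothetical: returns the validation ratio
    max_epochs: int = 100,
    patience: int = 10,
) -> Tuple[int, float, int]:
    """Train for at most `max_epochs`, stopping after `patience` epochs
    in a row without a new best validation score.

    Returns (best_epoch, best_score, last_epoch_run).
    """
    best_score = float("-inf")
    best_epoch = 0
    last_epoch = 0
    epochs_without_improvement = 0

    for epoch in range(1, max_epochs + 1):
        last_epoch = epoch
        train_epoch(epoch)
        score = evaluate()

        if score > best_score:
            # New best: remember it and reset the patience counter.
            best_score, best_epoch = score, epoch
            epochs_without_improvement = 0
        else:
            epochs_without_improvement += 1
            if epochs_without_improvement >= patience:
                break

    return best_epoch, best_score, last_epoch
```

With the defaults, a run whose validation ratio last improves at epoch 5 stops after epoch 15. A real loop would typically also save a checkpoint whenever a new best score is reached.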
Model | Lemmas | Lemmas Low Resource | Forms | Forms Low Resource | Forms Shuffled | Forms Shuffled Low Resource |
---|---|---|---|---|---|---|
Random | 0.000069 | — | 0.000079 | — | 0.000178 | — |
Baseline | 0.0032 | 0.003 | 0.0018 | 0.0017 | 0.0019 | 0.0019 |
Baseline-CRF | 0.18 | 0.1553 | 0.1858 | 0.1652 | 0.1853 | 0.1907 |
LSTM | 0.898 | 0.6312 | 0.7567 | 0.4962 | 0.9936 | 0.777 |
LSTM-CRF | 0.9059 | 0.6464 | 0.7668 | 0.5047 | 0.9938 | 0.7874 |
CNN | 0.7285 | 0.4168 | 0.5588 | 0.3068 | 0.8816 | 0.5555 |
CNN-CRF | 0.4375 | 0.4062 | 0.377 | 0.2981 | 0.1741 | 0.4544 |
Transformer | 0.8722 | 0.5558 | 0.7267 | 0.4421 | 0.9965 | 0.7475 |
Transformer-CRF | 0.88 | 0.57 | 0.75 | 0.4546 | 0.9969 | 0.7539 |
Low-resource settings use the same validation and test data as their full-data counterparts.
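One way to build such a setting is sketched below. The README does not say how the low-resource training sets are constructed, so the random subsampling, the `train_fraction` parameter, the `seed`, and the function name `make_low_resource_splits` are all assumptions made for illustration. The only property taken from the text is that the validation and test data are left untouched.

```python
import random
from typing import Dict, List, Sequence, TypeVar

T = TypeVar("T")


def make_low_resource_splits(
    train: Sequence[T],
    valid: Sequence[T],
    test: Sequence[T],
    train_fraction: float,  # hypothetical: share of training examples to keep
    seed: int = 0,          # hypothetical: fixed seed for a reproducible subsample
) -> Dict[str, List[T]]:
    """Shrink only the training split; validation and test stay identical."""
    if not 0.0 < train_fraction <= 1.0:
        raise ValueError("train_fraction must be in (0, 1]")

    rng = random.Random(seed)
    # Keep at least one example, but never more than are available.
    keep = min(len(train), max(1, int(len(train) * train_fraction)))

    return {
        "train": rng.sample(list(train), keep),
        "valid": list(valid),  # unchanged, so scores stay comparable
        "test": list(test),    # unchanged, so scores stay comparable
    }
```

Keeping the evaluation splits fixed is what makes the low-resource columns directly comparable with the full-data columns in the table above.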