forked from UniversalDependencies/UD_Hindi_English-HIENCS
-
Notifications
You must be signed in to change notification settings - Fork 0
/
eval.log
44 lines (44 loc) · 2.35 KB
/
eval.log
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
Running the following version of tools/evaluate_treebank.pl:
commit d71d1b63fd448a8b098d4b5a0329f8da3f7413b0
Author: Dan Zeman <zeman@ufal.mff.cuni.cz>
Date: Sat Apr 21 19:02:58 2018 +0200
Evaluating the following revision of UD_Hindi_English-HIENCS:
commit 2c8f9653dc1ee15f7cfa5de48689089f536c9ecd
Author: Dan Zeman <zeman@ufal.mff.cuni.cz>
Date: Thu Nov 8 14:51:29 2018 +0100
Size: counted 26909 of 26909 words (nodes).
Size: min(0, log((N/1000)**2)) = 6.58492160628321.
Size: maximum value 13.815511 is for 1000000 words or more.
Split: Found more than 10000 training words.
Split: Did not find at least 10000 development words.
Split: Did not find at least 10000 test words.
Lemmas: '_' is the most frequent lemma.
Universal POS tags: 17 out of 17 found in the corpus.
Universal POS tags: source of annotation (from README) factor is 1.
Features: 0 out of 26909 total words have one or more features.
Features: source of annotation (from README) factor is 0.8.
Universal relations: 31 out of 37 found in the corpus.
Universal relations: source of annotation (from README) factor is 0.8.
Udapi: found 6908 bugs.
Udapi: worst expected case (threshold) is one bug per 10 words. There are 26909 words.
Genres: found 1 out of 17 known.
Availability: README does not say Includes text: yes
Availability: '_' is the most frequent form.
validate.py --lang qhe --max-err=10 UD_Hindi_English-HIENCS/qhe_hiencs-ud-dev.conllu
*** PASSED ***
validate.py --lang qhe --max-err=10 UD_Hindi_English-HIENCS/qhe_hiencs-ud-test.conllu
*** PASSED ***
validate.py --lang qhe --max-err=10 UD_Hindi_English-HIENCS/qhe_hiencs-ud-train.conllu
*** PASSED ***
Validity: 1
(weight=0.0769230769230769) * (score{features}=0.01) = 0.000769230769230769
(weight=0.0769230769230769) * (score{genres}=0.0588235294117647) = 0.00452488687782805
(weight=0.0769230769230769) * (score{lemmas}=0.01) = 0.000769230769230769
(weight=0.256410256410256) * (score{size}=0.476632519562383) = 0.122213466554457
(weight=0.0512820512820513) * (score{split}=0.34) = 0.0174358974358974
(weight=0.0769230769230769) * (score{tags}=1) = 0.0769230769230769
(weight=0.307692307692308) * (score{udapi}=0.01) = 0.00307692307692308
(weight=0.0769230769230769) * (score{udeprels}=0.67027027027027) = 0.0515592515592516
(TOTAL score=0.277271963965896) * (availability=0.1) * (validity=1) = 0.0277271963965896
STARS = 0
UD_Hindi_English-HIENCS 0.0277271963965896 0