| Running the following version of UD tools: | |
| commit e9726a6a7d6913193d90edb45a4cb549235c5b16 | |
| Author: Dan Zeman <zeman@ufal.mff.cuni.cz> | |
| Date: Sat Nov 4 17:10:55 2023 +0100 | |
| Evaluating the following revision of UD_Tamil-TTB: | |
| commit c1739c0397fd034200edaf4e403c2e4c9923dd75 | |
| Merge: 1392fa0 fbea79c | |
| Author: Dan Zeman <zeman@ufal.mff.cuni.cz> | |
| Size: counted 9581 of 9581 words (nodes). | |
| Size: max(0, log((N/1000)**2)) = 4.51956394133747. | |
| Size: maximum value 13.815511 is for 1000000 words or more. | |
| Split: Did not find more than 10000 training words. | |
| Split: Did not find at least 10000 development words. | |
| Split: Did not find at least 10000 test words. | |
| Lemmas: source of annotation (from README) factor is 0.8. | |
| Universal POS tags: 14 out of 17 found in the corpus. | |
| Universal POS tags: source of annotation (from README) factor is 0.8. | |
| Features: 8280 out of 9581 total words have one or more features. | |
| Features: source of annotation (from README) factor is 0.8. | |
| Universal relations: 25 out of 37 found in the corpus. | |
| Universal relations: source of annotation (from README) factor is 0.8. | |
| Udapi: | |
| TOTAL 205 | |
| Udapi: found 205 bugs. | |
| Udapi: worst expected case (threshold) is one bug per 10 words. There are 9581 words. | |
| Genres: found 1 out of 17 known. | |
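
The component scores that appear in the breakdown at the end of this log can be reproduced from the counts reported above. The following is a minimal sketch, not the evaluation script itself: the formulas are inferred from the numbers in this log (they are assumptions about how the tool combines them), and all variable names are hypothetical.

```python
import math

# Counts copied from the log above.
words = 9581
annotation_factor = 0.8          # "source of annotation (from README) factor"
bugs = 205                       # Udapi TOTAL

# Size: log((N/1000)**2), floored at 0 and normalized by the value for 1,000,000 words.
size_raw = max(0.0, math.log((words / 1000) ** 2))
size_max = math.log((1_000_000 / 1000) ** 2)     # 13.815511
size_score = min(1.0, size_raw / size_max)

# Tags and relations: fraction of the universal inventory found, times the factor
# (assumed formula; it matches the reported scores).
tags_score = 14 / 17 * annotation_factor
udeprels_score = 25 / 37 * annotation_factor

# Genres: fraction of known genres found.
genres_score = 1 / 17

# Udapi: the worst expected case is one bug per 10 words, so the score is assumed
# to fall linearly from 1 (no bugs) to 0 (words/10 bugs or more).
udapi_score = max(0.0, 1 - bugs / (words / 10))

for name, value in [("size", size_score), ("tags", tags_score),
                    ("udeprels", udeprels_score), ("genres", genres_score),
                    ("udapi", udapi_score)]:
    print(f"score{{{name}}} = {value}")
```

Under these assumptions the printed values agree with the `score{...}` entries in the final breakdown to within floating-point rounding.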
| /net/work/people/zeman/unidep/tools/validate.py --lang ta --max-err=10 UD_Tamil-TTB/ta_ttb-ud-dev.conllu | |
| [Line 9 Sent dev-s1]: [L4 Enhanced unknown-edeprel] Unknown enhanced relation type 'obl:dat' in '11:obl:dat' | |
| The following 63 enhanced relations are currently permitted in language [ta]: | |
| acl, acl:relcl, advcl, advcl:cond, advmod, advmod:emph, advmod:lmod, amod, appos, aux, aux:neg, aux:pass, case, cc, ccomp, clf, compound, compound:lvc, compound:prt, compound:redup, compound:svc, conj, cop, csubj, csubj:xsubj, dep, det, discourse, dislocated, expl, fixed, flat, flat:name, goeswith, iobj, list, mark, nmod, nmod:poss, nsubj, nsubj:nc, nsubj:nc:xsubj, nsubj:pass, nsubj:pass:xsubj, nsubj:xsubj, nummod, obj, obl, obl:agent, obl:arg, obl:cmpr, obl:inst, obl:lmod, obl:pmod, obl:tmod, orphan, parataxis, punct, ref, reparandum, root, vocative, xcomp | |
| See https://quest.ms.mff.cuni.cz/udvalidator/cgi-bin/unidep/langspec/specify_edeprel.pl for details. | |
| [Line 10 Sent dev-s1]: [L4 Enhanced unknown-edeprel] Unknown enhanced relation type 'obl:dat' in '7:obl:dat' | |
| [Line 11 Sent dev-s1]: [L4 Enhanced unknown-edeprel] Unknown enhanced relation type 'obl:இல்:nom' in '11:obl:இல்:nom' | |
| [Line 32 Sent dev-s2]: [L4 Enhanced unknown-edeprel] Unknown enhanced relation type 'obl:com' in '10:obl:com' | |
| [Line 45 Sent dev-s3]: [L4 Enhanced unknown-edeprel] Unknown enhanced relation type 'obl:com' in '11:obl:com' | |
| [Line 48 Sent dev-s3]: [L4 Enhanced unknown-edeprel] Unknown enhanced relation type 'obl:com' in '11:obl:com' | |
| [Line 50 Sent dev-s3]: [L4 Enhanced unknown-edeprel] Unknown enhanced relation type 'nmod:nom' in '10:nmod:nom' | |
| [Line 58 Sent dev-s3]: [L4 Enhanced unknown-edeprel] Unknown enhanced relation type 'nmod:nom' in '18:nmod:nom' | |
| [Line 68 Sent dev-s4]: [L4 Enhanced unknown-edeprel] Unknown enhanced relation type 'obl:loc' in '23:obl:loc' | |
| ...suppressing further errors regarding Enhanced | |
| Enhanced errors: 351 | |
| *** FAILED *** with 351 errors | |
| Exit code: 1 | |
| /net/work/people/zeman/unidep/tools/validate.py --lang ta --max-err=10 UD_Tamil-TTB/ta_ttb-ud-test.conllu | |
| [Line 6 Sent test-s1]: [L4 Enhanced unknown-edeprel] Unknown enhanced relation type 'nmod:இலிருந்து:nom' in '4:nmod:இலிருந்து:nom' | |
| [Line 13 Sent test-s1]: [L4 Enhanced unknown-edeprel] Unknown enhanced relation type 'obl:dat' in '9:obl:dat' | |
| [Line 28 Sent test-s2]: [L4 Enhanced unknown-edeprel] Unknown enhanced relation type 'obl:இலிருந்து:nom' in '9:obl:இலிருந்து:nom' | |
| [Line 42 Sent test-s3]: [L4 Enhanced unknown-edeprel] Unknown enhanced relation type 'nmod:nom' in '2:nmod:nom' | |
| [Line 43 Sent test-s3]: [L4 Enhanced unknown-edeprel] Unknown enhanced relation type 'obl:loc' in '5:obl:loc' | |
| [Line 44 Sent test-s3]: [L4 Enhanced unknown-edeprel] Unknown enhanced relation type 'nmod:nom' in '4:nmod:nom' | |
| [Line 49 Sent test-s3]: [L4 Enhanced unknown-edeprel] Unknown enhanced relation type 'nmod:dat' in '9:nmod:dat' | |
| [Line 54 Sent test-s3]: [L4 Enhanced unknown-edeprel] Unknown enhanced relation type 'obl:arg:இடம்:gen' in '15:obl:arg:இடம்:gen' | |
| [Line 66 Sent test-s4]: [L4 Enhanced unknown-edeprel] Unknown enhanced relation type 'obl:com' in '5:obl:com' | |
| ...suppressing further errors regarding Enhanced | |
| [Line 2738 Sent test-s118 Node 7]: [L3 Syntax too-many-subjects] Multiple subjects [4, 6] not subtyped as ':outer'. Outer subjects are allowed if a clause acts as the predicate of another clause. | |
| Enhanced errors: 483 | |
| Syntax errors: 1 | |
| *** FAILED *** with 484 errors | |
| Exit code: 1 | |
| /net/work/people/zeman/unidep/tools/validate.py --lang ta --max-err=10 UD_Tamil-TTB/ta_ttb-ud-train.conllu | |
| [Line 5 Sent train-s1]: [L4 Enhanced unknown-edeprel] Unknown enhanced relation type 'obl:அருகே:nom' in '18:obl:அருகே:nom' | |
| [Line 7 Sent train-s1]: [L4 Enhanced unknown-edeprel] Unknown enhanced relation type 'nmod:nom' in '4:nmod:nom' | |
| [Line 8 Sent train-s1]: [L4 Enhanced unknown-edeprel] Unknown enhanced relation type 'obl:loc' in '18:obl:loc' | |
| [Line 9 Sent train-s1]: [L4 Enhanced unknown-edeprel] Unknown enhanced relation type 'nmod:nom' in '6:nmod:nom' | |
| [Line 10 Sent train-s1]: [L4 Enhanced unknown-edeprel] Unknown enhanced relation type 'nmod:nom' in '11:nmod:nom' | |
| [Line 16 Sent train-s1]: [L4 Enhanced unknown-edeprel] Unknown enhanced relation type 'nmod:dat' in '12:nmod:dat' | |
| [Line 19 Sent train-s1]: [L4 Enhanced unknown-edeprel] Unknown enhanced relation type 'nmod:dat' in '15:nmod:dat' | |
| [Line 20 Sent train-s1]: [L4 Enhanced unknown-edeprel] Unknown enhanced relation type 'nmod:இல்:nom' in '17:nmod:இல்:nom' | |
| [Line 22 Sent train-s1]: [L4 Enhanced unknown-edeprel] Unknown enhanced relation type 'obl:loc' in '18:obl:loc' | |
| ...suppressing further errors regarding Enhanced | |
| [Line 4427 Sent train-s192 Node 25]: [L3 Syntax too-many-subjects] Multiple subjects [11, 17] not subtyped as ':outer'. Outer subjects are allowed if a clause acts as the predicate of another clause. | |
| Enhanced errors: 1922 | |
| Syntax errors: 1 | |
| *** FAILED *** with 1923 errors | |
| Exit code: 1 | |
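
Because `--max-err=10` suppresses most of the Enhanced errors, it can help to tally which unknown relation types actually appear in the printed lines. The sketch below parses validator messages of the form shown above with a regular expression; the function name and the sample list are hypothetical, and the tally covers only the lines that were printed, not all 351, 483, or 1922 reported errors.

```python
import re
from collections import Counter

# Matches the quoted relation in messages such as:
#   ... Unknown enhanced relation type 'obl:dat' in '11:obl:dat'
UNKNOWN_EDEPREL = re.compile(r"Unknown enhanced relation type '([^']+)'")

def tally_unknown_relations(lines):
    """Count each unknown enhanced relation type found in validator output lines."""
    counts = Counter()
    for line in lines:
        match = UNKNOWN_EDEPREL.search(line)
        if match:
            counts[match.group(1)] += 1
    return counts

# A few lines copied from the dev run above.
sample = [
    "[Line 9 Sent dev-s1]: [L4 Enhanced unknown-edeprel] Unknown enhanced relation type 'obl:dat' in '11:obl:dat'",
    "[Line 10 Sent dev-s1]: [L4 Enhanced unknown-edeprel] Unknown enhanced relation type 'obl:dat' in '7:obl:dat'",
    "[Line 32 Sent dev-s2]: [L4 Enhanced unknown-edeprel] Unknown enhanced relation type 'obl:com' in '10:obl:com'",
    "[Line 50 Sent dev-s3]: [L4 Enhanced unknown-edeprel] Unknown enhanced relation type 'nmod:nom' in '10:nmod:nom'",
    "...suppressing further errors regarding Enhanced",
]

for relation, count in tally_unknown_relations(sample).most_common():
    print(relation, count)
```

Running the validator without the error cap and feeding its full output to the same function would give complete counts per relation type, which indicates which relations would need to be registered for [ta] or corrected in the data.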
| Validity: 0.01 | |
| (weight=0.0769230769230769) * (score{features}=0.8) = 0.0615384615384615 | |
| (weight=0.0769230769230769) * (score{genres}=0.0588235294117647) = 0.00452488687782805 | |
| (weight=0.0769230769230769) * (score{lemmas}=0.8) = 0.0615384615384615 | |
| (weight=0.256410256410256) * (score{size}=0.327136946721963) = 0.0838812683902469 | |
| (weight=0.0512820512820513) * (score{split}=0.01) = 0.000512820512820513 | |
| (weight=0.0769230769230769) * (score{tags}=0.658823529411765) = 0.0506787330316742 | |
| (weight=0.307692307692308) * (score{udapi}=0.786034860661726) = 0.241856880203608 | |
| (weight=0.0769230769230769) * (score{udeprels}=0.540540540540541) = 0.0415800415800416 | |
| (TOTAL score=0.546111553673142) * (availability=1) * (validity=0.01) = 0.00546111553673142 | |
| STARS = 0 | |
| UD_Tamil-TTB 0.00546111553673142 0 | |
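
The final score in the breakdown above is a weighted sum of the component scores, multiplied by availability and validity. The sketch below recomputes it from the weights and scores exactly as printed in this log; the dictionary and variable names are hypothetical, and the mapping from score to stars is not reproduced because the log does not state it.

```python
# (weight, score) pairs copied from the breakdown in this log.
components = {
    "features": (0.0769230769230769, 0.8),
    "genres":   (0.0769230769230769, 0.0588235294117647),
    "lemmas":   (0.0769230769230769, 0.8),
    "size":     (0.256410256410256,  0.327136946721963),
    "split":    (0.0512820512820513, 0.01),
    "tags":     (0.0769230769230769, 0.658823529411765),
    "udapi":    (0.307692307692308,  0.786034860661726),
    "udeprels": (0.0769230769230769, 0.540540540540541),
}

availability = 1
validity = 0.01   # reported as "Validity: 0.01" because validation failed

# Weighted sum of the component scores (the weights add up to 1).
total = sum(weight * score for weight, score in components.values())
final = total * availability * validity

print(f"TOTAL score = {total}")
print(f"final score = {final}")
```

The validity multiplier dominates the result: with the same component scores, a treebank that passed validation (assuming validity would then be 1) would score about 0.546 instead of about 0.0055.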