Regenerated:
UD Afrikaans
af
PASS
python tools/validate.py --lang af UD-dev-branches/UD_Afrikaans/af-ud-dev.conllu *** PASSED *** ****************** python tools/validate.py --lang af UD-dev-branches/UD_Afrikaans/af-ud-test.conllu *** PASSED *** ****************** python tools/validate.py --lang af UD-dev-branches/UD_Afrikaans/af-ud-train.conllu *** PASSED *** ******************
UD Amharic
am
EMPTY
No data
UD Ancient Greek
grc
PASS
python tools/validate.py --lang grc UD-dev-branches/UD_Ancient_Greek/grc-ud-dev.conllu *** PASSED *** ****************** python tools/validate.py --lang grc UD-dev-branches/UD_Ancient_Greek/grc-ud-test.conllu *** PASSED *** ****************** python tools/validate.py --lang grc UD-dev-branches/UD_Ancient_Greek/grc-ud-train.conllu *** PASSED *** ******************
UD Ancient Greek-PROIEL
grc proiel
PASS
python tools/validate.py --lang grc_proiel UD-dev-branches/UD_Ancient_Greek-PROIEL/grc_proiel-ud-dev.conllu *** PASSED *** ****************** python tools/validate.py --lang grc_proiel UD-dev-branches/UD_Ancient_Greek-PROIEL/grc_proiel-ud-test.conllu *** PASSED *** ****************** python tools/validate.py --lang grc_proiel UD-dev-branches/UD_Ancient_Greek-PROIEL/grc_proiel-ud-train.conllu *** PASSED *** ******************
UD Arabic
ar
PASS
python tools/validate.py --lang ar UD-dev-branches/UD_Arabic/ar-ud-dev.conllu *** PASSED *** ****************** python tools/validate.py --lang ar UD-dev-branches/UD_Arabic/ar-ud-test.conllu *** PASSED *** ****************** python tools/validate.py --lang ar UD-dev-branches/UD_Arabic/ar-ud-train.conllu *** PASSED *** ******************
UD Arabic-NYUAD
ar nyuad
PASS
python tools/validate.py --lang ar_nyuad UD-dev-branches/UD_Arabic-NYUAD/ar_nyuad-ud-dev.conllu *** PASSED *** ****************** python tools/validate.py --lang ar_nyuad UD-dev-branches/UD_Arabic-NYUAD/ar_nyuad-ud-test.conllu *** PASSED *** ****************** python tools/validate.py --lang ar_nyuad UD-dev-branches/UD_Arabic-NYUAD/ar_nyuad-ud-train.conllu *** PASSED *** ******************
UD Arabic-PUD
ar pud
PASS
python tools/validate.py --lang ar_pud UD-dev-branches/UD_Arabic-PUD/ar_pud-ud-test.conllu *** PASSED *** ******************
UD Armenian
hy
EMPTY
No data
UD Bambara
bm
FAIL
python tools/validate.py --lang bm UD-dev-branches/UD_Bambara/bm-ud-dev.conllu [Line 5]: Unknown attribute-value pair ASpect=Perf [Line 5]: Unknown attribute-value pair Valency=1 [Line 5]: Invalid DEPREL value _ [Line 5]: Unknown UD DEPREL: _ [Line 6]: Unknown UD DEPREL: nmod:poss [Line 7]: Unknown UD DEPREL: nmod:poss [Line 8]: Unknown UD DEPREL: nmod:poss [Line 13]: Undefined ID in HEAD: _ [Tree number 1 on line 3]: Empty head for word ID 3 [Tree number 1 on line 3]: Non-tree structure. Words 1,2,3,4,5,6,7,8,9,10 are not reachable from the root 0. [Line 13]: SpaceAfter=No is missing in the MISC field of node #10 because the text is '1976.' [Line 13]: Extra characters at the end of the text attribute, not accounted for in the FORM fields: '.' [Line 16]: Unknown UD DEPREL: nmod:poss [Line 18]: Morphological features must be sorted: 'Aspectpect=Perf|Valency=1|Polarity=Pos' [Line 18]: Unknown attribute-value pair Aspectpect=Perf [Line 18]: Unknown attribute-value pair Valency=1 [Line 18]: Invalid DEPREL value _ [Line 18]: Unknown UD DEPREL: _ [Line 19]: Unknown UD DEPREL: nmod:poss [Line 20]: Unknown UD DEPREL: nmod:poss [Line 24]: Undefined ID in HEAD: _ [Tree number 2 on line 16]: Empty head for word ID 3 [Tree number 2 on line 16]: Non-tree structure. Words 1,2,3,4,5,6,7,8 are not reachable from the root 0. [Line 24]: The forward slash is reserved for special use in parallel treebanks: ../kibaru543_03dunbuya_konta-bamako_jumamisiriba.dis.html:2 [Line 24]: SpaceAfter=No is missing in the MISC field of node #7 because the text is '8.' [Line 27]: Morphological features must be sorted: 'PronType=Prs|Number=Sing|Person=3' [Line 27]: Unknown UD DEPREL: nmod:poss [Line 29]: Repeated features are disallowed: Tense=Past|Tense=Past [Line 31]: Unknown UD DEPREL: nmod:poss [Line 34]: Invalid DEPREL value _ [Line 34]: Unknown UD DEPREL: _ [Line 38]: Unknown UD DEPREL: flat:name [Line 39]: Unknown UD DEPREL: flat:name [Line 40]: Unknown UD DEPREL: flat:name ...suppressing further errors regarding Syntax [Line 43]: Undefined ID in HEAD: _ [Tree number 3 on line 27]: Empty head for word ID 8 [Line 43]: The forward slash is reserved for special use in parallel treebanks: ../kibaru543_03dunbuya_konta-bamako_jumamisiriba.dis.html:3 [Line 43]: SpaceAfter=No is missing in the MISC field of node #9 because the text is 'bolo, Cɛrino Amadu U[...]' [Line 43]: SpaceAfter=No is missing in the MISC field of node #15 because the text is 'Jalo.' [Line 53]: Repeated features are disallowed: Tense=Past|Tense=Past [Line 55]: Morphological features must be sorted: 'PronType=Prs|Number=Sing|Person=3' [Line 59]: Undefined ID in HEAD: _ [Tree number 4 on line 46]: Empty head for word ID 2 [Line 59]: The forward slash is reserved for special use in parallel treebanks: ../kibaru543_03dunbuya_konta-bamako_jumamisiriba.dis.html:4 [Line 59]: SpaceAfter=No is missing in the MISC field of node #2 because the text is 'lasigiden, Masiwudi [...]' [Line 59]: SpaceAfter=No is missing in the MISC field of node #9 because the text is 'b'a kɛnɛ kan.' [Line 59]: SpaceAfter=No is missing in the MISC field of node #12 because the text is 'kan.' [Line 74]: Morphological features must be sorted: 'PronType=Prs|Number=Plur' [Line 75]: Repeated features are disallowed: Tense=Past|Tense=Past [Line 77]: Morphological features must be sorted: 'PronType=Prs|Number=Sing|Person=3' [Line 81]: Undefined ID in HEAD: _ [Tree number 5 on line 62]: Empty head for word ID 3 [Line 81]: The forward slash is reserved for special use in parallel treebanks: ../kibaru543_03dunbuya_konta-bamako_jumamisiriba.dis.html:5 [Line 81]: SpaceAfter=No is missing in the MISC field of node #5 because the text is 'Diko, misiriba alima[...]' [Line 81]: SpaceAfter=No is missing in the MISC field of node #8 because the text is 'alimami, Mahamudu Ka[...]' [Line 81]: SpaceAfter=No is missing in the MISC field of node #11 because the text is 'Kale, olu tun b'a kɛ[...]' [Line 81]: SpaceAfter=No is missing in the MISC field of node #15 because the text is 'b'a kɛnɛ kan.' [Line 81]: SpaceAfter=No is missing in the MISC field of node #18 because the text is 'kan.' [Line 95]: Morphological features must be sorted: 'PronType=Prs|Number=Plur' [Line 96]: Unknown attribute-value pair AdjType=Attr [Line 97]: Repeated features are disallowed: Tense=Past|Tense=Past [Line 99]: Morphological features must be sorted: 'PronType=Prs|Number=Sing|Person=3' [Line 103]: The forward slash is reserved for special use in parallel treebanks: ../kibaru543_03dunbuya_konta-bamako_jumamisiriba.dis.html:6 [Line 103]: SpaceAfter=No is missing in the MISC field of node #3 because the text is 'lasigidenw, Bamakɔ s[...]' ...suppressing further errors regarding Metadata *** FAILED *** with 90 errors Format errors: 10 Metadata errors: 23 Morpho errors: 16 Syntax errors: 41 The language-specific file /home/ginter/UD_PROJHOOK/tools/data/deprel.bm does not exist. The language-specific file /home/ginter/UD_PROJHOOK/tools/data/feat_val.bm does not exist. python conllu-stats.py --catvals=langspec yourdata/*.conllu > /home/ginter/UD_PROJHOOK/tools/data/feat_val.bm ******************
UD Bangla
bn
EMPTY
No data
UD Basque
eu
PASS
python tools/validate.py --lang eu UD-dev-branches/UD_Basque/eu-ud-dev.conllu *** PASSED *** ****************** python tools/validate.py --lang eu UD-dev-branches/UD_Basque/eu-ud-test.conllu *** PASSED *** ****************** python tools/validate.py --lang eu UD-dev-branches/UD_Basque/eu-ud-train.conllu *** PASSED *** ******************
UD Belarusian
be
PASS
python tools/validate.py --lang be UD-dev-branches/UD_Belarusian/be-ud-dev.conllu *** PASSED *** ****************** python tools/validate.py --lang be UD-dev-branches/UD_Belarusian/be-ud-test.conllu *** PASSED *** ****************** python tools/validate.py --lang be UD-dev-branches/UD_Belarusian/be-ud-train.conllu *** PASSED *** ******************
UD Bengali-DDS
bn dds
EMPTY
No data
UD Bulgarian
bg
PASS
python tools/validate.py --lang bg UD-dev-branches/UD_Bulgarian/bg-ud-dev.conllu *** PASSED *** ****************** python tools/validate.py --lang bg UD-dev-branches/UD_Bulgarian/bg-ud-test.conllu *** PASSED *** ****************** python tools/validate.py --lang bg UD-dev-branches/UD_Bulgarian/bg-ud-train.conllu *** PASSED *** ******************
UD Buryat
bxr
PASS
python tools/validate.py --lang bxr UD-dev-branches/UD_Buryat/bxr-ud-sample.conllu *** PASSED *** ****************** python tools/validate.py --lang bxr UD-dev-branches/UD_Buryat/bxr-ud-test.conllu *** PASSED *** ******************
UD Cantonese
yue
EMPTY
No data
UD Catalan
ca
PASS
python tools/validate.py --lang ca UD-dev-branches/UD_Catalan/ca-ud-dev.conllu *** PASSED *** ****************** python tools/validate.py --lang ca UD-dev-branches/UD_Catalan/ca-ud-test.conllu *** PASSED *** ****************** python tools/validate.py --lang ca UD-dev-branches/UD_Catalan/ca-ud-train.conllu *** PASSED *** ******************
UD Chinese
zh
PASS
python tools/validate.py --lang zh UD-dev-branches/UD_Chinese/zh-ud-dev.conllu *** PASSED *** ****************** python tools/validate.py --lang zh UD-dev-branches/UD_Chinese/zh-ud-test.conllu *** PASSED *** ****************** python tools/validate.py --lang zh UD-dev-branches/UD_Chinese/zh-ud-train.conllu *** PASSED *** ******************
UD Chinese-CFL
zh cfl
FAIL
python tools/validate.py --lang zh_cfl UD-dev-branches/UD_Chinese-CFL/zh-cfl-test.conllu [Line 21]: The forward slash is reserved for special use in parallel treebanks: CFL_A_1-1a/ori [Line 33]: The forward slash is reserved for special use in parallel treebanks: CFL_A_1-1b/ori [Line 44]: The forward slash is reserved for special use in parallel treebanks: CFL_A_1-2/ori [Line 79]: The forward slash is reserved for special use in parallel treebanks: CFL_A_1-3/ori [Line 97]: The forward slash is reserved for special use in parallel treebanks: CFL_A_1-4/ori [Line 117]: The forward slash is reserved for special use in parallel treebanks: CFL_A_1-5/ori [Line 130]: The forward slash is reserved for special use in parallel treebanks: CFL_A_1-6/ori [Line 140]: The forward slash is reserved for special use in parallel treebanks: CFL_A_1-7/ori [Line 149]: The forward slash is reserved for special use in parallel treebanks: CFL_A_1-8/ori [Line 163]: The forward slash is reserved for special use in parallel treebanks: CFL_A_1-9/ori [Line 184]: The forward slash is reserved for special use in parallel treebanks: CFL_A_1-10/ori [Line 196]: The forward slash is reserved for special use in parallel treebanks: CFL_A_1-11/ori [Line 214]: The forward slash is reserved for special use in parallel treebanks: CFL_A_1-12/ori [Line 233]: The forward slash is reserved for special use in parallel treebanks: CFL_A_1-13/ori [Line 246]: The forward slash is reserved for special use in parallel treebanks: CFL_A_1-14/ori [Line 281]: The forward slash is reserved for special use in parallel treebanks: CFL_A_2-1/ori [Line 297]: The forward slash is reserved for special use in parallel treebanks: CFL_A_2-2/ori [Line 308]: The forward slash is reserved for special use in parallel treebanks: CFL_A_2-3a/ori [Line 317]: The forward slash is reserved for special use in parallel treebanks: CFL_A_2-3b/ori ...suppressing further errors regarding Metadata *** FAILED *** with 453 errors Metadata errors: 453 ******************
UD Chinese-HK
zh hk
PASS
python tools/validate.py --lang zh_hk UD-dev-branches/UD_Chinese-HK/zh_hk-ud-test.conllu *** PASSED *** ******************
UD Chinese-PUD
zh pud
PASS
python tools/validate.py --lang zh_pud UD-dev-branches/UD_Chinese-PUD/zh_pud-ud-test.conllu *** PASSED *** ******************
UD Coptic
cop
PASS
python tools/validate.py --lang cop UD-dev-branches/UD_Coptic/cop-ud-dev.conllu *** PASSED *** ****************** python tools/validate.py --lang cop UD-dev-branches/UD_Coptic/cop-ud-test.conllu *** PASSED *** ****************** python tools/validate.py --lang cop UD-dev-branches/UD_Coptic/cop-ud-train.conllu *** PASSED *** ******************
UD Croatian
hr
PASS
python tools/validate.py --lang hr UD-dev-branches/UD_Croatian/hr-ud-dev.conllu *** PASSED *** ****************** python tools/validate.py --lang hr UD-dev-branches/UD_Croatian/hr-ud-test.conllu *** PASSED *** ****************** python tools/validate.py --lang hr UD-dev-branches/UD_Croatian/hr-ud-train.conllu *** PASSED *** ******************
UD Czech
cs
PASS
python tools/validate.py --lang cs UD-dev-branches/UD_Czech/cs-ud-dev.conllu *** PASSED *** ****************** python tools/validate.py --lang cs UD-dev-branches/UD_Czech/cs-ud-test.conllu *** PASSED *** ****************** python tools/validate.py --lang cs UD-dev-branches/UD_Czech/cs-ud-train-c.conllu *** PASSED *** ****************** python tools/validate.py --lang cs UD-dev-branches/UD_Czech/cs-ud-train-l.conllu *** PASSED *** ****************** python tools/validate.py --lang cs UD-dev-branches/UD_Czech/cs-ud-train-m.conllu *** PASSED *** ****************** python tools/validate.py --lang cs UD-dev-branches/UD_Czech/cs-ud-train-v.conllu *** PASSED *** ******************
UD Czech-CAC
cs cac
PASS
python tools/validate.py --lang cs_cac UD-dev-branches/UD_Czech-CAC/cs_cac-ud-dev.conllu *** PASSED *** ****************** python tools/validate.py --lang cs_cac UD-dev-branches/UD_Czech-CAC/cs_cac-ud-test.conllu *** PASSED *** ****************** python tools/validate.py --lang cs_cac UD-dev-branches/UD_Czech-CAC/cs_cac-ud-train.conllu *** PASSED *** ******************
UD Czech-CLTT
cs cltt
PASS
python tools/validate.py --lang cs_cltt UD-dev-branches/UD_Czech-CLTT/cs_cltt-ud-dev.conllu *** PASSED *** ****************** python tools/validate.py --lang cs_cltt UD-dev-branches/UD_Czech-CLTT/cs_cltt-ud-test.conllu *** PASSED *** ****************** python tools/validate.py --lang cs_cltt UD-dev-branches/UD_Czech-CLTT/cs_cltt-ud-train.conllu *** PASSED *** ******************
UD Czech-FicTree
cs fictree
EMPTY
No data
UD Czech-PUD
cs pud
PASS
python tools/validate.py --lang cs_pud UD-dev-branches/UD_Czech-PUD/cs_pud-ud-test.conllu *** PASSED *** ******************
UD Danish
da
PASS
python tools/validate.py --lang da UD-dev-branches/UD_Danish/da-ud-dev.conllu *** PASSED *** ****************** python tools/validate.py --lang da UD-dev-branches/UD_Danish/da-ud-test.conllu *** PASSED *** ****************** python tools/validate.py --lang da UD-dev-branches/UD_Danish/da-ud-train.conllu *** PASSED *** ******************
UD Dargwa
dar
EMPTY
No data
UD Dutch
nl
PASS
python tools/validate.py --lang nl UD-dev-branches/UD_Dutch/nl-ud-dev.conllu *** PASSED *** ****************** python tools/validate.py --lang nl UD-dev-branches/UD_Dutch/nl-ud-test.conllu *** PASSED *** ****************** python tools/validate.py --lang nl UD-dev-branches/UD_Dutch/nl-ud-train.conllu *** PASSED *** ******************
UD Dutch-LassySmall
nl lassysmall
PASS
python tools/validate.py --lang nl_lassysmall UD-dev-branches/UD_Dutch-LassySmall/nl_lassysmall-ud-dev.conllu *** PASSED *** ****************** python tools/validate.py --lang nl_lassysmall UD-dev-branches/UD_Dutch-LassySmall/nl_lassysmall-ud-test.conllu *** PASSED *** ****************** python tools/validate.py --lang nl_lassysmall UD-dev-branches/UD_Dutch-LassySmall/nl_lassysmall-ud-train.conllu *** PASSED *** ******************
UD English
en
PASS
python tools/validate.py --lang en UD-dev-branches/UD_English/en-ud-dev.conllu *** PASSED *** ****************** python tools/validate.py --lang en UD-dev-branches/UD_English/en-ud-test.conllu *** PASSED *** ****************** python tools/validate.py --lang en UD-dev-branches/UD_English/en-ud-train.conllu *** PASSED *** ******************
UD English-ESL
en esl
FAIL
python tools/validate.py --lang en_esl UD-dev-branches/UD_English-ESL/en_esl-ud-dev.conllu [Line 15]: Unknown UD DEPREL: neg [Line 17]: Unknown UD DEPREL: dobj [Line 19]: Missing the sent_id attribute. [Line 19]: Missing the text attribute. [Line 30]: Unknown UD DEPREL: dobj [Line 33]: Unknown UPOS tag: CONJ [Line 37]: Unknown UD DEPREL: dobj [Line 41]: Missing the sent_id attribute. [Line 41]: Missing the text attribute. [Line 54]: Unknown UPOS tag: CONJ [Line 69]: Unknown UD DEPREL: dobj [Line 71]: Missing the sent_id attribute. [Line 71]: Missing the text attribute. [Line 77]: Unknown UD DEPREL: neg [Line 83]: Unknown UD DEPREL: dobj [Line 88]: Unknown UPOS tag: CONJ [Line 95]: Missing the sent_id attribute. [Line 95]: Missing the text attribute. [Line 99]: Unknown UD DEPREL: dobj [Line 121]: Unknown UPOS tag: CONJ [Line 124]: Unknown UPOS tag: CONJ [Line 127]: Missing the sent_id attribute. [Line 127]: Missing the text attribute. [Line 133]: Unknown UD DEPREL: neg [Line 142]: Missing the sent_id attribute. [Line 142]: Missing the text attribute. [Line 158]: Unknown UPOS tag: CONJ [Line 164]: Unknown UD DEPREL: neg [Line 167]: Unknown UD DEPREL: dobj [Line 182]: Missing the sent_id attribute. [Line 182]: Missing the text attribute. [Line 191]: Unknown UD DEPREL: dobj [Line 199]: Unknown UD DEPREL: neg [Line 202]: Unknown UD DEPREL: dobj [Line 203]: Unknown UPOS tag: CONJ [Line 207]: Missing the sent_id attribute. [Line 207]: Missing the text attribute. [Line 220]: Unknown UD DEPREL: neg [Line 226]: Missing the sent_id attribute. [Line 226]: Missing the text attribute. [Line 235]: Unknown UD DEPREL: dobj [Line 240]: Missing the sent_id attribute. ...suppressing further errors regarding Metadata [Line 243]: Unknown UPOS tag: CONJ [Line 246]: Unknown UD DEPREL: dobj [Line 252]: Unknown UD DEPREL: dobj [Line 256]: Unknown UD DEPREL: dobj ...suppressing further errors regarding Syntax [Line 279]: Unknown UPOS tag: CONJ [Line 331]: Unknown UPOS tag: CONJ [Line 356]: Unknown UPOS tag: CONJ [Line 377]: Unknown UPOS tag: CONJ [Line 435]: Unknown UPOS tag: CONJ [Line 479]: Unknown UPOS tag: CONJ [Line 517]: Unknown UPOS tag: CONJ [Line 545]: Unknown UPOS tag: CONJ [Line 591]: Unknown UPOS tag: CONJ [Line 638]: Unknown UPOS tag: CONJ [Line 744]: Unknown UPOS tag: CONJ ...suppressing further errors regarding Morpho *** FAILED *** with 2110 errors Metadata errors: 1000 Morpho errors: 316 Syntax errors: 794 ****************** python tools/validate.py --lang en_esl UD-dev-branches/UD_English-ESL/en_esl-ud-train.conllu [Line 13]: Unknown UPOS tag: CONJ [Line 18]: Unknown UD DEPREL: dobj [Line 20]: Missing the sent_id attribute. [Line 20]: Missing the text attribute. [Line 32]: Unknown UD DEPREL: neg [Line 38]: Unknown UPOS tag: CONJ [Line 45]: Unknown UD DEPREL: dobj [Line 52]: Unknown UPOS tag: CONJ [Line 57]: Missing the sent_id attribute. [Line 57]: Missing the text attribute. [Line 61]: Unknown UD DEPREL: mwe [Line 76]: Unknown UD DEPREL: dobj [Line 79]: Unknown UPOS tag: CONJ [Line 83]: Missing the sent_id attribute. [Line 83]: Missing the text attribute. [Line 91]: Unknown UD DEPREL: nsubjpass [Line 92]: Unknown UPOS tag: CONJ [Line 95]: Unknown UD DEPREL: auxpass [Line 97]: Unknown UD DEPREL: dobj [Line 99]: Missing the sent_id attribute. [Line 99]: Missing the text attribute. [Line 113]: Unknown UD DEPREL: dobj [Line 116]: Unknown UD DEPREL: dobj [Line 118]: Unknown UD DEPREL: neg [Line 128]: Missing the sent_id attribute. [Line 128]: Missing the text attribute. [Line 135]: Unknown UPOS tag: CONJ [Line 140]: Unknown UD DEPREL: neg [Line 141]: Unknown UD DEPREL: dobj [Line 143]: Missing the sent_id attribute. [Line 143]: Missing the text attribute. [Line 153]: Unknown UPOS tag: CONJ [Line 158]: Unknown UD DEPREL: dobj [Line 160]: Missing the sent_id attribute. [Line 160]: Missing the text attribute. [Line 173]: Unknown UPOS tag: CONJ [Line 176]: Missing the sent_id attribute. [Line 176]: Missing the text attribute. [Line 193]: Unknown UPOS tag: CONJ [Line 195]: Unknown UD DEPREL: mwe [Line 208]: Unknown UPOS tag: CONJ [Line 218]: Missing the sent_id attribute. [Line 218]: Missing the text attribute. [Line 224]: Unknown UPOS tag: CONJ [Line 227]: Unknown UPOS tag: CONJ [Line 233]: Unknown UPOS tag: CONJ [Line 237]: Unknown UD DEPREL: dobj [Line 245]: Unknown UPOS tag: CONJ [Line 254]: Missing the sent_id attribute. ...suppressing further errors regarding Metadata [Line 264]: Unknown UD DEPREL: dobj [Line 267]: Unknown UPOS tag: CONJ [Line 268]: Unknown UD DEPREL: neg [Line 275]: Unknown UD DEPREL: dobj [Line 277]: Unknown UPOS tag: CONJ ...suppressing further errors regarding Syntax [Line 311]: Unknown UPOS tag: CONJ [Line 323]: Unknown UPOS tag: CONJ [Line 352]: Unknown UPOS tag: CONJ ...suppressing further errors regarding Morpho *** FAILED *** with 17197 errors Metadata errors: 8248 Morpho errors: 2549 Syntax errors: 6400 ******************
UD English-LinES
en lines
PASS
python tools/validate.py --lang en_lines UD-dev-branches/UD_English-LinES/en_lines-ud-dev.conllu *** PASSED *** ****************** python tools/validate.py --lang en_lines UD-dev-branches/UD_English-LinES/en_lines-ud-test.conllu *** PASSED *** ****************** python tools/validate.py --lang en_lines UD-dev-branches/UD_English-LinES/en_lines-ud-train.conllu *** PASSED *** ******************
UD English-PUD
en pud
PASS
python tools/validate.py --lang en_pud UD-dev-branches/UD_English-PUD/en_pud-ud-test.conllu *** PASSED *** ******************
UD English-ParTUT
en partut
PASS
python tools/validate.py --lang en_partut UD-dev-branches/UD_English-ParTUT/en_partut-ud-dev.conllu *** PASSED *** ****************** python tools/validate.py --lang en_partut UD-dev-branches/UD_English-ParTUT/en_partut-ud-test.conllu *** PASSED *** ****************** python tools/validate.py --lang en_partut UD-dev-branches/UD_English-ParTUT/en_partut-ud-train.conllu *** PASSED *** ******************
UD Erzya
myv
FAIL
python tools/validate.py --lang myv UD-dev-branches/UD_Erzya/myv_BryzhinskijMixail_Kirdazht_manu_Pers_Chap-01.conlluTraceback (most recent call last): File "tools/validate.py", line 735, in <module> validate(inp,out,args,tagsets,known_sent_ids) File "tools/validate.py", line 625, in validate validate_text_meta(comments,tree) File "tools/validate.py", line 165, in validate_text_meta if u"NoSpaceAfter=Yes" in cols[MISC]: IndexError: list index out of range [Line 33]: Morphological features must be sorted: 'Sem/Ant_Mal|Number=Sing,Plur|Case=Gen|Definite=Ind' [Line 33]: Spurious morphological feature: 'Sem/Ant_Mal'. Should be of the form attribute=value and must start with [A-Z0-9] and only contain [A-Za-z0-9]. [Line 33]: If an attribute has multiple values, these must be sorted as well: 'Number=Sing,Plur' [Line 34]: Morphological features must be sorted: 'Valency=2|Mood=Ind|Tense=Prt1|Number[subj]=Plur|Person[subj]=3|Number[obj]=Sing|Person[obj]=3' [Line 34]: Unknown attribute-value pair Valency=2 [Line 34]: Unknown attribute-value pair Tense=Prt1 [Line 34]: Unknown attribute-value pair Number[subj]=Plur [Line 34]: Unknown attribute-value pair Person[subj]=3 [Line 34]: Unknown attribute-value pair Number[obj]=Sing [Line 34]: Unknown attribute-value pair Person[obj]=3 [Line 35]: The line has 9 columns, but 10 are expected. [Line 35]: Morphological features must be sorted: 'Valency=2|Derivation=NomAg|Number=Sing|Case=Nom|Definite=Ind' [Line 35]: Unknown attribute-value pair Valency=2 [Line 35]: Unknown attribute-value pair Derivation=NomAg [Line 35]: Failed for parse DEPS: налкставтыця [Line 35]: Malformed head:deprel pair 'налкставтыця' [Line 36]: Morphological features must be sorted: 'Sem/Ani|Number=Plur|Case=Nom|Definite=Ind' [Line 36]: Spurious morphological feature: 'Sem/Ani'. Should be of the form attribute=value and must start with [A-Z0-9] and only contain [A-Za-z0-9]. [Line 37]: Spurious morphological feature: 'CLB'. Should be of the form attribute=value and must start with [A-Z0-9] and only contain [A-Za-z0-9]. [Line 38]: Failed for parse DEPS: налкставтыця [Line 38]: Failed to parse DEPS: налкставтыця [Line 38]: Spurious sent_id line: '#sent_id = chapID1:paragID1:sentID1' Should look like '# sent_id = xxxxxx' where xxxx is not whitespace. Forward slash reserved for special purposes. [Line 38]: Missing the sent_id attribute. [Line 38]: Exception caught! *** FAILED *** with 24 errors Format errors: 4 Metadata errors: 2 Morpho errors: 16 Syntax errors: 2 The language-specific file /home/ginter/UD_PROJHOOK/tools/data/feat_val.myv does not exist. python conllu-stats.py --catvals=langspec yourdata/*.conllu > /home/ginter/UD_PROJHOOK/tools/data/feat_val.myv ****************** python tools/validate.py --lang myv UD-dev-branches/UD_Erzya/myv_KutorkinAndrej_LazhnicyaSuraII_1976_Pers_Part01-Chap01.conlluTraceback (most recent call last): File "tools/validate.py", line 735, in <module> validate(inp,out,args,tagsets,known_sent_ids) File "tools/validate.py", line 625, in validate validate_text_meta(comments,tree) File "tools/validate.py", line 165, in validate_text_meta if u"NoSpaceAfter=Yes" in cols[MISC]: IndexError: list index out of range [Line 33]: Morphological features must be sorted: 'Number=Sing|Case=Nom|Definite=Def' [Line 33]: Unknown UD DEPREL: nsubj:cop [Line 34]: Morphological features must be sorted: 'Valency=1|Mood=Ind|Tense=Prt1|Number[subj]=Sing|Person[subj]=3' [Line 34]: Unknown attribute-value pair Valency=1 [Line 34]: Unknown attribute-value pair Tense=Prt1 [Line 34]: Unknown attribute-value pair Number[subj]=Sing [Line 34]: Unknown attribute-value pair Person[subj]=3 [Line 35]: Morphological features must be sorted: 'Number=Sing,Plur|Case=Gen|Definite=Ind' [Line 35]: If an attribute has multiple values, these must be sorted as well: 'Number=Sing,Plur' [Line 35]: Unknown UD DEPREL: nmod:poss [Line 36]: Morphological features must be sorted: 'Number=Sing|Case=Nom|Definite=Ind' [Line 38]: Spurious sent_id line: '#sent_id = partID1:chapID1:paragID1:sentID1: pgNo="5"' Should look like '# sent_id = xxxxxx' where xxxx is not whitespace. Forward slash reserved for special purposes. [Line 38]: Missing the sent_id attribute. [Line 38]: SpaceAfter=No is missing in the MISC field of node #4 because the text is 'сочельник.' [Line 41]: Morphological features must be sorted: 'Valency=2|Mood=Ind|Tense=Prt1|Number[subj]=Sing|Person[subj]=3' [Line 41]: Unknown attribute-value pair Valency=2 [Line 41]: Unknown attribute-value pair Tense=Prt1 [Line 41]: Unknown attribute-value pair Number[subj]=Sing [Line 41]: Unknown attribute-value pair Person[subj]=3 [Line 42]: Spurious morphological feature: 'Attr'. Should be of the form attribute=value and must start with [A-Z0-9] and only contain [A-Za-z0-9]. [Line 43]: Morphological features must be sorted: 'Number=Sing|Case=Nom|Definite=Ind' [Line 45]: Morphological features must be sorted: 'Number=Sing|Case=Nom|Definite=Ind' [Line 46]: Morphological features must be sorted: 'Number=Sing,Plur|Case=Gen|Definite=Ind' [Line 46]: If an attribute has multiple values, these must be sorted as well: 'Number=Sing,Plur' ...suppressing further errors regarding Morpho [Line 50]: Unknown UD DEPREL: nmod:poss [Line 53]: Spurious sent_id line: '#sent_id = partID1:chapID1:paragID1:sentID2: pgNo="5"' Should look like '# sent_id = xxxxxx' where xxxx is not whitespace. Forward slash reserved for special purposes. [Line 53]: Missing the sent_id attribute. [Line 53]: SpaceAfter=No is missing in the MISC field of node #11 because the text is 'ломанть.' [Line 66]: Spurious sent_id line: '#sent_id = partID1:chapID1:paragID2:sentID1: pgNo="5"' Should look like '# sent_id = xxxxxx' where xxxx is not whitespace. Forward slash reserved for special purposes. [Line 66]: Missing the sent_id attribute. [Line 66]: SpaceAfter=No is missing in the MISC field of node #6 because the text is 'ланга, прок лайшесь.' [Line 66]: SpaceAfter=No is missing in the MISC field of node #9 because the text is 'лайшесь.' [Line 69]: Unknown UD DEPREL: nmod:poss [Line 74]: Spurious sent_id line: '#sent_id = partID1:chapID1:paragID3:sentID1: pgNo="5"' Should look like '# sent_id = xxxxxx' where xxxx is not whitespace. Forward slash reserved for special purposes. [Line 74]: Missing the sent_id attribute. [Line 74]: SpaceAfter=No is missing in the MISC field of node #4 because the text is 'сыль.' [Line 80]: Unknown UD DEPREL: flat:name [Line 81]: Unknown UD DEPREL: nmod:poss [Line 84]: Spurious sent_id line: '#sent_id = partID1:chapID1:paragID3:sentID2: pgNo="5"' Should look like '# sent_id = xxxxxx' where xxxx is not whitespace. Forward slash reserved for special purposes. [Line 84]: Missing the sent_id attribute. [Line 84]: SpaceAfter=No is missing in the MISC field of node #6 because the text is 'Груня.' [Line 89]: Unknown UD DEPREL: nmod:poss [Line 92]: Unknown UD DEPREL: nmod:poss [Line 101]: Spurious sent_id line: '#sent_id = partID1:chapID1:paragID3:sentID3: pgNo="5"' Should look like '# sent_id = xxxxxx' where xxxx is not whitespace. Forward slash reserved for special purposes. [Line 101]: Missing the sent_id attribute. [Line 101]: SpaceAfter=No is missing in the MISC field of node #8 because the text is 'пилензэ, теке бокава[...]' ...suppressing further errors regarding Metadata [Line 130]: Unknown UD DEPREL: nmod:poss [Line 136]: Invalid DEPREL value 7 [Line 136]: Failed for parse DEPS: punct [Line 136]: Unknown UD DEPREL: 7 [Line 136]: Malformed head:deprel pair 'punct' [Line 137]: Invalid DEPREL value _ [Line 137]: Unknown UD DEPREL: _ [Line 141]: Invalid DEPREL value _ [Line 141]: Unknown UD DEPREL: _ [Line 149]: Undefined ID in HEAD: _ [Line 149]: Failed for parse DEPS: punct [Line 149]: Undefined ID in HEAD: _ [Line 149]: Undefined ID in HEAD: _ [Line 149]: Failed to parse DEPS: punct [Tree number 9 on line 129]: Empty head for word ID 8 [Tree number 9 on line 129]: Empty head for word ID 9 [Tree number 9 on line 129]: Empty head for word ID 13 [Tree number 9 on line 129]: Non-tree structure. Words 1,2,3,4,5,6,7,8,9,10,11,12,13,14,15,16,17,18,19,20 are not reachable from the root 0. [Line 161]: Unknown UD DEPREL: aux:subj ...suppressing further errors regarding Syntax [Line 369]: Undefined ID in HEAD: _ [Tree number 26 on line 356]: Empty head for word ID 4 [Line 498]: Undefined ID in HEAD: _ [Line 498]: Failed for parse DEPS: punct [Line 498]: Failed to parse DEPS: punct [Tree number 37 on line 486]: Empty head for word ID 10 [Tree number 44 on line 566]: HEAD == ID for 2 [Line 712]: Undefined ID in HEAD: nmod:poss [Tree number 55 on line 694]: Non-integer head for word ID 7 [Line 912]: Undefined ID in HEAD: obj [Tree number 71 on line 900]: Non-integer head for word ID 11 ...suppressing further errors regarding Format *** FAILED *** with 3011 errors Format errors: 39 Metadata errors: 529 Morpho errors: 2315 Syntax errors: 128 The language-specific file /home/ginter/UD_PROJHOOK/tools/data/deprel.myv does not exist. The language-specific file /home/ginter/UD_PROJHOOK/tools/data/feat_val.myv does not exist. python conllu-stats.py --catvals=langspec yourdata/*.conllu > /home/ginter/UD_PROJHOOK/tools/data/feat_val.myv ******************
UD Estonian
et
PASS
python tools/validate.py --lang et UD-dev-branches/UD_Estonian/et-ud-dev.conllu *** PASSED *** ****************** python tools/validate.py --lang et UD-dev-branches/UD_Estonian/et-ud-test.conllu *** PASSED *** ****************** python tools/validate.py --lang et UD-dev-branches/UD_Estonian/et-ud-train.conllu *** PASSED *** ******************
UD Faroese
fo
EMPTY
No data
UD Finnish
fi
PASS
python tools/validate.py --lang fi UD-dev-branches/UD_Finnish/fi-ud-dev.conllu *** PASSED *** ****************** python tools/validate.py --lang fi UD-dev-branches/UD_Finnish/fi-ud-test.conllu *** PASSED *** ****************** python tools/validate.py --lang fi UD-dev-branches/UD_Finnish/fi-ud-train.conllu *** PASSED *** ******************
UD Finnish-FTB
fi ftb
PASS
python tools/validate.py --lang fi_ftb UD-dev-branches/UD_Finnish-FTB/fi_ftb-ud-dev.conllu *** PASSED *** ****************** python tools/validate.py --lang fi_ftb UD-dev-branches/UD_Finnish-FTB/fi_ftb-ud-test.conllu *** PASSED *** ****************** python tools/validate.py --lang fi_ftb UD-dev-branches/UD_Finnish-FTB/fi_ftb-ud-train.conllu *** PASSED *** ******************
UD Finnish-PUD
fi pud
PASS
python tools/validate.py --lang fi_pud UD-dev-branches/UD_Finnish-PUD/fi_pud-ud-test.conllu *** PASSED *** ******************
UD French
fr
PASS
python tools/validate.py --lang fr UD-dev-branches/UD_French/fr-ud-dev.conllu *** PASSED *** ****************** python tools/validate.py --lang fr UD-dev-branches/UD_French/fr-ud-test.conllu *** PASSED *** ****************** python tools/validate.py --lang fr UD-dev-branches/UD_French/fr-ud-train.conllu *** PASSED *** ******************
UD French-FTB
fr ftb
PASS
python tools/validate.py --lang fr_ftb UD-dev-branches/UD_French-FTB/fr_ftb-ud-dev.conllu *** PASSED *** ****************** python tools/validate.py --lang fr_ftb UD-dev-branches/UD_French-FTB/fr_ftb-ud-test.conllu *** PASSED *** ****************** python tools/validate.py --lang fr_ftb UD-dev-branches/UD_French-FTB/fr_ftb-ud-train.conllu *** PASSED *** ******************
UD French-PUD
fr pud
PASS
python tools/validate.py --lang fr_pud UD-dev-branches/UD_French-PUD/fr_pud-ud-test.conllu *** PASSED *** ******************
UD French-ParTUT
fr partut
PASS
python tools/validate.py --lang fr_partut UD-dev-branches/UD_French-ParTUT/fr_partut-ud-dev.conllu *** PASSED *** ****************** python tools/validate.py --lang fr_partut UD-dev-branches/UD_French-ParTUT/fr_partut-ud-test.conllu *** PASSED *** ****************** python tools/validate.py --lang fr_partut UD-dev-branches/UD_French-ParTUT/fr_partut-ud-train.conllu *** PASSED *** ******************
UD French-Sequoia
fr sequoia
PASS
python tools/validate.py --lang fr_sequoia UD-dev-branches/UD_French-Sequoia/fr_sequoia-ud-dev.conllu *** PASSED *** ****************** python tools/validate.py --lang fr_sequoia UD-dev-branches/UD_French-Sequoia/fr_sequoia-ud-test.conllu *** PASSED *** ****************** python tools/validate.py --lang fr_sequoia UD-dev-branches/UD_French-Sequoia/fr_sequoia-ud-train.conllu *** PASSED *** ******************
UD French-Spoken
fr spoken
EMPTY
No data
UD Galician
gl
PASS
python tools/validate.py --lang gl UD-dev-branches/UD_Galician/gl-ud-dev.conllu *** PASSED *** ****************** python tools/validate.py --lang gl UD-dev-branches/UD_Galician/gl-ud-test.conllu *** PASSED *** ****************** python tools/validate.py --lang gl UD-dev-branches/UD_Galician/gl-ud-train.conllu *** PASSED *** ******************
UD Galician-TreeGal
gl treegal
PASS
python tools/validate.py --lang gl_treegal UD-dev-branches/UD_Galician-TreeGal/gl_treegal-ud-dev.conllu *** PASSED *** ****************** python tools/validate.py --lang gl_treegal UD-dev-branches/UD_Galician-TreeGal/gl_treegal-ud-test.conllu *** PASSED *** ****************** python tools/validate.py --lang gl_treegal UD-dev-branches/UD_Galician-TreeGal/gl_treegal-ud-train.conllu *** PASSED *** ******************
UD German
de
PASS
python tools/validate.py --lang de UD-dev-branches/UD_German/de-ud-dev.conllu *** PASSED *** ****************** python tools/validate.py --lang de UD-dev-branches/UD_German/de-ud-test.conllu *** PASSED *** ****************** python tools/validate.py --lang de UD-dev-branches/UD_German/de-ud-train.conllu *** PASSED *** ******************
UD German-PUD
de pud
PASS
python tools/validate.py --lang de_pud UD-dev-branches/UD_German-PUD/de_pud-ud-test.conllu *** PASSED *** ******************
UD Gothic
got
PASS
python tools/validate.py --lang got UD-dev-branches/UD_Gothic/got-ud-dev.conllu *** PASSED *** ****************** python tools/validate.py --lang got UD-dev-branches/UD_Gothic/got-ud-test.conllu *** PASSED *** ****************** python tools/validate.py --lang got UD-dev-branches/UD_Gothic/got-ud-train.conllu *** PASSED *** ******************
UD Greek
el
PASS
python tools/validate.py --lang el UD-dev-branches/UD_Greek/el-ud-dev.conllu *** PASSED *** ****************** python tools/validate.py --lang el UD-dev-branches/UD_Greek/el-ud-test.conllu *** PASSED *** ****************** python tools/validate.py --lang el UD-dev-branches/UD_Greek/el-ud-train.conllu *** PASSED *** ******************
UD Hebrew
he
PASS
python tools/validate.py --lang he UD-dev-branches/UD_Hebrew/he-ud-dev.conllu *** PASSED *** ****************** python tools/validate.py --lang he UD-dev-branches/UD_Hebrew/he-ud-test.conllu *** PASSED *** ****************** python tools/validate.py --lang he UD-dev-branches/UD_Hebrew/he-ud-train.conllu *** PASSED *** ******************
UD Hindi
hi
PASS
python tools/validate.py --lang hi UD-dev-branches/UD_Hindi/hi-ud-dev.conllu *** PASSED *** ****************** python tools/validate.py --lang hi UD-dev-branches/UD_Hindi/hi-ud-test.conllu *** PASSED *** ****************** python tools/validate.py --lang hi UD-dev-branches/UD_Hindi/hi-ud-train.conllu *** PASSED *** ******************
UD Hindi-PUD
hi pud
PASS
python tools/validate.py --lang hi_pud UD-dev-branches/UD_Hindi-PUD/hi_pud-ud-test.conllu *** PASSED *** ******************
UD Hungarian
hu
PASS
python tools/validate.py --lang hu UD-dev-branches/UD_Hungarian/hu-ud-dev.conllu *** PASSED *** ****************** python tools/validate.py --lang hu UD-dev-branches/UD_Hungarian/hu-ud-test.conllu *** PASSED *** ****************** python tools/validate.py --lang hu UD-dev-branches/UD_Hungarian/hu-ud-train.conllu *** PASSED *** ******************
UD Indonesian
id
PASS
python tools/validate.py --lang id UD-dev-branches/UD_Indonesian/id-ud-dev.conllu *** PASSED *** ****************** python tools/validate.py --lang id UD-dev-branches/UD_Indonesian/id-ud-test.conllu *** PASSED *** ****************** python tools/validate.py --lang id UD-dev-branches/UD_Indonesian/id-ud-train.conllu *** PASSED *** ******************
UD Indonesian-PUD
id pud
FAIL
python tools/validate.py --lang id_pud UD-dev-branches/UD_Indonesian-PUD/id_pud-ud-test.conllu [Line 3]: Spurious morphological feature: 'id/proper=false'. Should be of the form attribute=value and must start with [A-Z0-9] and only contain [A-Za-z0-9]. [Line 3]: Unknown UD DEPREL: p [Line 4]: Spurious morphological feature: 'id/proper=false'. Should be of the form attribute=value and must start with [A-Z0-9] and only contain [A-Za-z0-9]. [Line 5]: Spurious morphological feature: 'id/proper=false'. Should be of the form attribute=value and must start with [A-Z0-9] and only contain [A-Za-z0-9]. [Line 5]: Unknown UPOS tag: AFFIX [Line 5]: Unknown UD DEPREL: pref [Line 6]: Spurious morphological feature: 'id/proper=false'. Should be of the form attribute=value and must start with [A-Z0-9] and only contain [A-Za-z0-9]. [Line 7]: Spurious morphological feature: 'id/proper=false'. Should be of the form attribute=value and must start with [A-Z0-9] and only contain [A-Za-z0-9]. [Line 7]: Unknown UPOS tag: AFFIX [Line 7]: Unknown UD DEPREL: suff [Line 8]: Spurious morphological feature: 'id/proper=false'. Should be of the form attribute=value and must start with [A-Z0-9] and only contain [A-Za-z0-9]. [Line 8]: Unknown UD DEPREL: nn [Line 9]: Spurious morphological feature: 'id/proper=false'. Should be of the form attribute=value and must start with [A-Z0-9] and only contain [A-Za-z0-9]. [Line 10]: Spurious morphological feature: 'id/proper=false'. Should be of the form attribute=value and must start with [A-Z0-9] and only contain [A-Za-z0-9]. [Line 11]: Spurious morphological feature: 'id/proper=false'. Should be of the form attribute=value and must start with [A-Z0-9] and only contain [A-Za-z0-9]. [Line 11]: Unknown UPOS tag: AFFIX [Line 11]: Unknown UD DEPREL: pref [Line 12]: Spurious morphological feature: 'id/proper=false'. Should be of the form attribute=value and must start with [A-Z0-9] and only contain [A-Za-z0-9]. [Line 12]: Unknown UD DEPREL: rcmod [Line 13]: Spurious morphological feature: 'id/proper=false'. Should be of the form attribute=value and must start with [A-Z0-9] and only contain [A-Za-z0-9]. [Line 13]: Unknown UD DEPREL: prep [Line 14]: Spurious morphological feature: 'id/proper=true'. Should be of the form attribute=value and must start with [A-Z0-9] and only contain [A-Za-z0-9]. [Line 14]: Unknown UD DEPREL: pobj [Line 15]: Spurious morphological feature: 'id/proper=true'. Should be of the form attribute=value and must start with [A-Z0-9] and only contain [A-Za-z0-9]. [Line 15]: Unknown UD DEPREL: nn [Line 16]: Spurious morphological feature: 'id/proper=false'. Should be of the form attribute=value and must start with [A-Z0-9] and only contain [A-Za-z0-9]. [Line 16]: Unknown UD DEPREL: neg [Line 17]: Spurious morphological feature: 'id/proper=false'. Should be of the form attribute=value and must start with [A-Z0-9] and only contain [A-Za-z0-9]. [Line 18]: Spurious morphological feature: 'id/proper=false'. Should be of the form attribute=value and must start with [A-Z0-9] and only contain [A-Za-z0-9]. ...suppressing further errors regarding Morpho [Line 18]: Unknown UD DEPREL: pref [Line 20]: Unknown UD DEPREL: prep [Line 21]: Unknown UD DEPREL: pobj [Line 22]: Unknown UD DEPREL: p [Line 24]: Unknown UD DEPREL: pref [Line 25]: Unknown UD DEPREL: nn [Line 26]: Unknown UD DEPREL: suff [Line 28]: Unknown UD DEPREL: rcmod [Line 29]: Unknown UD DEPREL: neg ...suppressing further errors regarding Syntax [Line 51]: Missing the sent_id attribute. [Line 51]: Missing the text attribute. [Line 73]: Missing the sent_id attribute. [Line 73]: Missing the text attribute. [Line 118]: Missing the sent_id attribute. [Line 118]: Missing the text attribute. [Line 164]: Missing the sent_id attribute. [Line 164]: Missing the text attribute. [Line 180]: Missing the sent_id attribute. [Line 180]: Missing the text attribute. [Line 202]: Missing the sent_id attribute. [Line 202]: Missing the text attribute. [Line 214]: Missing the sent_id attribute. [Line 214]: Missing the text attribute. [Line 262]: Missing the sent_id attribute. [Line 262]: Missing the text attribute. [Line 292]: Missing the sent_id attribute. [Line 292]: Missing the text attribute. [Line 305]: Missing the sent_id attribute. ...suppressing further errors regarding Metadata *** FAILED *** with 54306 errors Metadata errors: 2000 Morpho errors: 31778 Syntax errors: 20528 ******************
UD Irish
ga
PASS
python tools/validate.py --lang ga UD-dev-branches/UD_Irish/ga-ud-dev.conllu *** PASSED *** ****************** python tools/validate.py --lang ga UD-dev-branches/UD_Irish/ga-ud-test.conllu *** PASSED *** ****************** python tools/validate.py --lang ga UD-dev-branches/UD_Irish/ga-ud-train.conllu *** PASSED *** ******************
UD Italian
it
PASS
python tools/validate.py --lang it UD-dev-branches/UD_Italian/it-ud-dev.conllu *** PASSED *** ****************** python tools/validate.py --lang it UD-dev-branches/UD_Italian/it-ud-train.conllu *** PASSED *** ******************
UD Italian-PUD
it pud
PASS
python tools/validate.py --lang it_pud UD-dev-branches/UD_Italian-PUD/it_pud-ud-test.conllu *** PASSED *** ******************
UD Italian-ParTUT
it partut
PASS
python tools/validate.py --lang it_partut UD-dev-branches/UD_Italian-ParTUT/it_partut-ud-dev.conllu *** PASSED *** ****************** python tools/validate.py --lang it_partut UD-dev-branches/UD_Italian-ParTUT/it_partut-ud-test.conllu *** PASSED *** ****************** python tools/validate.py --lang it_partut UD-dev-branches/UD_Italian-ParTUT/it_partut-ud-train.conllu *** PASSED *** ******************
UD Italian-PoSTWITA
it postwita
PASS
python tools/validate.py --lang it_postwita UD-dev-branches/UD_Italian-PoSTWITA/it_postwita-ud-dev.conllu *** PASSED *** ****************** python tools/validate.py --lang it_postwita UD-dev-branches/UD_Italian-PoSTWITA/it_postwita-ud-test.conllu *** PASSED *** ****************** python tools/validate.py --lang it_postwita UD-dev-branches/UD_Italian-PoSTWITA/it_postwita-ud-train.conllu *** PASSED *** ******************
UD Japanese
ja
PASS
python tools/validate.py --lang ja UD-dev-branches/UD_Japanese/ja-ud-dev.conllu *** PASSED *** ****************** python tools/validate.py --lang ja UD-dev-branches/UD_Japanese/ja-ud-test.conllu *** PASSED *** ****************** python tools/validate.py --lang ja UD-dev-branches/UD_Japanese/ja-ud-train.conllu *** PASSED *** ******************
UD Japanese-KTC
ja ktc
FAIL
python tools/validate.py --lang ja_ktc UD-dev-branches/UD_Japanese-KTC/ja_ktc-ud-dev.conllu [Line 10]: Missing the sent_id attribute. [Line 10]: Missing the text attribute. [Line 14]: Unknown UPOS tag: CONJ [Line 16]: Unknown UD DEPREL: name [Line 23]: Unknown UD DEPREL: dobj [Line 32]: Unknown UD DEPREL: dobj [Line 39]: Unknown UD DEPREL: dobj [Line 47]: Unknown UD DEPREL: dobj [Line 54]: Missing the sent_id attribute. [Line 54]: Missing the text attribute. [Line 62]: Unknown UPOS tag: CONJ [Line 67]: Unknown UD DEPREL: dobj [Line 80]: Unknown UD DEPREL: mwe [Line 81]: Unknown UD DEPREL: mwe [Line 95]: Unknown UD DEPREL: dobj [Line 109]: Unknown UD DEPREL: dobj [Line 122]: Unknown UD DEPREL: auxpass [Line 124]: Missing the sent_id attribute. [Line 124]: Missing the text attribute. [Line 151]: Missing the sent_id attribute. [Line 151]: Missing the text attribute. [Line 161]: Unknown UD DEPREL: dobj [Line 167]: Missing the sent_id attribute. [Line 167]: Missing the text attribute. [Line 169]: Unknown UPOS tag: CONJ [Line 191]: Unknown UD DEPREL: dobj [Line 197]: Missing the sent_id attribute. [Line 197]: Missing the text attribute. [Line 237]: Unknown UD DEPREL: dobj [Line 247]: Missing the sent_id attribute. [Line 247]: Missing the text attribute. [Line 263]: Unknown UD DEPREL: dobj [Line 269]: Unknown UD DEPREL: dobj [Line 283]: Unknown UD DEPREL: mwe [Line 290]: Unknown UD DEPREL: dobj [Line 298]: Missing the sent_id attribute. [Line 298]: Missing the text attribute. [Line 300]: Unknown UPOS tag: CONJ [Line 306]: Unknown UD DEPREL: mwe ...suppressing further errors regarding Syntax [Line 334]: Missing the sent_id attribute. [Line 334]: Missing the text attribute. [Line 398]: Missing the sent_id attribute. ...suppressing further errors regarding Metadata [Line 420]: Unknown UPOS tag: CONJ [Line 439]: Unknown UPOS tag: CONJ [Line 445]: Unknown UPOS tag: CONJ [Line 541]: Unknown UPOS tag: CONJ [Line 583]: Unknown UPOS tag: CONJ [Line 589]: Unknown UPOS tag: CONJ [Line 634]: Unknown UPOS tag: CONJ [Line 896]: Unknown UPOS tag: CONJ [Line 954]: Unknown UPOS tag: CONJ [Line 1058]: Unknown UPOS tag: CONJ [Line 1310]: Unknown UPOS tag: CONJ [Line 1359]: Unknown UPOS tag: CONJ [Line 1382]: Unknown UPOS tag: CONJ [Line 1401]: Unknown UPOS tag: CONJ [Line 1539]: Unknown UPOS tag: CONJ ...suppressing further errors regarding Morpho *** FAILED *** with 4474 errors Metadata errors: 2238 Morpho errors: 283 Syntax errors: 1953 ****************** python tools/validate.py --lang ja_ktc UD-dev-branches/UD_Japanese-KTC/ja_ktc-ud-train.conllu [Line 3]: Unknown UD DEPREL: name [Line 34]: Unknown UD DEPREL: mwe [Line 35]: Unknown UD DEPREL: mwe [Line 39]: Unknown UD DEPREL: dobj [Line 46]: Unknown UD DEPREL: neg [Line 70]: Unknown UD DEPREL: neg [Line 73]: Unknown UD DEPREL: dobj [Line 78]: Missing the sent_id attribute. [Line 78]: Missing the text attribute. [Line 80]: Unknown UPOS tag: CONJ [Line 101]: Unknown UD DEPREL: dobj [Line 116]: Unknown UD DEPREL: dobj [Line 124]: Missing the sent_id attribute. [Line 124]: Missing the text attribute. [Line 146]: Unknown UD DEPREL: dobj [Line 162]: Missing the sent_id attribute. [Line 162]: Missing the text attribute. [Line 179]: Missing the sent_id attribute. [Line 179]: Missing the text attribute. [Line 193]: Unknown UPOS tag: CONJ [Line 194]: Unknown UD DEPREL: dobj [Line 209]: Missing the sent_id attribute. [Line 209]: Missing the text attribute. [Line 222]: Unknown UD DEPREL: dobj [Line 229]: Missing the sent_id attribute. [Line 229]: Missing the text attribute. [Line 231]: Unknown UPOS tag: CONJ [Line 245]: Unknown UD DEPREL: dobj [Line 263]: Unknown UD DEPREL: dobj [Line 271]: Missing the sent_id attribute. [Line 271]: Missing the text attribute. [Line 273]: Unknown UPOS tag: CONJ [Line 300]: Missing the sent_id attribute. [Line 300]: Missing the text attribute. [Line 316]: Unknown UD DEPREL: dobj [Line 326]: Unknown UD DEPREL: dobj [Line 332]: Missing the sent_id attribute. [Line 332]: Missing the text attribute. [Line 335]: Unknown UD DEPREL: name [Line 348]: Missing the sent_id attribute. ...suppressing further errors regarding Metadata [Line 380]: Unknown UD DEPREL: mwe [Line 381]: Unknown UD DEPREL: mwe ...suppressing further errors regarding Syntax [Line 455]: Unknown UPOS tag: CONJ [Line 478]: Unknown UPOS tag: CONJ [Line 726]: Unknown UPOS tag: CONJ [Line 1056]: Unknown UPOS tag: CONJ [Line 1291]: Unknown UPOS tag: CONJ [Line 1483]: Unknown UPOS tag: CONJ [Line 1557]: Unknown UPOS tag: CONJ [Line 1572]: Unknown UPOS tag: CONJ [Line 1861]: Unknown UPOS tag: CONJ [Line 1870]: Unknown UPOS tag: CONJ [Line 1893]: Unknown UPOS tag: CONJ [Line 1903]: Unknown UPOS tag: CONJ [Line 1917]: Unknown UPOS tag: CONJ [Line 1964]: Unknown UPOS tag: CONJ [Line 2022]: Unknown UPOS tag: CONJ ...suppressing further errors regarding Morpho *** FAILED *** with 23622 errors Metadata errors: 12078 Morpho errors: 1694 Syntax errors: 9850 ******************
UD Japanese-PUD
ja pud
PASS
python tools/validate.py --lang ja_pud UD-dev-branches/UD_Japanese-PUD/ja_pud-ud-test.conllu *** PASSED *** ******************
UD Kazakh
kk
PASS
python tools/validate.py --lang kk UD-dev-branches/UD_Kazakh/kk-ud-dev.conllu *** PASSED *** ****************** python tools/validate.py --lang kk UD-dev-branches/UD_Kazakh/kk-ud-test.conllu *** PASSED *** ******************
UD Korean
ko
PASS
python tools/validate.py --lang ko UD-dev-branches/UD_Korean/ko-ud-dev.conllu *** PASSED *** ****************** python tools/validate.py --lang ko UD-dev-branches/UD_Korean/ko-ud-train.conllu *** PASSED *** ******************
UD Korean-PUD
ko pud
FAIL
python tools/validate.py --lang ko_pud UD-dev-branches/UD_Korean-PUD/ko_pud-ud-test.conllu [Line 3]: Spurious morphological feature: 'ko/proper=true'. Should be of the form attribute=value and must start with [A-Z0-9] and only contain [A-Za-z0-9]. [Line 3]: Unknown UD DEPREL: nn [Line 4]: Spurious morphological feature: 'ko/proper=true'. Should be of the form attribute=value and must start with [A-Z0-9] and only contain [A-Za-z0-9]. [Line 4]: Unknown UD DEPREL: nn [Line 5]: Spurious morphological feature: 'ko/proper=true'. Should be of the form attribute=value and must start with [A-Z0-9] and only contain [A-Za-z0-9]. [Line 5]: Unknown UD DEPREL: nn [Line 6]: Spurious morphological feature: 'ko/proper=false'. Should be of the form attribute=value and must start with [A-Z0-9] and only contain [A-Za-z0-9]. [Line 6]: Unknown UD DEPREL: nn [Line 7]: Spurious morphological feature: 'ko/proper=false'. Should be of the form attribute=value and must start with [A-Z0-9] and only contain [A-Za-z0-9]. [Line 7]: Unknown UD DEPREL: nn [Line 8]: Spurious morphological feature: 'ko/proper=false'. Should be of the form attribute=value and must start with [A-Z0-9] and only contain [A-Za-z0-9]. [Line 9]: Morphological features must be sorted: 'ko/case=nom|ko/proper=false|ko/formality=fml' [Line 9]: Spurious morphological feature: 'ko/case=nom'. Should be of the form attribute=value and must start with [A-Z0-9] and only contain [A-Za-z0-9]. [Line 9]: Spurious morphological feature: 'ko/proper=false'. Should be of the form attribute=value and must start with [A-Z0-9] and only contain [A-Za-z0-9]. [Line 9]: Spurious morphological feature: 'ko/formality=fml'. Should be of the form attribute=value and must start with [A-Z0-9] and only contain [A-Za-z0-9]. [Line 9]: Unknown UPOS tag: PRT [Line 9]: Unknown UD DEPREL: prt [Line 10]: Spurious morphological feature: 'ko/proper=false'. Should be of the form attribute=value and must start with [A-Z0-9] and only contain [A-Za-z0-9]. [Line 10]: Unknown UD DEPREL: tmod [Line 11]: Spurious morphological feature: 'ko/proper=false'. Should be of the form attribute=value and must start with [A-Z0-9] and only contain [A-Za-z0-9]. [Line 11]: Unknown UD DEPREL: p [Line 12]: Spurious morphological feature: 'ko/proper=true'. Should be of the form attribute=value and must start with [A-Z0-9] and only contain [A-Za-z0-9]. [Line 13]: Morphological features must be sorted: 'ko/case=advb|ko/proper=false|ko/formality=fml' [Line 13]: Spurious morphological feature: 'ko/case=advb'. Should be of the form attribute=value and must start with [A-Z0-9] and only contain [A-Za-z0-9]. [Line 13]: Spurious morphological feature: 'ko/proper=false'. Should be of the form attribute=value and must start with [A-Z0-9] and only contain [A-Za-z0-9]. [Line 13]: Spurious morphological feature: 'ko/formality=fml'. Should be of the form attribute=value and must start with [A-Z0-9] and only contain [A-Za-z0-9]. [Line 13]: Unknown UPOS tag: PRT [Line 13]: Unknown UD DEPREL: prt ...suppressing further errors regarding Morpho [Line 14]: Unknown UD DEPREL: nn [Line 15]: Unknown UD DEPREL: poss [Line 16]: Unknown UD DEPREL: prt [Line 18]: Unknown UD DEPREL: prt [Line 19]: Unknown UD DEPREL: dobj [Line 20]: Unknown UD DEPREL: prt [Line 22]: Unknown UD DEPREL: rcmod [Line 23]: Unknown UD DEPREL: attr [Line 26]: Unknown UD DEPREL: suff [Line 27]: Unknown UD DEPREL: nn ...suppressing further errors regarding Syntax [Line 40]: Missing the sent_id attribute. [Line 40]: Missing the text attribute. [Line 59]: Missing the sent_id attribute. [Line 59]: Missing the text attribute. [Line 113]: Missing the sent_id attribute. [Line 113]: Missing the text attribute. [Line 147]: Missing the sent_id attribute. [Line 147]: Missing the text attribute. [Line 163]: Missing the sent_id attribute. [Line 163]: Missing the text attribute. [Line 184]: Missing the sent_id attribute. [Line 184]: Missing the text attribute. [Line 196]: Missing the sent_id attribute. [Line 196]: Missing the text attribute. [Line 249]: Missing the sent_id attribute. [Line 249]: Missing the text attribute. [Line 281]: Missing the sent_id attribute. [Line 281]: Missing the text attribute. [Line 293]: Missing the sent_id attribute. ...suppressing further errors regarding Metadata *** FAILED *** with 76447 errors Metadata errors: 2000 Morpho errors: 56121 Syntax errors: 18326 ******************
UD Kurmanji
kmr
PASS
python tools/validate.py --lang kmr UD-dev-branches/UD_Kurmanji/kmr-ud-sample.conllu *** PASSED *** ****************** python tools/validate.py --lang kmr UD-dev-branches/UD_Kurmanji/kmr-ud-test.conllu *** PASSED *** ******************
UD Latin
la
PASS
python tools/validate.py --lang la UD-dev-branches/UD_Latin/la-ud-dev.conllu *** PASSED *** ****************** python tools/validate.py --lang la UD-dev-branches/UD_Latin/la-ud-test.conllu *** PASSED *** ****************** python tools/validate.py --lang la UD-dev-branches/UD_Latin/la-ud-train.conllu *** PASSED *** ******************
UD Latin-ITTB
la ittb
PASS
python tools/validate.py --lang la_ittb UD-dev-branches/UD_Latin-ITTB/la_ittb-ud-dev.conllu *** PASSED *** ****************** python tools/validate.py --lang la_ittb UD-dev-branches/UD_Latin-ITTB/la_ittb-ud-test.conllu *** PASSED *** ****************** python tools/validate.py --lang la_ittb UD-dev-branches/UD_Latin-ITTB/la_ittb-ud-train.conllu *** PASSED *** ******************
UD Latin-PROIEL
la proiel
PASS
python tools/validate.py --lang la_proiel UD-dev-branches/UD_Latin-PROIEL/la_proiel-ud-dev.conllu *** PASSED *** ****************** python tools/validate.py --lang la_proiel UD-dev-branches/UD_Latin-PROIEL/la_proiel-ud-test.conllu *** PASSED *** ****************** python tools/validate.py --lang la_proiel UD-dev-branches/UD_Latin-PROIEL/la_proiel-ud-train.conllu *** PASSED *** ******************
UD Latvian
lv
PASS
python tools/validate.py --lang lv UD-dev-branches/UD_Latvian/lv-ud-dev.conllu *** PASSED *** ****************** python tools/validate.py --lang lv UD-dev-branches/UD_Latvian/lv-ud-test.conllu *** PASSED *** ****************** python tools/validate.py --lang lv UD-dev-branches/UD_Latvian/lv-ud-train.conllu *** PASSED *** ******************
UD Lithuanian
lt
PASS
python tools/validate.py --lang lt UD-dev-branches/UD_Lithuanian/lt-ud-dev.conllu *** PASSED *** ****************** python tools/validate.py --lang lt UD-dev-branches/UD_Lithuanian/lt-ud-test.conllu *** PASSED *** ****************** python tools/validate.py --lang lt UD-dev-branches/UD_Lithuanian/lt-ud-train.conllu *** PASSED *** ******************
UD Lithuanian-Alksnis
lt alksnis
EMPTY
No data
UD Maltese
mt
FAIL
python tools/validate.py --lang mt UD-dev-branches/UD_Maltese/mt-ud-sample.conllu [Line 296]: DEPREL must be "root" if HEAD is 0 [Line 303]: Unknown attribute-value pair NumType=Gen [Line 312]: Unknown attribute-value pair NumType=Gen [Line 557]: DEPREL must be "root" if HEAD is 0 [Line 792]: DEPREL must be "root" if HEAD is 0 [Line 952]: Unknown attribute-value pair NumType=Gen [Line 1670]: Unknown UD DEPREL: punc [Line 1940]: DEPREL can only be "root" if HEAD is 0 [Line 2205]: Unknown attribute-value pair NumType=Gen [Line 2239]: Unknown attribute-value pair NumType=Gen [Line 2694]: Unknown attribute-value pair NumType=Gen *** FAILED *** with 11 errors Morpho errors: 6 Syntax errors: 5 ******************
UD Marathi
mr
PASS
python tools/validate.py --lang mr UD-dev-branches/UD_Marathi/mr-ud-dev.conllu *** PASSED *** ****************** python tools/validate.py --lang mr UD-dev-branches/UD_Marathi/mr-ud-test.conllu *** PASSED *** ****************** python tools/validate.py --lang mr UD-dev-branches/UD_Marathi/mr-ud-train.conllu *** PASSED *** ******************
UD Naija
pcm
EMPTY
No data
UD North Sami
sme
PASS
python tools/validate.py --lang sme UD-dev-branches/UD_North_Sami/sme-ud-rest.conllu *** PASSED *** ****************** python tools/validate.py --lang sme UD-dev-branches/UD_North_Sami/sme-ud-sample.conllu *** PASSED *** ****************** python tools/validate.py --lang sme UD-dev-branches/UD_North_Sami/sme-ud-test.conllu *** PASSED *** ****************** python tools/validate.py --lang sme UD-dev-branches/UD_North_Sami/sme-ud.conllu *** PASSED *** ******************
UD Norwegian-Bokmaal
no bokmaal
PASS
python tools/validate.py --lang no_bokmaal UD-dev-branches/UD_Norwegian-Bokmaal/no_bokmaal-ud-dev.conllu *** PASSED *** ****************** python tools/validate.py --lang no_bokmaal UD-dev-branches/UD_Norwegian-Bokmaal/no_bokmaal-ud-test.conllu *** PASSED *** ****************** python tools/validate.py --lang no_bokmaal UD-dev-branches/UD_Norwegian-Bokmaal/no_bokmaal-ud-train.conllu *** PASSED *** ******************
UD Norwegian-Nynorsk
no nynorsk
PASS
python tools/validate.py --lang no_nynorsk UD-dev-branches/UD_Norwegian-Nynorsk/no_nynorsk-ud-dev.conllu *** PASSED *** ****************** python tools/validate.py --lang no_nynorsk UD-dev-branches/UD_Norwegian-Nynorsk/no_nynorsk-ud-test.conllu *** PASSED *** ****************** python tools/validate.py --lang no_nynorsk UD-dev-branches/UD_Norwegian-Nynorsk/no_nynorsk-ud-train.conllu *** PASSED *** ******************
UD Norwegian-NynorskLIA
no nynorsklia
PASS
python tools/validate.py --lang no_nynorsklia UD-dev-branches/UD_Norwegian-NynorskLIA/no_nynorsklia-ud-dev.conllu *** PASSED *** ****************** python tools/validate.py --lang no_nynorsklia UD-dev-branches/UD_Norwegian-NynorskLIA/no_nynorsklia-ud-test.conllu *** PASSED *** ******************
UD Old Church Slavonic
cu
PASS
python tools/validate.py --lang cu UD-dev-branches/UD_Old_Church_Slavonic/cu-ud-dev.conllu *** PASSED *** ****************** python tools/validate.py --lang cu UD-dev-branches/UD_Old_Church_Slavonic/cu-ud-test.conllu *** PASSED *** ****************** python tools/validate.py --lang cu UD-dev-branches/UD_Old_Church_Slavonic/cu-ud-train.conllu *** PASSED *** ******************
UD Old French
fro
EMPTY
No data
UD Persian
fa
PASS
python tools/validate.py --lang fa UD-dev-branches/UD_Persian/fa-ud-dev.conllu *** PASSED *** ****************** python tools/validate.py --lang fa UD-dev-branches/UD_Persian/fa-ud-test.conllu *** PASSED *** ****************** python tools/validate.py --lang fa UD-dev-branches/UD_Persian/fa-ud-train.conllu *** PASSED *** ******************
UD Polish
pl
PASS
python tools/validate.py --lang pl UD-dev-branches/UD_Polish/pl-ud-dev.conllu *** PASSED *** ****************** python tools/validate.py --lang pl UD-dev-branches/UD_Polish/pl-ud-test.conllu *** PASSED *** ****************** python tools/validate.py --lang pl UD-dev-branches/UD_Polish/pl-ud-train.conllu *** PASSED *** ******************
UD Portuguese
pt
PASS
python tools/validate.py --lang pt UD-dev-branches/UD_Portuguese/pt-ud-dev.conllu *** PASSED *** ****************** python tools/validate.py --lang pt UD-dev-branches/UD_Portuguese/pt-ud-test.conllu *** PASSED *** ****************** python tools/validate.py --lang pt UD-dev-branches/UD_Portuguese/pt-ud-train.conllu *** PASSED *** ******************
UD Portuguese-BR
pt br
PASS
python tools/validate.py --lang pt_br UD-dev-branches/UD_Portuguese-BR/pt_br-ud-dev.conllu *** PASSED *** ****************** python tools/validate.py --lang pt_br UD-dev-branches/UD_Portuguese-BR/pt_br-ud-test.conllu *** PASSED *** ****************** python tools/validate.py --lang pt_br UD-dev-branches/UD_Portuguese-BR/pt_br-ud-train.conllu *** PASSED *** ******************
UD Portuguese-PUD
pt pud
PASS
python tools/validate.py --lang pt_pud UD-dev-branches/UD_Portuguese-PUD/pt_pud-ud-test.conllu *** PASSED *** ******************
UD Romanian
ro
PASS
python tools/validate.py --lang ro UD-dev-branches/UD_Romanian/ro-ud-dev.conllu *** PASSED *** ****************** python tools/validate.py --lang ro UD-dev-branches/UD_Romanian/ro-ud-test.conllu *** PASSED *** ****************** python tools/validate.py --lang ro UD-dev-branches/UD_Romanian/ro-ud-train.conllu *** PASSED *** ******************
UD Romanian-Nonstandard
ro nonstandard
FAIL
python tools/validate.py --lang ro_nonstandard UD-dev-branches/UD_Romanian-Nonstandard/ro_nonstandard-ud-test.conlluTraceback (most recent call last): File "tools/validate.py", line 735, in <module> validate(inp,out,args,tagsets,known_sent_ids) File "tools/validate.py", line 613, in validate for comments,tree in trees(inp,tag_sets,args): File "tools/validate.py", line 96, in trees validate_cols(cols,tag_sets,args) File "tools/validate.py", line 211, in validate_cols validate_character_constraints(cols) File "tools/validate.py", line 382, in validate_character_constraints if any(deprel for head, deprel in deps_list(cols) TypeError: 'NoneType' object is not iterable [Line 7]: Morphological features must be sorted: 'Case=Acc,Nom|Definite=Def|Degree=Pos|Gender=Fem|' [Line 7]: Spurious morphological feature: ''. Should be of the form attribute=value and must start with [A-Z0-9] and only contain [A-Za-z0-9]. [Line 10]: Missing the sent_id attribute. [Line 10]: Missing the text attribute. [Line 21]: Morphological features must be sorted: 'Gender=Masc|Number=Sing|Mood=Part|Polarity=Pos|VerbForm=Fin' [Line 21]: Unknown attribute-value pair Mood=Part [Line 25]: Missing the sent_id attribute. [Line 25]: Missing the text attribute. [Line 42]: Missing the sent_id attribute. [Line 42]: Missing the text attribute. [Line 63]: Unknown UPOS tag: _ [Line 63]: Invalid UPOSTAG value _ [Line 65]: Missing the sent_id attribute. [Line 65]: Missing the text attribute. [Line 74]: Unknown attribute-value pair Compound=Yes [Line 88]: Missing the sent_id attribute. [Line 88]: Missing the text attribute. [Line 106]: Missing the sent_id attribute. [Line 106]: Missing the text attribute. [Line 112]: Unknown attribute-value pair Compound=Yes [Line 120]: Unknown attribute-value pair Compound=Yes [Line 128]: Missing the sent_id attribute. [Line 128]: Missing the text attribute. [Line 143]: Unknown attribute-value pair Compound=Yes [Line 147]: Unknown attribute-value pair Mood=Part [Line 148]: Morphological features must be sorted: 'Case=Acc,Nom|Gender=Fem|Number=Plur|Poss=Yes|' [Line 148]: Spurious morphological feature: ''. Should be of the form attribute=value and must start with [A-Z0-9] and only contain [A-Za-z0-9]. [Line 151]: Missing the sent_id attribute. [Line 151]: Missing the text attribute. [Line 169]: Missing the sent_id attribute. [Line 169]: Missing the text attribute. [Line 187]: Missing the sent_id attribute. ...suppressing further errors regarding Metadata [Line 241]: Unknown attribute-value pair Compound=Yes [Line 322]: Unknown attribute-value pair Compound=Yes [Line 328]: Unknown attribute-value pair PronType=Refl [Line 335]: The line has 11 columns, but 10 are expected. [Line 335]: Empty value in column HEAD [Line 335]: Invalid DEPREL value 5 [Line 335]: Failed for parse DEPS: discourse [Line 335]: Unknown UD DEPREL: 5 [Line 335]: Malformed head:deprel pair 'discourse' [Line 346]: Unknown UPOS tag: _ [Line 346]: Invalid UPOSTAG value _ [Line 347]: Unknown UPOS tag: _ ...suppressing further errors regarding Morpho [Line 349]: Undefined ID in HEAD: [Line 349]: Failed for parse DEPS: discourse [Line 349]: Failed to parse DEPS: discourse [Tree number 19 on line 334]: Non-integer head for word ID 2 [Tree number 19 on line 334]: Non-tree structure. Words 1,2 are not reachable from the root 0. [Line 369]: Trailing whitespace not allowed in column FORM [Line 369]: 'pînă ' in column FORM is not on the list of exceptions allowed to contain whitespace (data/tokens_w_space.ud and data/tokens_w_space.LANG files). [Line 416]: Trailing whitespace not allowed in column XPOSTAG [Line 416]: White space not allowed in the XPOSTAG column: 'Pp3fso ' [Line 424]: Trailing whitespace not allowed in column XPOSTAG [Line 424]: White space not allowed in the XPOSTAG column: 'Pp3fsa--------w ' [Line 429]: Trailing whitespace not allowed in column XPOSTAG [Line 429]: White space not allowed in the XPOSTAG column: 'Pp3fsa--------w ' [Line 436]: The line has 11 columns, but 10 are expected. [Line 436]: Empty value in column HEAD [Line 436]: Invalid DEPREL value 11 [Line 436]: Failed for parse DEPS: cc [Line 436]: Unknown UD DEPREL: 11 [Line 436]: Malformed head:deprel pair 'cc' [Line 438]: Trailing whitespace not allowed in column XPOSTAG [Line 438]: White space not allowed in the XPOSTAG column: 'Pp3msr ' [Line 444]: Trailing whitespace not allowed in column XPOSTAG ...suppressing further errors regarding Format [Line 471]: Invalid DEPREL value 37 [Line 471]: Failed for parse DEPS: nsubj [Line 471]: Unknown UD DEPREL: 37 [Line 471]: Malformed head:deprel pair 'nsubj' [Tree number 23 on line 436]: Non-tree structure. Words 1,36 are not reachable from the root 0. [Line 518]: Invalid DEPREL value 9 [Line 518]: Failed for parse DEPS: obj [Line 518]: Unknown UD DEPREL: 9 [Line 518]: Malformed head:deprel pair 'obj' [Tree number 25 on line 511]: Non-tree structure. Words 8 are not reachable from the root 0. ...suppressing further errors regarding Syntax *** FAILED *** with 8162 errors Format errors: 2979 Metadata errors: 1348 Morpho errors: 2065 Syntax errors: 1770 The language-specific file /home/ginter/UD_PROJHOOK/tools/data/tokens_w_space.ro_nonstandard does not exist. ******************
UD Romansh
rm
EMPTY
No data
UD Romansh-Sursilv
rm sursilv
EMPTY
No data
UD Russian
ru
PASS
python tools/validate.py --lang ru UD-dev-branches/UD_Russian/ru-ud-dev.conllu *** PASSED *** ****************** python tools/validate.py --lang ru UD-dev-branches/UD_Russian/ru-ud-test.conllu *** PASSED *** ****************** python tools/validate.py --lang ru UD-dev-branches/UD_Russian/ru-ud-train.conllu *** PASSED *** ******************
UD Russian-PUD
ru pud
PASS
python tools/validate.py --lang ru_pud UD-dev-branches/UD_Russian-PUD/ru_pud-ud-test.conllu *** PASSED *** ******************
UD Russian-SynTagRus
ru syntagrus
PASS
python tools/validate.py --lang ru_syntagrus UD-dev-branches/UD_Russian-SynTagRus/ru_syntagrus-ud-dev.conllu *** PASSED *** ****************** python tools/validate.py --lang ru_syntagrus UD-dev-branches/UD_Russian-SynTagRus/ru_syntagrus-ud-test.conllu *** PASSED *** ****************** python tools/validate.py --lang ru_syntagrus UD-dev-branches/UD_Russian-SynTagRus/ru_syntagrus-ud-train.conllu *** PASSED *** ******************
UD Sanskrit
sa
PASS
python /home/ginter/UD_PROJHOOK/tools/validate.py --lang sa /home/ginter/UD_PROJHOOK/UD-dev-branches/UD_Sanskrit/sa-ud-test.conllu *** PASSED *** ******************
UD Serbian
sr
PASS
python tools/validate.py --lang sr UD-dev-branches/UD_Serbian/sr-ud-train.conllu *** PASSED *** ******************
UD Slovak
sk
PASS
python /home/ginter/UD_PROJHOOK/tools/validate.py --lang sk /home/ginter/UD_PROJHOOK/UD-dev-branches/UD_Slovak/sk-ud-dev.conllu *** PASSED *** ****************** python /home/ginter/UD_PROJHOOK/tools/validate.py --lang sk /home/ginter/UD_PROJHOOK/UD-dev-branches/UD_Slovak/sk-ud-test.conllu *** PASSED *** ****************** python /home/ginter/UD_PROJHOOK/tools/validate.py --lang sk /home/ginter/UD_PROJHOOK/UD-dev-branches/UD_Slovak/sk-ud-train.conllu *** PASSED *** ******************
UD Slovenian
sl
PASS
python tools/validate.py --lang sl UD-dev-branches/UD_Slovenian/sl-ud-dev.conllu *** PASSED *** ****************** python tools/validate.py --lang sl UD-dev-branches/UD_Slovenian/sl-ud-test.conllu *** PASSED *** ****************** python tools/validate.py --lang sl UD-dev-branches/UD_Slovenian/sl-ud-train.conllu *** PASSED *** ******************
UD Slovenian-SST
sl sst
PASS
python tools/validate.py --lang sl_sst UD-dev-branches/UD_Slovenian-SST/sl_sst-ud-dev.conllu *** PASSED *** ****************** python tools/validate.py --lang sl_sst UD-dev-branches/UD_Slovenian-SST/sl_sst-ud-test.conllu *** PASSED *** ****************** python tools/validate.py --lang sl_sst UD-dev-branches/UD_Slovenian-SST/sl_sst-ud-train.conllu *** PASSED *** ******************
UD Somali
so
EMPTY
No data
UD Sorani
ckb
EMPTY
No data
UD Spanish
es
PASS
python tools/validate.py --lang es UD-dev-branches/UD_Spanish/es-ud-dev.conllu *** PASSED *** ****************** python tools/validate.py --lang es UD-dev-branches/UD_Spanish/es-ud-test.conllu *** PASSED *** ****************** python tools/validate.py --lang es UD-dev-branches/UD_Spanish/es-ud-train.conllu *** PASSED *** ******************
UD Spanish-AnCora
es ancora
PASS
python tools/validate.py --lang es_ancora UD-dev-branches/UD_Spanish-AnCora/es_ancora-ud-dev.conllu *** PASSED *** ****************** python tools/validate.py --lang es_ancora UD-dev-branches/UD_Spanish-AnCora/es_ancora-ud-test.conllu *** PASSED *** ****************** python tools/validate.py --lang es_ancora UD-dev-branches/UD_Spanish-AnCora/es_ancora-ud-train.conllu *** PASSED *** ******************
UD Spanish-PUD
es pud
PASS
python tools/validate.py --lang es_pud UD-dev-branches/UD_Spanish-PUD/es_pud-ud-test.conllu *** PASSED *** ******************
UD Swedish
sv
PASS
python tools/validate.py --lang sv UD-dev-branches/UD_Swedish/sv-ud-dev.conllu *** PASSED *** ****************** python tools/validate.py --lang sv UD-dev-branches/UD_Swedish/sv-ud-test.conllu *** PASSED *** ****************** python tools/validate.py --lang sv UD-dev-branches/UD_Swedish/sv-ud-train.conllu *** PASSED *** ******************
UD Swedish-LinES
sv lines
PASS
python tools/validate.py --lang sv_lines UD-dev-branches/UD_Swedish-LinES/sv_lines-ud-dev.conllu *** PASSED *** ****************** python tools/validate.py --lang sv_lines UD-dev-branches/UD_Swedish-LinES/sv_lines-ud-test.conllu *** PASSED *** ****************** python tools/validate.py --lang sv_lines UD-dev-branches/UD_Swedish-LinES/sv_lines-ud-train.conllu *** PASSED *** ******************
UD Swedish-PUD
sv pud
PASS
python tools/validate.py --lang sv_pud UD-dev-branches/UD_Swedish-PUD/sv_pud-ud-test.conllu *** PASSED *** ******************
UD Swedish Sign Language
swl
PASS
python /home/ginter/UD_PROJHOOK/tools/validate.py --lang swl /home/ginter/UD_PROJHOOK/UD-dev-branches/UD_Swedish_Sign_Language/swl-ud-dev.conllu *** PASSED *** ****************** python /home/ginter/UD_PROJHOOK/tools/validate.py --lang swl /home/ginter/UD_PROJHOOK/UD-dev-branches/UD_Swedish_Sign_Language/swl-ud-test.conllu *** PASSED *** ****************** python /home/ginter/UD_PROJHOOK/tools/validate.py --lang swl /home/ginter/UD_PROJHOOK/UD-dev-branches/UD_Swedish_Sign_Language/swl-ud-train.conllu *** PASSED *** ******************
UD Tamil
ta
PASS
python tools/validate.py --lang ta UD-dev-branches/UD_Tamil/ta-ud-dev.conllu *** PASSED *** ****************** python tools/validate.py --lang ta UD-dev-branches/UD_Tamil/ta-ud-test.conllu *** PASSED *** ****************** python tools/validate.py --lang ta UD-dev-branches/UD_Tamil/ta-ud-train.conllu *** PASSED *** ******************
UD Telugu
te
PASS
python tools/validate.py --lang te UD-dev-branches/UD_Telugu/dev-tescript.conllu *** PASSED *** ****************** python tools/validate.py --lang te UD-dev-branches/UD_Telugu/test-tescript.conllu *** PASSED *** ****************** python tools/validate.py --lang te UD-dev-branches/UD_Telugu/train-tescript.conllu *** PASSED *** ******************
UD Thai-PUD
th
FAIL
python tools/validate.py --lang th UD-dev-branches/UD_Thai-PUD/th_pud-ud-test.conllu [Line 3]: Spurious morphological feature: 'th/proper=false'. Should be of the form attribute=value and must start with [A-Z0-9] and only contain [A-Za-z0-9]. [Line 3]: Unknown UD DEPREL: p [Line 4]: Spurious morphological feature: 'th/proper=false'. Should be of the form attribute=value and must start with [A-Z0-9] and only contain [A-Za-z0-9]. [Line 5]: Spurious morphological feature: 'th/proper=false'. Should be of the form attribute=value and must start with [A-Z0-9] and only contain [A-Za-z0-9]. [Line 5]: Unknown UD DEPREL: mwe [Line 6]: Spurious morphological feature: 'th/proper=false'. Should be of the form attribute=value and must start with [A-Z0-9] and only contain [A-Za-z0-9]. [Line 6]: Unknown UPOS tag: AFFIX [Line 6]: Unknown UD DEPREL: pref [Line 7]: Spurious morphological feature: 'th/proper=false'. Should be of the form attribute=value and must start with [A-Z0-9] and only contain [A-Za-z0-9]. [Line 8]: Spurious morphological feature: 'th/proper=false'. Should be of the form attribute=value and must start with [A-Z0-9] and only contain [A-Za-z0-9]. [Line 8]: Unknown UPOS tag: PRT [Line 8]: Unknown UD DEPREL: prt [Line 9]: Spurious morphological feature: 'th/proper=false'. Should be of the form attribute=value and must start with [A-Z0-9] and only contain [A-Za-z0-9]. [Line 10]: Spurious morphological feature: 'th/proper=false'. Should be of the form attribute=value and must start with [A-Z0-9] and only contain [A-Za-z0-9]. [Line 10]: Unknown UD DEPREL: dobj [Line 11]: Spurious morphological feature: 'th/proper=false'. Should be of the form attribute=value and must start with [A-Z0-9] and only contain [A-Za-z0-9]. [Line 12]: Spurious morphological feature: 'th/proper=false'. Should be of the form attribute=value and must start with [A-Z0-9] and only contain [A-Za-z0-9]. [Line 13]: Spurious morphological feature: 'th/proper=false'. Should be of the form attribute=value and must start with [A-Z0-9] and only contain [A-Za-z0-9]. [Line 13]: Unknown UD DEPREL: attr [Line 14]: Spurious morphological feature: 'th/proper=false'. Should be of the form attribute=value and must start with [A-Z0-9] and only contain [A-Za-z0-9]. [Line 15]: Spurious morphological feature: 'th/proper=false'. Should be of the form attribute=value and must start with [A-Z0-9] and only contain [A-Za-z0-9]. [Line 15]: Unknown UPOS tag: PRT [Line 15]: Unknown UD DEPREL: neg [Line 16]: Spurious morphological feature: 'th/aspect=perf'. Should be of the form attribute=value and must start with [A-Z0-9] and only contain [A-Za-z0-9]. [Line 16]: Spurious morphological feature: 'th/proper=false'. Should be of the form attribute=value and must start with [A-Z0-9] and only contain [A-Za-z0-9]. [Line 16]: Unknown UPOS tag: PRT [Line 16]: Unknown UD DEPREL: asp ...suppressing further errors regarding Morpho [Line 17]: Unknown UD DEPREL: rcmod [Line 18]: Unknown UD DEPREL: asp [Line 20]: Unknown UD DEPREL: prep [Line 21]: Unknown UD DEPREL: pobj [Line 22]: Unknown UD DEPREL: p [Line 23]: Unknown UD DEPREL: pref [Line 25]: Unknown UD DEPREL: dobj [Line 26]: Unknown UD DEPREL: pref [Line 30]: Unknown UD DEPREL: neg [Line 32]: Unknown UD DEPREL: attr [Line 34]: Unknown UD DEPREL: p ...suppressing further errors regarding Syntax [Line 48]: Missing the sent_id attribute. [Line 48]: Missing the text attribute. [Line 67]: Missing the sent_id attribute. [Line 67]: Missing the text attribute. [Line 117]: Missing the sent_id attribute. [Line 117]: Missing the text attribute. [Line 155]: Missing the sent_id attribute. [Line 155]: Missing the text attribute. [Line 174]: Missing the sent_id attribute. [Line 174]: Missing the text attribute. [Line 197]: Missing the sent_id attribute. [Line 197]: Missing the text attribute. [Line 210]: Missing the sent_id attribute. [Line 210]: Missing the text attribute. [Line 249]: Missing the sent_id attribute. [Line 249]: Missing the text attribute. [Line 277]: Missing the sent_id attribute. [Line 277]: Missing the text attribute. [Line 291]: Missing the sent_id attribute. ...suppressing further errors regarding Metadata *** FAILED *** with 46893 errors Metadata errors: 2000 Morpho errors: 27316 Syntax errors: 17577 The language-specific file /home/ginter/UD_PROJHOOK/tools/data/deprel.th does not exist. ******************
UD Turkish
tr
PASS
python tools/validate.py --lang tr UD-dev-branches/UD_Turkish/tr-ud-dev.conllu *** PASSED *** ****************** python tools/validate.py --lang tr UD-dev-branches/UD_Turkish/tr-ud-test.conllu *** PASSED *** ****************** python tools/validate.py --lang tr UD-dev-branches/UD_Turkish/tr-ud-train.conllu *** PASSED *** ******************
UD Turkish-PUD
tr pud
PASS
python tools/validate.py --lang tr_pud UD-dev-branches/UD_Turkish-PUD/tr_pud-ud-test.conllu *** PASSED *** ******************
UD Ukrainian
uk
PASS
python /home/ginter/UD_PROJHOOK/tools/validate.py --lang uk /home/ginter/UD_PROJHOOK/UD-dev-branches/UD_Ukrainian/uk-ud-dev.conllu *** PASSED *** ****************** python /home/ginter/UD_PROJHOOK/tools/validate.py --lang uk /home/ginter/UD_PROJHOOK/UD-dev-branches/UD_Ukrainian/uk-ud-test.conllu *** PASSED *** ****************** python /home/ginter/UD_PROJHOOK/tools/validate.py --lang uk /home/ginter/UD_PROJHOOK/UD-dev-branches/UD_Ukrainian/uk-ud-train.conllu *** PASSED *** ******************
UD Upper Sorbian
hsb
PASS
python tools/validate.py --lang hsb UD-dev-branches/UD_Upper_Sorbian/hsb-ud-sample.conllu *** PASSED *** ****************** python tools/validate.py --lang hsb UD-dev-branches/UD_Upper_Sorbian/hsb-ud-test.conllu *** PASSED *** ******************
UD Urdu
ur
PASS
python tools/validate.py --lang ur UD-dev-branches/UD_Urdu/ur-ud-dev.conllu *** PASSED *** ****************** python tools/validate.py --lang ur UD-dev-branches/UD_Urdu/ur-ud-test.conllu *** PASSED *** ****************** python tools/validate.py --lang ur UD-dev-branches/UD_Urdu/ur-ud-train.conllu *** PASSED *** ******************
UD Uyghur
ug
PASS
python tools/validate.py --lang ug UD-dev-branches/UD_Uyghur/ug-ud-dev.conllu *** PASSED *** ****************** python tools/validate.py --lang ug UD-dev-branches/UD_Uyghur/ug-ud-test.conllu *** PASSED *** ******************
UD Vietnamese
vi
PASS
python tools/validate.py --lang vi UD-dev-branches/UD_Vietnamese/vi-ud-dev.conllu *** PASSED *** ****************** python tools/validate.py --lang vi UD-dev-branches/UD_Vietnamese/vi-ud-test.conllu *** PASSED *** ****************** python tools/validate.py --lang vi UD-dev-branches/UD_Vietnamese/vi-ud-train.conllu *** PASSED *** ******************