
This page still pertains to UD version 1.

DET: determiner

The POS tag DET is not used in Estonian UD.


Treebank Statistics (UD_Estonian)

There are 32 DET lemmas (0%), 116 DET types (1%) and 402 DET tokens (1%). Out of 16 observed tags, DET ranks 9th by number of lemmas, 9th by number of types and 13th by number of tokens.
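Counts of this kind can in principle be recomputed from a CoNLL-U file. The following is a minimal sketch, not the script that generated this page. It assumes the standard 10-column CoNLL-U layout (FORM in column 2, LEMMA in column 3, UPOS in column 4) and that "types" means distinct case-sensitive word forms. The function name `count_tag` and the two-token sample are hypothetical and are not taken from the treebank.

```python
def count_tag(conllu_text, tag):
    """Return (lemmas, types, tokens) for one UPOS tag in CoNLL-U text."""
    lemmas, types, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        # Skip blank sentence separators and comment lines.
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        # Skip multiword-token ranges (e.g. "1-2") and empty nodes (e.g. "1.1").
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        if cols[3] == tag:
            tokens += 1
            types.add(cols[1])   # FORM, kept case-sensitive (assumption)
            lemmas.add(cols[2])  # LEMMA
    return len(lemmas), len(types), tokens


# Toy two-token sample (hypothetical, for illustration only).
SAMPLE = (
    "1\tsee\tsee\tDET\t_\t_\t2\tdet\t_\t_\n"
    "2\tmaja\tmaja\tNOUN\t_\t_\t0\troot\t_\t_\n"
)
```

Calling `count_tag(SAMPLE, "DET")` on the toy sample returns one lemma, one type and one token; on the full treebank the same approach should yield figures comparable to those above, provided the assumptions hold.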

The 10 most frequent DET lemmas: see, üks, mõni, iga, kõik, teine, mingi, mitu, kogu, mis

The 10 most frequent DET types: see, üks, kõik, mitu, iga, selle, ühe, kogu, mis, mõned

The 10 most frequent ambiguous lemmas: see (PRON 375, DET 96), üks (DET 67, NUM 28, PRON 11), mõni (DET 31, PRON 6, ADJ 2), iga (DET 30, PRON 3, ADJ 1), kõik (PRON 52, DET 30, ADV 1), teine (PRON 37, DET 22, ADJ 20, NUM 1), mingi (DET 20, ADJ 7), mitu (DET 20, PRON 7), kogu (DET 11, ADJ 6, NOUN 2), mis (PRON 168, DET 10)

The 10 most frequent ambiguous types: see (PRON 95, DET 21), üks (DET 17, NUM 11, PRON 1), kõik (PRON 31, DET 14), mitu (DET 19, PRON 3), selle (PRON 33, DET 13), ühe (DET 11, NUM 7, PRON 2), kogu (DET 8, ADJ 5), mis (PRON 82, DET 5), mõned (DET 8, PRON 3, ADJ 1), need (PRON 15, DET 7)

Morphology

The form / lemma ratio of DET is 3.625000 (the average of all parts of speech is 1.545328).
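The ratio appears to be the number of DET types divided by the number of DET lemmas, which matches the counts reported in the statistics section (116 types, 32 lemmas). A small sketch of that arithmetic follows; the function name `form_lemma_ratio` is hypothetical.

```python
def form_lemma_ratio(n_types, n_lemmas):
    """Average number of distinct word forms per lemma."""
    return n_types / n_lemmas


# Counts taken from the statistics on this page.
det_ratio = form_lemma_ratio(116, 32)
print(f"{det_ratio:.6f}")  # prints 3.625000
```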

The highest number of forms (16) was observed with the lemma “see”: Seegi, need, neid, neil, neis, neisse, nende, seda, see, sel, selle, sellel, sellele, selles, sellesse, sellest.

The 2nd highest number of forms (13) was observed with the lemma “üks”: ühe, ühed, üheks, ühel, ühele, ühelt, ühes, ühest, üht, ühtegi, ühtki, üks, ükski.

The 3rd highest number of forms (9) was observed with the lemma “mingi”: mingeid, mingi, mingil, mingis, mingisse, mingist, mingit, mingite, mingitest.

DET occurs with 3 features: et-feat/PronType (402; 100% of instances), et-feat/Case (390; 97% of instances), et-feat/Number (390; 97% of instances)

DET occurs with 17 feature-value pairs: Case=Abl, Case=Ade, Case=All, Case=Ela, Case=Gen, Case=Ill, Case=Ine, Case=Nom, Case=Par, Case=Tra, Number=Plur, Number=Sing, PronType=Dem, PronType=Ind, PronType=Int, PronType=Rel, PronType=Tot

DET occurs with 48 feature combinations. The most frequent feature combination is Case=Nom|Number=Sing|PronType=Ind (70 tokens). Examples: üks, mitu, iga, mõni, mingi, teine, keegi, mingisugune, ükski
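A feature combination such as the one above is written in the CoNLL-U FEATS column as `Feature=Value` pairs joined by `|`. The sketch below splits such a string into its pairs. The helper name `parse_feats` is hypothetical, and the sketch assumes well-formed input with one `=` per pair.

```python
def parse_feats(feats):
    """Split a CoNLL-U FEATS string into a {feature: value} dict."""
    # An underscore marks an empty feature set in CoNLL-U.
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


# The most frequent DET combination reported on this page.
combo = parse_feats("Case=Nom|Number=Sing|PronType=Ind")
```

Here `combo` maps `Case` to `Nom`, `Number` to `Sing` and `PronType` to `Ind`.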

Relations

DET nodes are attached to their parents using 3 different relations: et-dep/det (400; 100% of instances), et-dep/conj (1; 0% of instances), et-dep/nmod (1; 0% of instances)
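The percentages appear to be rounded to the nearest whole number, which is why 400 of 402 attachments show as 100% and a single attachment shows as 0%. A brief sketch of that rounding, using the counts from this page (the variable names are illustrative):

```python
# Attachment counts for DET nodes, as listed on this page.
relation_counts = {"det": 400, "conj": 1, "nmod": 1}
total = sum(relation_counts.values())

# Share of each relation, rounded to a whole percent.
shares = {rel: round(100 * n / total) for rel, n in relation_counts.items()}
```

With these counts, `total` is 402, and `shares` gives 100 for `det` and 0 for both `conj` and `nmod`.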

Parents of DET nodes belong to 7 different parts of speech: NOUN (379; 94% of instances), PRON (7; 2% of instances), PROPN (6; 1% of instances), DET (4; 1% of instances), NUM (4; 1% of instances), ADJ (1; 0% of instances), ADV (1; 0% of instances)

389 (97%) DET nodes are leaves.

13 (3%) DET nodes have one child.

The highest child degree of a DET node is 1.
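The leaf and child-degree figures follow from the HEAD column: a node's child degree is the number of tokens whose HEAD points to it, and a leaf has degree 0. The sketch below illustrates this under the assumption that heads are given as a mapping from token ID to head ID, with 0 for the root. The function name `child_degrees` and the three-token sentence are hypothetical.

```python
from collections import Counter


def child_degrees(heads):
    """Map each token ID to its number of children.

    `heads` maps token ID -> head ID (0 = root), as in the CoNLL-U HEAD column.
    """
    counts = Counter(h for h in heads.values() if h != 0)
    return {tok: counts.get(tok, 0) for tok in heads}


# Hypothetical three-token sentence: token 2 is the root, 1 and 3 attach to it.
heads = {1: 2, 2: 0, 3: 2}
degrees = child_degrees(heads)
leaves = [tok for tok, d in degrees.items() if d == 0]
```

In this toy sentence tokens 1 and 3 are leaves and token 2 has a child degree of 2.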

Children of DET nodes are attached using 5 different relations: et-dep/advmod (6; 46% of instances), et-dep/det (3; 23% of instances), et-dep/conj (2; 15% of instances), et-dep/cc (1; 8% of instances), et-dep/compound:prt (1; 8% of instances)

Children of DET nodes belong to 4 different parts of speech: ADV (7; 54% of instances), DET (4; 31% of instances), ADJ (1; 8% of instances), CCONJ (1; 8% of instances)


DET in other languages: [am] [ar] [bg] [bxr] [ca] [ckb] [cop] [cs] [cu] [da] [de] [el] [en] [es] [et] [eu] [fa] [fi] [fo] [fr] [ga] [gl] [got] [grc] [he] [hi] [hr] [hu] [id] [it] [ja] [kk] [kmr] [ko] [la] [lv] [mr] [nl] [no] [pl] [pt] [ro] [ru] [sa] [sk] [sla] [sl] [so] [sr] [sv] [swl] [ta] [tr] [ug] [uk] [u] [urj] [ur] [vi] [yue] [zh]