
This page still pertains to UD version 1.

DET: determiner

Determiners are words that modify nouns or noun phrases and express the reference of the noun phrase in context.

Finnish has no true articles (see e.g. WALS), and many formalizations of Finnish morphology do not include a determiner tag or any related tag. However, words such as yksi “one” and se “that” are used similarly to articles, especially in spoken language.

Examples

References

Diffs

Turku Dependency Treebank

TDT annotates neither a DET tag nor any related tag, and DET is not used in the current version of the UD Finnish corpus.


Treebank Statistics (UD_Finnish-FTB)

There are 51 DET lemmas (0%), 531 DET types (1%) and 3670 DET tokens (3%). Out of 17 observed tags, the rank of DET is: 11 in number of lemmas, 8 in number of types and 11 in number of tokens.
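Counts of this kind can be reproduced from a CoNLL-U file. The following is a minimal sketch, not the script that generated this page: it assumes that a "type" is a distinct, case-sensitive word form, and the helper names `row` and `upos_counts` as well as the toy sentences are hypothetical.

```python
def row(idx, form, lemma, upos, head, deprel):
    """Build one 10-column CoNLL-U token line (unused columns are '_')."""
    return "\t".join(
        [str(idx), form, lemma, upos, "_", "_", str(head), deprel, "_", "_"]
    )


def upos_counts(conllu_text, upos="DET"):
    """Return (n_lemmas, n_types, n_tokens) for one UPOS tag."""
    lemmas, types, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        # Skip blank sentence separators and comment lines.
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        # Skip multiword-token ranges (1-2) and empty nodes (1.1).
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        if cols[3] == upos:
            tokens += 1
            types.add(cols[1])   # FORM column, case-sensitive (assumption)
            lemmas.add(cols[2])  # LEMMA column
    return len(lemmas), len(types), tokens


# Hypothetical toy data, not taken from the treebank.
sample = "\n".join([
    "# text = se talo",
    row(1, "se", "se", "DET", 2, "det"),
    row(2, "talo", "talo", "NOUN", 0, "root"),
    "",
    "# text = sen talon",
    row(1, "sen", "se", "DET", 2, "det"),
    row(2, "talon", "talo", "NOUN", 0, "root"),
    "",
])

print(upos_counts(sample))  # (1, 2, 2)
```

On the toy data, the two DET tokens se and sen share one lemma and form two types.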

The 10 most frequent DET lemmas: se, tämä, kaikki, jokin, mikään, muu, mikä, ne, hän, moni

The 10 most frequent DET types: se, sen, hänen, kaikki, mitään, tämä, joka, tällä, sitä, joku

The 10 most frequent ambiguous lemmas: se (PRON 2069, DET 488, NOUN 2), tämä (DET 441, PRON 334), kaikki (DET 230, PRON 223), jokin (DET 169, PRON 77), mikään (DET 150, PRON 109), muu (DET 150, PRON 131), mikä (PRON 520, DET 136), ne (PRON 350, DET 134), hän (PRON 1305, DET 125), moni (DET 120, PRON 62)

The 10 most frequent ambiguous types: se (PRON 794, DET 136), sen (PRON 244, DET 138, PART 3), hänen (DET 97, PRON 76), kaikki (PRON 98, DET 89), mitään (DET 84, PRON 79, ADV 1), tämä (PRON 62, DET 42), joka (PRON 244, DET 55), tällä (DET 54, PRON 2), sitä (PRON 241, DET 61, PART 25), joku (DET 58, PRON 37, PART 1)
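The ambiguity lists above group each lemma (or type) by the tags it occurs with, ordered by frequency. A minimal sketch of that grouping follows, assuming "ambiguous" simply means "observed with more than one UPOS tag"; the function name `ambiguous_items` and the toy counts are hypothetical.

```python
from collections import Counter, defaultdict


def ambiguous_items(pairs):
    """pairs: iterable of (item, upos), where item is a lemma or a word form.

    Returns a dict mapping every item seen with more than one UPOS tag
    to its (tag, count) list, most frequent tag first.
    """
    by_item = defaultdict(Counter)
    for item, upos in pairs:
        by_item[item][upos] += 1
    return {
        item: tags.most_common()
        for item, tags in by_item.items()
        if len(tags) > 1
    }


# Hypothetical toy data, not the treebank counts.
pairs = [("se", "PRON")] * 3 + [("se", "DET")] * 2 + [("talo", "NOUN")]
print(ambiguous_items(pairs))  # {'se': [('PRON', 3), ('DET', 2)]}
```

Passing (lemma, tag) pairs yields the ambiguous lemmas; passing (form, tag) pairs yields the ambiguous types.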

Morphology

The form / lemma ratio of DET is 10.411765 (the average of all parts of speech is 2.026917).
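The reported ratio is consistent with dividing the number of DET types by the number of DET lemmas given in the treebank statistics (531 and 51). The check below is a sketch under that assumption; the variable names are illustrative.

```python
det_types = 531   # DET types reported in the treebank statistics
det_lemmas = 51   # DET lemmas reported in the treebank statistics

ratio = det_types / det_lemmas
print(f"{ratio:.6f}")  # 10.411765
```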

The highest number of forms (28) was observed with the lemma “jokin”: Joillakin, Jollekin, Jossaki, johonki, johonkin, joihinkin, joillekin, joissakin, joistakin, joitain, joitaki, joitakin, jokin, jollain, jollakin, joltakin, jonakin, jonkin, jossai, jossain, jossakin, jostain, jostakin, jotai, jotain, jotaki, jotakin, jottais.

The 2nd highest number of forms (28) was observed with the lemma “muu”: Muissakin, muiden, muidenkaan, muihin, muilla, muille, muilta, muin, muina, muissa, muista, muita, muitakin, muitta, muitten, muu, muuhun, muukin, muulla, muulta, muun, muuna, muussa, muusta, muut, muuta, muutakin, muutkin.

The 3rd highest number of forms (27) was observed with the lemma “tämä”: Tämähän, Tämäkään, Tämäkö, Tänäkään, tähän, täksi, täl, tälle, tällä, tältä, tämä, tämän, tämänkin, tämänkään, tän, tänä, tänäkin, täs, tässä, tässäkin, täst, tästä, täsä, tätä, tätäkä, tää, tään.

DET occurs with 9 features: fi-feat/PronType (3650; 99% instances), fi-feat/Case (3553; 97% instances), fi-feat/Number (3217; 88% instances), fi-feat/Person (364; 10% instances), fi-feat/Style (289; 8% instances), fi-feat/Clitic (71; 2% instances), fi-feat/Degree (20; 1% instances), fi-feat/Reflex (20; 1% instances), fi-feat/Person[psor] (13; 0% instances)

DET occurs with 37 feature-value pairs: Case=Abe, Case=Abl, Case=Ade, Case=All, Case=Com, Case=Ela, Case=Ess, Case=Gen, Case=Ill, Case=Ine, Case=Ins, Case=Nom, Case=Par, Case=Tra, Clitic=Han, Clitic=Ka,S, Clitic=Kaan, Clitic=Kin, Clitic=Ko, Clitic=Ko,S, Clitic=S, Degree=Cmp, Degree=Sup, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Person[psor]=3, PronType=Dem, PronType=Ind, PronType=Int, PronType=Prs, PronType=Rcp, PronType=Rel, Reflex=Yes, Style=Coll

DET occurs with 181 feature combinations. The most frequent feature combination is Case=Nom|Number=Sing|PronType=Dem (304 tokens). Examples: se, tämä, tuo, sellainen, semmoinen, tällainen, tuollainen, tämmöinen

Relations

DET nodes are attached to their parents using 4 different relations: fi-dep/det (3441; 94% instances), fi-dep/amod (217; 6% instances), fi-dep/fixed (9; 0% instances), fi-dep/conj (3; 0% instances)

Parents of DET nodes belong to 10 different parts of speech: NOUN (3107; 85% instances), ADJ (225; 6% instances), PRON (86; 2% instances), PROPN (77; 2% instances), NUM (63; 2% instances), ADV (37; 1% instances), DET (37; 1% instances), VERB (32; 1% instances), ADP (5; 0% instances), X (1; 0% instances)

3446 (94%) DET nodes are leaves.

205 (6%) DET nodes have one child.

19 (1%) DET nodes have two children.

The highest child degree of a DET node is 2.
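The child-degree figures are internally consistent: 3446 + 205 + 19 = 3670 DET tokens, and 205 + 2 × 19 = 243 children in total. A distribution like this can be derived from the HEAD column alone. The sketch below assumes each sentence is given as a list of (id, upos, head) tuples; the function name `child_degree_histogram` and the toy sentences are hypothetical.

```python
from collections import Counter


def child_degree_histogram(sentences, upos="DET"):
    """Map child count -> number of `upos` nodes having that many children.

    sentences: list of sentences, each a list of (id, upos, head) tuples,
    where head is the id of the parent token (0 for the root).
    """
    hist = Counter()
    for sent in sentences:
        # Number of tokens pointing at each head id within this sentence.
        n_children = Counter(head for _, _, head in sent)
        for tok_id, tag, _ in sent:
            if tag == upos:
                hist[n_children[tok_id]] += 1  # missing ids count as 0
    return hist


# Hypothetical toy data, not taken from the treebank.
sentences = [
    [(1, "DET", 2), (2, "NOUN", 0)],                 # DET is a leaf
    [(1, "ADV", 2), (2, "DET", 3), (3, "NOUN", 0)],  # DET has one child
]
hist = child_degree_histogram(sentences)
print(dict(hist))  # {0: 1, 1: 1}
print(max(hist))   # 1
```

The entry for key 0 is the number of leaves, and the largest key is the highest child degree.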

Children of DET nodes are attached using 12 different relations: fi-dep/advmod (98; 40% instances), fi-dep/advcl (38; 16% instances), fi-dep/det (34; 14% instances), fi-dep/punct (22; 9% instances), fi-dep/conj (17; 7% instances), fi-dep/case (9; 4% instances), fi-dep/nummod (9; 4% instances), fi-dep/nmod (8; 3% instances), fi-dep/amod (3; 1% instances), fi-dep/mark (3; 1% instances), fi-dep/acl (1; 0% instances), fi-dep/reparandum (1; 0% instances)

Children of DET nodes belong to 12 different parts of speech: PART (80; 33% instances), DET (37; 15% instances), ADV (24; 10% instances), PUNCT (22; 9% instances), NOUN (16; 7% instances), PRON (16; 7% instances), VERB (15; 6% instances), NUM (10; 4% instances), ADP (9; 4% instances), PROPN (7; 3% instances), ADJ (6; 2% instances), X (1; 0% instances)


DET in other languages: [am] [ar] [bg] [bxr] [ca] [ckb] [cop] [cs] [cu] [da] [de] [el] [en] [es] [et] [eu] [fa] [fi] [fo] [fr] [ga] [gl] [got] [grc] [he] [hi] [hr] [hu] [id] [it] [ja] [kk] [kmr] [ko] [la] [lv] [mr] [nl] [no] [pl] [pt] [ro] [ru] [sa] [sk] [sla] [sl] [so] [sr] [sv] [swl] [ta] [tr] [ug] [uk] [u] [urj] [ur] [vi] [yue] [zh]