DET
: determiner
Determiners are words that modify nouns or noun phrases and express the reference of the noun phrase in context.
Finnish has no true articles (see e.g. WALS) and many formalizations of Finnish morphology don’t involve a determiner (or related) tag. However, words such as yksi “one” and se “that” are used similarly to articles, especially in spoken language.
Examples
- [fi] yksi mies lähti “a/one man left”
- [fi] se mies lähti “the/that man left”
References
Diffs
Turku Dependency Treebank
No DET
tag (or related) is annotated in TDT, and DET
is
not used in the current version of the UD Finnish corpus.
Treebank Statistics (UD_Finnish-FTB)
There are 51 DET
lemmas (0%), 531 DET
types (1%) and 3670 DET
tokens (3%).
Out of 17 observed tags, the rank of DET
is: 11 in number of lemmas, 8 in number of types and 11 in number of tokens.
The 10 most frequent DET
lemmas: se, tämä, kaikki, jokin, mikään, muu, mikä, ne, hän, moni
The 10 most frequent DET
types: se, sen, hänen, kaikki, mitään, tämä, joka, tällä, sitä, joku
The 10 most frequent ambiguous lemmas: se (PRON 2069, DET 488, NOUN 2), tämä (DET 441, PRON 334), kaikki (DET 230, PRON 223), jokin (DET 169, PRON 77), mikään (DET 150, PRON 109), muu (DET 150, PRON 131), mikä (PRON 520, DET 136), ne (PRON 350, DET 134), hän (PRON 1305, DET 125), moni (DET 120, PRON 62)
The 10 most frequent ambiguous types: se (PRON 794, DET 136), sen (PRON 244, DET 138, PART 3), hänen (DET 97, PRON 76), kaikki (PRON 98, DET 89), mitään (DET 84, PRON 79, ADV 1), tämä (PRON 62, DET 42), joka (PRON 244, DET 55), tällä (DET 54, PRON 2), sitä (PRON 241, DET 61, PART 25), joku (DET 58, PRON 37, PART 1)
- se
- sen
- hänen
- kaikki
- mitään
- tämä
- joka
- tällä
- sitä
- joku
Morphology
The form / lemma ratio of DET
is 10.411765 (the average of all parts of speech is 2.026917).
The 1st highest number of forms (28) was observed with the lemma “jokin”: Joillakin, Jollekin, Jossaki, johonki, johonkin, joihinkin, joillekin, joissakin, joistakin, joitain, joitaki, joitakin, jokin, jollain, jollakin, joltakin, jonakin, jonkin, jossai, jossain, jossakin, jostain, jostakin, jotai, jotain, jotaki, jotakin, jottais.
The 2nd highest number of forms (28) was observed with the lemma “muu”: Muissakin, muiden, muidenkaan, muihin, muilla, muille, muilta, muin, muina, muissa, muista, muita, muitakin, muitta, muitten, muu, muuhun, muukin, muulla, muulta, muun, muuna, muussa, muusta, muut, muuta, muutakin, muutkin.
The 3rd highest number of forms (27) was observed with the lemma “tämä”: Tämähän, Tämäkään, Tämäkö, Tänäkään, tähän, täksi, täl, tälle, tällä, tältä, tämä, tämän, tämänkin, tämänkään, tän, tänä, tänäkin, täs, tässä, tässäkin, täst, tästä, täsä, tätä, tätäkä, tää, tään.
DET
occurs with 9 features: fi-feat/PronType (3650; 99% instances), fi-feat/Case (3553; 97% instances), fi-feat/Number (3217; 88% instances), fi-feat/Person (364; 10% instances), fi-feat/Style (289; 8% instances), fi-feat/Clitic (71; 2% instances), fi-feat/Degree (20; 1% instances), fi-feat/Reflex (20; 1% instances), fi-feat/Person[psor] (13; 0% instances)
DET
occurs with 37 feature-value pairs: Case=Abe
, Case=Abl
, Case=Ade
, Case=All
, Case=Com
, Case=Ela
, Case=Ess
, Case=Gen
, Case=Ill
, Case=Ine
, Case=Ins
, Case=Nom
, Case=Par
, Case=Tra
, Clitic=Han
, Clitic=Ka,S
, Clitic=Kaan
, Clitic=Kin
, Clitic=Ko
, Clitic=Ko,S
, Clitic=S
, Degree=Cmp
, Degree=Sup
, Number=Plur
, Number=Sing
, Person=1
, Person=2
, Person=3
, Person[psor]=3
, PronType=Dem
, PronType=Ind
, PronType=Int
, PronType=Prs
, PronType=Rcp
, PronType=Rel
, Reflex=Yes
, Style=Coll
DET
occurs with 181 feature combinations.
The most frequent feature combination is Case=Nom|Number=Sing|PronType=Dem
(304 tokens).
Examples: se, tämä, tuo, sellainen, semmoinen, tällainen, tuollainen, tämmöinen
Relations
DET
nodes are attached to their parents using 4 different relations: fi-dep/det (3441; 94% instances), fi-dep/amod (217; 6% instances), fi-dep/fixed (9; 0% instances), fi-dep/conj (3; 0% instances)
Parents of DET
nodes belong to 10 different parts of speech: NOUN (3107; 85% instances), ADJ (225; 6% instances), PRON (86; 2% instances), PROPN (77; 2% instances), NUM (63; 2% instances), ADV (37; 1% instances), DET (37; 1% instances), VERB (32; 1% instances), ADP (5; 0% instances), X (1; 0% instances)
3446 (94%) DET
nodes are leaves.
205 (6%) DET
nodes have one child.
19 (1%) DET
nodes have two children.
The highest child degree of a DET
node is 2.
Children of DET
nodes are attached using 12 different relations: fi-dep/advmod (98; 40% instances), fi-dep/advcl (38; 16% instances), fi-dep/det (34; 14% instances), fi-dep/punct (22; 9% instances), fi-dep/conj (17; 7% instances), fi-dep/case (9; 4% instances), fi-dep/nummod (9; 4% instances), fi-dep/nmod (8; 3% instances), fi-dep/amod (3; 1% instances), fi-dep/mark (3; 1% instances), fi-dep/acl (1; 0% instances), fi-dep/reparandum (1; 0% instances)
Children of DET
nodes belong to 12 different parts of speech: PART (80; 33% instances), DET (37; 15% instances), ADV (24; 10% instances), PUNCT (22; 9% instances), NOUN (16; 7% instances), PRON (16; 7% instances), VERB (15; 6% instances), NUM (10; 4% instances), ADP (9; 4% instances), PROPN (7; 3% instances), ADJ (6; 2% instances), X (1; 0% instances)
DET in other languages: [am] [ar] [bg] [bxr] [ca] [ckb] [cop] [cs] [cu] [da] [de] [el] [en] [es] [et] [eu] [fa] [fi] [fo] [fr] [ga] [gl] [got] [grc] [he] [hi] [hr] [hu] [id] [it] [ja] [kk] [kmr] [ko] [la] [lv] [mr] [nl] [no] [pl] [pt] [ro] [ru] [sa] [sk] [sla] [sl] [so] [sr] [sv] [swl] [ta] [tr] [ug] [uk] [u] [urj] [ur] [vi] [yue] [zh]