PRON: pronoun
Pronouns are words that substitute for nouns or noun phrases, whose meaning is recoverable from the linguistic or extralinguistic context.
See also PronType.
Examples
- [fi] minä, sinä, hän, me, te, he “I, you, he/she, we, you, they” (personal pronouns)
- [fi] itse “self” (reflexive pronoun)
- [fi] tämä, tuo, se, nämä, nuo, ne “this, that, it/that, these, those, they/those” (demonstrative pronouns)
- [fi] kuka, mikä, kumpi “who, what, which” (interrogative pronouns)
- [fi] joka, mikä “who, that” (relative pronouns)
- [fi] TODO (indefinite pronouns)
- [fi] TODO (totality pronouns)
- [fi] TODO (negative pronouns)
- [fi] muu “other”, sama “same”
References
- http://scripta.kotus.fi/visk/sisallys.php?p=713 (in Finnish)
Treebank Statistics (UD_Finnish)
There are 48 PRON lemmas (0%), 600 PRON types (1%) and 11933 PRON tokens (7%).
Out of 15 observed tags, the rank of PRON is: 11 in number of lemmas, 7 in number of types and 6 in number of tokens.
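The lemma, type and token counts above can be reproduced in outline from a CoNLL-U file. The following is a minimal sketch, not the script that generated this page: `read_tokens`, `pos_counts` and the three-sentence `SAMPLE` are hypothetical names and data made up for illustration, the unused CoNLL-U columns are filled with `_`, and types are assumed to be case-sensitive word forms.

```python
def _row(idx, form, lemma, upos):
    # Build one 10-column CoNLL-U token line; unused columns are "_" (toy data).
    return "\t".join([str(idx), form, lemma, upos] + ["_"] * 6)


# Hypothetical three-sentence sample, abbreviated for illustration only.
SAMPLE = "\n".join([
    "# text = Se on hän .",
    _row(1, "Se", "se", "PRON"),
    _row(2, "on", "olla", "AUX"),
    _row(3, "hän", "hän", "PRON"),
    _row(4, ".", ".", "PUNCT"),
    "",
    "# text = Näen sen .",
    _row(1, "Näen", "nähdä", "VERB"),
    _row(2, "sen", "se", "PRON"),
    _row(3, ".", ".", "PUNCT"),
    "",
    "# text = Onko se totta ?",
    _row(1, "Onko", "olla", "AUX"),
    _row(2, "se", "se", "PRON"),
    _row(3, "totta", "tosi", "ADJ"),
    _row(4, "?", "?", "PUNCT"),
    "",
])


def read_tokens(conllu_text):
    """Yield (form, lemma, upos) for each regular token line."""
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip sentence separators and comment lines
        cols = line.split("\t")
        if not cols[0].isdigit():
            continue  # skip multiword-token ranges (1-2) and empty nodes (1.1)
        yield cols[1], cols[2], cols[3]


def pos_counts(conllu_text, upos="PRON"):
    """Count distinct lemmas, distinct word forms (types) and tokens for one tag."""
    lemmas, forms, tokens = set(), set(), 0
    for form, lemma, tag in read_tokens(conllu_text):
        if tag == upos:
            lemmas.add(lemma)
            forms.add(form)  # assumption: "Se" and "se" count as different types
            tokens += 1
    return {"lemmas": len(lemmas), "types": len(forms), "tokens": tokens}


print(pos_counts(SAMPLE))
```

On the toy sample, the four PRON tokens (Se, hän, sen, se) give 2 lemmas, 4 types and 4 tokens.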
The 10 most frequent PRON lemmas: se, joka, tämä, hän, minä, mikä, kaikki, muu, jokin, sinä
The 10 most frequent PRON types: se, hän, sen, joka, sitä, siitä, tämä, tämän, jotka, ne
The 10 most frequent ambiguous lemmas: kaikki (PRON 487, ADV 1), muu (PRON 378, ADV 1), toinen (PRON 171, ADJ 136, NOUN 36), itse (PRON 164, ADV 120), yksi (NUM 173, PRON 47), ainoa (PRON 34, ADJ 2, NOUN 1), harva (PRON 8, ADJ 4), usea (ADJ 107, PRON 4), useampi (PRON 2, ADJ 2), jonne (ADV 4, PRON 1)
The 10 most frequent ambiguous types: sen (PRON 476, ADV 13), sitä (PRON 303, ADV 18, CCONJ 2), tämän (PRON 181, ADV 1), mitä (PRON 177, ADV 12, CCONJ 4, SCONJ 1), kaikki (PRON 170, ADV 1), siinä (PRON 74, ADV 13), tästä (PRON 55, ADV 1), jotain (PRON 79, ADV 2), sillä (SCONJ 138, PRON 59), muiden (PRON 49, ADV 4)
- mitä
- PRON 177: Ja mitä kustantaa valmis dolly ?
- ADV 12: Vatsani oli aivan pohjaton ja veti sitä enemmän , mitä enemmän söin .
- CCONJ 4: Kotitalousvähennyksen käyttö vähenee , mitä niukemmin kotitalous ansaitsee .
- SCONJ 1: 1 kotitekoinen vuokaleipä tehty tällä ohjeella ( leipä kannattaa tehdä ainakin pari päivää aikaisemmin mitä aikoo tehdä kakun )
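An ambiguous type is a word form that occurs with more than one tag. The sketch below shows one way such a list could be derived from (form, tag) pairs. The function name `ambiguous_types` is hypothetical, and the input is toy data that reuses only the counts quoted above for mitä plus one made-up unambiguous form.

```python
from collections import Counter, defaultdict


def ambiguous_types(pairs, upos="PRON"):
    """Map each word form seen with `upos` and at least one other tag to its tag counts."""
    by_form = defaultdict(Counter)
    for form, tag in pairs:
        by_form[form][tag] += 1
    return {
        form: dict(tags)
        for form, tags in by_form.items()
        if upos in tags and len(tags) > 1
    }


# Toy input: the counts for "mitä" come from the list above; "hän" is illustrative.
pairs = (
    [("mitä", "PRON")] * 177
    + [("mitä", "ADV")] * 12
    + [("mitä", "CCONJ")] * 4
    + [("mitä", "SCONJ")] * 1
    + [("hän", "PRON")] * 5
)

print(ambiguous_types(pairs))
```

Only mitä is reported here, because hän occurs with a single tag in the toy input.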
Morphology
The form / lemma ratio of PRON is 12.500000 (the average of all parts of speech is 2.037154).
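The reported ratio is consistent with dividing the number of PRON types by the number of PRON lemmas given in the statistics on this page (600 / 48 for UD_Finnish, 562 / 43 for UD_Finnish-FTB). That reading is an inference from the numbers, not a documented definition, and `form_lemma_ratio` is a hypothetical helper name.

```python
def form_lemma_ratio(n_types, n_lemmas):
    """Average number of distinct word forms per lemma."""
    return n_types / n_lemmas


# Type and lemma counts as reported on this page.
print(f"UD_Finnish:     {form_lemma_ratio(600, 48):.6f}")
print(f"UD_Finnish-FTB: {form_lemma_ratio(562, 43):.6f}")
```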
The 1st highest number of forms (46) was observed with the lemma “se”: Senhän, Siinäpä, Sitähän, ne, nekin, niiden, niidenkin, niihin, niil, niilki, niille, niillä, niilläkin, niiltä, niinä, niissä, niistä, niistäkin, niit, niitten, niitä, se, sehän, sekin, sekään, sen, senkin, senkään, sieltä, siihe, siihen, siihenkin, siin, siinä, siitä, siitähän, siitäkin, siitäs, sille, sillä, silläkin, siltä, sinä, sitä, sitäkin, sitäkään.
The 2nd highest number of forms (45) was observed with the lemma “minä”: Mehän, me, meidän, meidät, meil, meille, meillä, meilläkin, meiltä, meistä, meitä, meiäm, meiän, mekin, mie, minua, minulla, minulle, minulta, minultakin, minun, minunkin, minussa, minussakin, minusta, minut, minutkin, minuun, minä, minähän, minäkin, minäkään, minäpä, mua, mul, mull, mulla, mulle, mullekin, multa, mun, musta, mut, mä, mää.
The 3rd highest number of forms (39) was observed with the lemma “tämä”: Näine, Tähänkin, Tässäkö, Täst, näiden, näihin, näille, näillä, näinä, näis, näissä, näissäkin, näistä, näitä, näitäkin, nämä, nää, tähän, tähän(kin), täksi, tälle, tällekin, tällä, tältä, tämä, tämäkin, tämän, tämänkin, tän, tänä, tänäkin, tänäkään, tässä, tässäkin, tästä, tästäkin, tätä, tätäkä, tää.
PRON occurs with 11 features: fi-feat/Case (11928; 100% instances), fi-feat/Number (11928; 100% instances), fi-feat/PronType (11735; 98% instances), fi-feat/Person (2475; 21% instances), fi-feat/Style (285; 2% instances), fi-feat/Clitic (192; 2% instances), fi-feat/Person[psor] (176; 1% instances), fi-feat/Reflex (164; 1% instances), fi-feat/Number[psor] (46; 0% instances), fi-feat/Typo (15; 0% instances), fi-feat/Degree (4; 0% instances)
PRON occurs with 41 feature-value pairs: Case=Abl, Case=Acc, Case=Ade, Case=All, Case=Com, Case=Ela, Case=Ess, Case=Gen, Case=Ill, Case=Ine, Case=Ins, Case=Nom, Case=Par, Case=Tra, Clitic=Han, Clitic=Han,Ko, Clitic=Kaan, Clitic=Kin, Clitic=Ko, Clitic=Pa, Clitic=S, Degree=Pos, Number=Plur, Number=Sing, Number[psor]=Plur, Number[psor]=Sing, Person=1, Person=2, Person=3, Person[psor]=1, Person[psor]=2, Person[psor]=3, PronType=Dem, PronType=Ind, PronType=Int, PronType=Prs, PronType=Rcp, PronType=Rel, Reflex=Yes, Style=Coll, Typo=Yes
PRON occurs with 361 feature combinations.
The most frequent feature combination is Case=Nom|Number=Sing|PronType=Dem (1145 tokens).
Examples: se, tämä, tuo
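A feature combination is the full content of a token's FEATS column, written as `Name=Value` pairs joined by `|`. The sketch below splits such a string into a dictionary and picks the most frequent combination from a list. `parse_feats` and `most_frequent_combination` are hypothetical helper names, and the input list is toy data, not treebank counts.

```python
from collections import Counter


def parse_feats(feats):
    """Split a FEATS string into a {feature: value} dict; "_" means no features."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def most_frequent_combination(feats_strings):
    """Return (combination, count) for the most common FEATS string."""
    return Counter(feats_strings).most_common(1)[0]


# Toy FEATS values for a handful of PRON tokens (illustrative only).
toy_feats = (
    ["Case=Nom|Number=Sing|PronType=Dem"] * 3
    + ["Case=Gen|Number=Sing|PronType=Dem"] * 2
    + ["Case=Nom|Number=Sing|Person=3|PronType=Prs"]
)

print(parse_feats("Case=Nom|Number=Sing|PronType=Dem"))
print(most_frequent_combination(toy_feats))
```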
Relations
PRON nodes are attached to their parents using 29 different relations: fi-dep/det (3139; 26% instances), fi-dep/nsubj (2697; 23% instances), fi-dep/obl (1810; 15% instances), fi-dep/obj (1525; 13% instances), fi-dep/nsubj:cop (835; 7% instances), fi-dep/nmod:poss (736; 6% instances), fi-dep/root (274; 2% instances), fi-dep/nmod (268; 2% instances), fi-dep/conj (160; 1% instances), fi-dep/acl:relcl (101; 1% instances), fi-dep/ccomp (87; 1% instances), fi-dep/nmod:gobj (71; 1% instances), fi-dep/advcl (70; 1% instances), fi-dep/nmod:gsubj (45; 0% instances), fi-dep/advmod (42; 0% instances), fi-dep/appos (21; 0% instances), fi-dep/parataxis (11; 0% instances), fi-dep/xcomp:ds (9; 0% instances), fi-dep/fixed (6; 0% instances), fi-dep/xcomp (6; 0% instances), fi-dep/amod (4; 0% instances), fi-dep/csubj:cop (3; 0% instances), fi-dep/compound:nn (2; 0% instances), fi-dep/dep (2; 0% instances), fi-dep/discourse (2; 0% instances), fi-dep/nummod (2; 0% instances), fi-dep/orphan (2; 0% instances), fi-dep/vocative (2; 0% instances), fi-dep/_ (1; 0% instances)
Parents of PRON nodes belong to 15 different parts of speech: VERB (6085; 51% instances), NOUN (4493; 38% instances), ADJ (497; 4% instances), PRON (303; 3% instances), ROOT (275; 2% instances), ADV (151; 1% instances), PROPN (70; 1% instances), NUM (29; 0% instances), AUX (15; 0% instances), PUNCT (5; 0% instances), SYM (3; 0% instances), X (3; 0% instances), ADP (2; 0% instances), CCONJ (1; 0% instances), SCONJ (1; 0% instances)
10006 (84%) PRON nodes are leaves.
1140 (10%) PRON nodes have one child.
209 (2%) PRON nodes have two children.
578 (5%) PRON nodes have three or more children.
The highest child degree of a PRON node is 9.
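The child degree of a node is the number of tokens whose HEAD points to it, and a leaf is a node with degree 0. The sketch below computes the degree distribution for PRON nodes in one sentence. `child_degree_histogram` is a hypothetical name, and the sentence "Se on hän ." is a hand-made toy analysis with the pronoun hän as root, not an excerpt from the treebank.

```python
from collections import Counter


def child_degree_histogram(sentence, upos="PRON"):
    """Map child count -> number of `upos` nodes with that many children.

    `sentence` is a list of (id, upos, head) tuples, with head 0 for the root.
    """
    n_children = Counter(head for _, _, head in sentence)
    return Counter(n_children[tok_id] for tok_id, tag, _ in sentence if tag == upos)


# Toy analysis of "Se on hän ." (illustrative only).
toy_sentence = [
    (1, "PRON", 3),   # Se  -> depends on hän
    (2, "AUX", 3),    # on  -> depends on hän
    (3, "PRON", 0),   # hän -> root
    (4, "PUNCT", 3),  # .   -> depends on hän
]

hist = child_degree_histogram(toy_sentence)
print(dict(hist))
print("highest child degree:", max(hist))
```

In the toy sentence, Se is a leaf and hän has three children, so the histogram has one node at degree 0 and one at degree 3.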
Children of PRON nodes are attached using 36 different relations: fi-dep/punct (614; 15% instances), fi-dep/nsubj:cop (495; 12% instances), fi-dep/case (396; 10% instances), fi-dep/advmod (368; 9% instances), fi-dep/ccomp (345; 9% instances), fi-dep/cop (307; 8% instances), fi-dep/cop:own (197; 5% instances), fi-dep/cc (156; 4% instances), fi-dep/acl:relcl (140; 3% instances), fi-dep/nmod (138; 3% instances), fi-dep/conj (136; 3% instances), fi-dep/advcl (121; 3% instances), fi-dep/det (104; 3% instances), fi-dep/aux (103; 3% instances), fi-dep/mark (97; 2% instances), fi-dep/obl (87; 2% instances), fi-dep/fixed (84; 2% instances), fi-dep/appos (23; 1% instances), fi-dep/_ (19; 0% instances), fi-dep/orphan (19; 0% instances), fi-dep/amod (12; 0% instances), fi-dep/parataxis (12; 0% instances), fi-dep/discourse (11; 0% instances), fi-dep/nmod:poss (8; 0% instances), fi-dep/nsubj (5; 0% instances), fi-dep/root (5; 0% instances), fi-dep/acl (4; 0% instances), fi-dep/csubj:cop (4; 0% instances), fi-dep/vocative (4; 0% instances), fi-dep/xcomp (4; 0% instances), fi-dep/xcomp:ds (4; 0% instances), fi-dep/nummod (3; 0% instances), fi-dep/obj (3; 0% instances), fi-dep/cc:preconj (2; 0% instances), fi-dep/compound:nn (1; 0% instances), fi-dep/nmod:gsubj (1; 0% instances)
Children of PRON nodes belong to 14 different parts of speech: NOUN (747; 19% instances), PUNCT (614; 15% instances), AUX (607; 15% instances), VERB (541; 13% instances), ADV (464; 12% instances), ADP (395; 10% instances), PRON (274; 7% instances), CCONJ (146; 4% instances), SCONJ (100; 2% instances), ADJ (78; 2% instances), PROPN (46; 1% instances), INTJ (11; 0% instances), NUM (5; 0% instances), SYM (4; 0% instances)
Treebank Statistics (UD_Finnish-FTB)
There are 43 PRON lemmas (0%), 562 PRON types (1%) and 9529 PRON tokens (7%).
Out of 17 observed tags, the rank of PRON is: 12 in number of lemmas, 7 in number of types and 4 in number of tokens.
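The rank of a tag is its position when all observed tags are sorted by a count in descending order. The sketch below illustrates the idea with made-up toy counts, which are not treebank figures. `rank_of` is a hypothetical helper, and how ties are broken is an assumption (ties keep dictionary order here).

```python
def rank_of(tag, counts):
    """1-based position of `tag` when tags are sorted by count, largest first."""
    ordered = sorted(counts, key=counts.get, reverse=True)
    return ordered.index(tag) + 1


# Made-up token counts per tag, for illustration only.
toy_token_counts = {"NOUN": 5, "VERB": 4, "PRON": 3, "ADJ": 1}

print(rank_of("PRON", toy_token_counts))
```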
The 10 most frequent PRON lemmas: se, hän, minä, joka, sinä, mikä, me, ne, tämä, he
The 10 most frequent PRON types: se, hän, minä, sen, sitä, mitä, joka, mä, ne, sinä
The 10 most frequent ambiguous lemmas: se (PRON 2069, DET 488, NOUN 2), hän (PRON 1305, DET 125), minä (PRON 1258, DET 75, NOUN 2), joka (PRON 663, DET 99), sinä (PRON 541, DET 40), mikä (PRON 520, DET 136), me (PRON 443, DET 66, X 1), ne (PRON 350, DET 134), tämä (DET 441, PRON 334), he (PRON 292, DET 41)
The 10 most frequent ambiguous types: se (PRON 794, DET 136), minä (PRON 263, DET 3), sen (PRON 244, DET 138, PART 3), sitä (PRON 241, DET 61, PART 25), mitä (PRON 198, DET 16, PART 14, ADV 6), joka (PRON 244, DET 55), ne (PRON 143, DET 37), sinä (PRON 147, DET 11), siitä (PRON 154, DET 23, ADV 6), me (PRON 114, X 1)
Morphology
The form / lemma ratio of PRON is 13.069767 (the average of all parts of speech is 2.026917).
The 1st highest number of forms (49) was observed with the lemma “minä”: Minähän, Minäkään, Minäkö, Minäpä, Mullahan, Mäkin, m, m-, ma, mi, mie, miekii, minua, minulla, minullakin, minulle, minullekin, minulta, minultakin, minun, minunhan, minunkin, minussa, minussakin, minusta, minut, minuun, minä, minäki, minäkin, minäkä, minäkös, miul, miäpäs, mua, mul, mull, mulla, mulle, mullekin, multa, mum, mun, must, musta, mut, mä, mää, määki.
The 2nd highest number of forms (47) was observed with the lemma “se”: Sepä, Sepäs, Siihenkö, Siitäkin, Siitäpä, Sitähän, Sitäkö, s, se, se-, sehän, sej, sekin, sekään, sekö, sen, senhän, senkään, si, siihe, siihem, siihen, siihenkin, siin, siint, siinä, siinähä, siinäkin, siinäpä, siit, siitä, siitähän, siitäkään, siksi, sil, sille, sillä, silläkin, silt, siltä, sit, sitä, sitäkin, sitäkään, sitäkäänkö, so, s´.
The 3rd highest number of forms (39) was observed with the lemma “itse”: Iteki, ihtiisäj, ite, itse, itseemme, itseen, itseensä, itsekin, itselle, itselleen, itselleenkin, itsellekin, itselleni, itsellensä, itselläsikin, itsellään, itselläänkin, itseltään, itseni, itsenne, itsensä, itsensäkin, itsesi, itsestä, itsestämme, itsestänikään, itsestäsi, itsestään, itseä, itseämme, itseäni, itseäsi, itseään, itte, itteeni, ittees, ittelläs, ittellää, ittenä.
PRON occurs with 10 features: fi-feat/Case (9527; 100% instances), fi-feat/PronType (9261; 97% instances), fi-feat/Number (8879; 93% instances), fi-feat/Person (3996; 42% instances), fi-feat/Style (741; 8% instances), fi-feat/Clitic (277; 3% instances), fi-feat/Reflex (268; 3% instances), fi-feat/Person[psor] (230; 2% instances), fi-feat/Number[psor] (48; 1% instances), fi-feat/Degree (14; 0% instances)
PRON occurs with 43 feature-value pairs: Case=Abl, Case=Acc, Case=Ade, Case=All, Case=Ela, Case=Ess, Case=Gen, Case=Ill, Case=Ine, Case=Ins, Case=Nom, Case=Par, Case=Tra, Clitic=Han, Clitic=Han,Ko, Clitic=Kaan, Clitic=Kaan,Ko, Clitic=Kin, Clitic=Ko, Clitic=Ko,S, Clitic=Pa, Clitic=Pa,S, Clitic=S, Degree=Cmp, Degree=Sup, Number=Plur, Number=Sing, Number[psor]=Plur, Number[psor]=Sing, Person=1, Person=2, Person=3, Person[psor]=1, Person[psor]=2, Person[psor]=3, PronType=Dem, PronType=Ind, PronType=Int, PronType=Prs, PronType=Rcp, PronType=Rel, Reflex=Yes, Style=Coll
PRON occurs with 372 feature combinations.
The most frequent feature combination is Case=Nom|Number=Sing|PronType=Dem (1123 tokens).
Examples: se, tämä, tuo, se-
Relations
PRON nodes are attached to their parents using 21 different relations: fi-dep/nsubj (3839; 40% instances), fi-dep/nmod (1888; 20% instances), fi-dep/obj (1548; 16% instances), fi-dep/nsubj:cop (815; 9% instances), fi-dep/nmod:own (403; 4% instances), fi-dep/expl (399; 4% instances), fi-dep/conj (173; 2% instances), fi-dep/root (147; 2% instances), fi-dep/advmod (103; 1% instances), fi-dep/advcl (58; 1% instances), fi-dep/fixed (33; 0% instances), fi-dep/nmod:gobj (25; 0% instances), fi-dep/ccomp (23; 0% instances), fi-dep/vocative (23; 0% instances), fi-dep/mark (19; 0% instances), fi-dep/dep (12; 0% instances), fi-dep/acl (8; 0% instances), fi-dep/nmod:gsubj (7; 0% instances), fi-dep/det (4; 0% instances), fi-dep/amod (1; 0% instances), fi-dep/xcomp (1; 0% instances)
Parents of PRON nodes belong to 13 different parts of speech: VERB (7684; 81% instances), NOUN (762; 8% instances), ADJ (539; 6% instances), PRON (259; 3% instances), ROOT (147; 2% instances), PROPN (58; 1% instances), ADV (29; 0% instances), DET (16; 0% instances), NUM (15; 0% instances), ADP (8; 0% instances), X (5; 0% instances), PART (4; 0% instances), INTJ (3; 0% instances)
7522 (79%) PRON nodes are leaves.
1541 (16%) PRON nodes have one child.
271 (3%) PRON nodes have two children.
195 (2%) PRON nodes have three or more children.
The highest child degree of a PRON node is 9.
Children of PRON nodes are attached using 24 different relations: fi-dep/punct (990; 35% instances), fi-dep/case (353; 13% instances), fi-dep/nmod (258; 9% instances), fi-dep/advmod (246; 9% instances), fi-dep/cop (151; 5% instances), fi-dep/conj (123; 4% instances), fi-dep/nsubj:cop (114; 4% instances), fi-dep/cc (109; 4% instances), fi-dep/fixed (83; 3% instances), fi-dep/det (82; 3% instances), fi-dep/mark (77; 3% instances), fi-dep/acl (76; 3% instances), fi-dep/advcl (45; 2% instances), fi-dep/aux (32; 1% instances), fi-dep/amod (26; 1% instances), fi-dep/discourse (12; 0% instances), fi-dep/nummod (10; 0% instances), fi-dep/expl (9; 0% instances), fi-dep/dep (8; 0% instances), fi-dep/csubj:cop (5; 0% instances), fi-dep/vocative (4; 0% instances), fi-dep/nsubj (2; 0% instances), fi-dep/reparandum (2; 0% instances), fi-dep/xcomp (2; 0% instances)
Children of PRON nodes belong to 16 different parts of speech: PUNCT (990; 35% instances), ADP (357; 13% instances), PRON (259; 9% instances), NOUN (252; 9% instances), PART (179; 6% instances), AUX (151; 5% instances), VERB (140; 5% instances), CCONJ (110; 4% instances), ADV (104; 4% instances), DET (86; 3% instances), SCONJ (79; 3% instances), PROPN (46; 2% instances), ADJ (39; 1% instances), NUM (13; 0% instances), INTJ (12; 0% instances), X (2; 0% instances)