home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Czech-CLTT: POS Tags: PRON

There are 4 PRON lemmas (0%), 37 PRON types (1%) and 631 PRON tokens (2%). Out of 15 observed tags, the rank of PRON is: 13 in number of lemmas, 9 in number of types and 10 in number of tokens.

The 10 most frequent PRON lemmas: se, jenž, on, veškerý

The 10 most frequent PRON types: se, nichž, němž, jej, němuž, je, jim, jí, jimiž, veškeré

The 10 most frequent ambiguous lemmas: jenž (PRON 73, DET 21)

The 10 most frequent ambiguous types: se (PRON 467, ADP 34), je (AUX 189, PRON 11, VERB 10), jehož (DET 6, PRON 5)

Morphology

The form / lemma ratio of PRON is 9.250000 (the average of all parts of speech is 1.766716).

The 1st highest number of forms (15) was observed with the lemma “on”: ho, je, jej, jemu, ji, jim, jimi, jí, nich, nim, nimi, ní, ním, ně, něj.

The 2nd highest number of forms (14) was observed with the lemma “jenž”: jehož, jenž, jež, jimiž, jímž, nichž, nimž, niž, nímž, níž, nějž, němuž, němž, něž.

The 3rd highest number of forms (4) was observed with the lemma “se”: se, sebou, si, sobě.

PRON occurs with 10 features: Case (631; 100% instances), PronType (631; 100% instances), Reflex (475; 75% instances), Variant (469; 74% instances), Number (156; 25% instances), PrepCase (93; 15% instances), Gender (90; 14% instances), Person (71; 11% instances), Style (12; 2% instances), Animacy (2; 0% instances)

PRON occurs with 22 feature-value pairs: Animacy=Inan, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Gender=Fem, Gender=Masc, Gender=Masc,Neut, Gender=Neut, Number=Plur, Number=Sing, Person=3, PrepCase=Npr, PrepCase=Pre, PronType=Prs, PronType=Rel, PronType=Tot, Reflex=Yes, Style=Arch, Variant=Short

PRON occurs with 49 feature combinations. The most frequent feature combination is Case=Acc|PronType=Prs|Reflex=Yes|Variant=Short (467 tokens). Examples: se

Relations

PRON nodes are attached to their parents using 11 different relations: expl:pass (353; 56% instances), expl:pv (113; 18% instances), obl (68; 11% instances), obj (47; 7% instances), nmod (24; 4% instances), obl:arg (9; 1% instances), nsubj (6; 1% instances), acl (4; 1% instances), iobj (4; 1% instances), conj (2; 0% instances), obl:agent (1; 0% instances)

Parents of PRON nodes belong to 6 different parts of speech: VERB (524; 83% instances), ADJ (60; 10% instances), NOUN (36; 6% instances), X (9; 1% instances), ADV (1; 0% instances), DET (1; 0% instances)

543 (86%) PRON nodes are leaves.

80 (13%) PRON nodes have one child.

4 (1%) PRON nodes have two children.

4 (1%) PRON nodes have three or more children.

The highest child degree of a PRON node is 3.

Children of PRON nodes are attached using 7 different relations: case (83; 83% instances), cop (4; 4% instances), nsubj (4; 4% instances), punct (4; 4% instances), cc (2; 2% instances), xcomp (2; 2% instances), advmod (1; 1% instances)

Children of PRON nodes belong to 7 different parts of speech: ADP (83; 83% instances), NOUN (5; 5% instances), AUX (4; 4% instances), PUNCT (4; 4% instances), CCONJ (2; 2% instances), ADJ (1; 1% instances), ADV (1; 1% instances)