Statistics of PRON in UD


This page pertains to UD version 2.

Treebank Statistics: UD_Czech-CLTT: POS Tags: `PRON`

There are 4 PRON lemmas (0%), 37 PRON types (1%) and 631 PRON tokens (2%). Out of 15 observed tags, the rank of PRON is: 13 in number of lemmas, 9 in number of types and 10 in number of tokens.
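Counts of this kind can be reproduced from a CoNLL-U file, whose ten tab-separated columns include FORM (2nd), LEMMA (3rd) and UPOS (4th). The following is a minimal sketch, not the script that generated this page: the `SAMPLE` fragment is hand-made for illustration rather than taken from UD_Czech-CLTT, the function name `upos_counts` is hypothetical, and treating word forms case-sensitively is an assumption.

```python
# Hand-made CoNLL-U fragment for illustration only (NOT from UD_Czech-CLTT).
SAMPLE = "\n".join([
    "# text = Zpráva se nevyhotovuje .",
    "1\tZpráva\tzpráva\tNOUN\t_\t_\t3\tnsubj:pass\t_\t_",
    "2\tse\tse\tPRON\t_\t_\t3\texpl:pass\t_\t_",
    "3\tnevyhotovuje\tvyhotovovat\tVERB\t_\t_\t0\troot\t_\t_",
    "4\t.\t.\tPUNCT\t_\t_\t3\tpunct\t_\t_",
    "",
])


def upos_counts(conllu_text, upos):
    """Return (lemmas, types, tokens) for one UPOS tag.

    Assumption: types are distinct word forms compared case-sensitively.
    """
    lemmas, types, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        # Skip sentence separators and comment lines.
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        # Skip multiword-token ranges (e.g. 1-2) and empty nodes (e.g. 1.1).
        if "-" in cols[0] or "." in cols[0]:
            continue
        if cols[3] == upos:
            tokens += 1
            types.add(cols[1])
            lemmas.add(cols[2])
    return len(lemmas), len(types), tokens


print(upos_counts(SAMPLE, "PRON"))
```

Run over the full treebank files instead of `SAMPLE`, the same function should yield figures comparable to those above, provided the counting conventions match.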

The most frequent PRON lemmas (only 4 occur): se, jenž, on, veškerý

The 10 most frequent PRON types: se, nichž, němž, jej, němuž, je, jim, jí, jimiž, veškeré

The most frequent ambiguous lemmas (only 1 occurs): jenž (PRON 73, DET 21)

The most frequent ambiguous types (only 3 occur): se (PRON 467, ADP 34), je (AUX 189, PRON 11, VERB 10), jehož (DET 6, PRON 5)

se
- PRON 467: Výroční zpráva se nevyhotovuje v případech uvedených v §_20_odst._2 .
- ADP 34: Do nákladů nebo výnosů jsou zaúčtovány ve stejných obdobích , kdy jsou zaúčtovány náklady nebo výnosy spojené se zajišťovanými položkami .
je
- AUX 189: Spotřeba povolenek je vykázána bez ohledu na jejich následné vyřazení .
- PRON 11: V případě , že je nelze přiřadit , uvedou se v provozní činnosti .
- VERB 10: Hospodářským rokem je účetní období , které může začínat pouze prvním dnem jiného měsíce , než je leden .
jehož
- DET 6: (6) Účetní jednotky uvedené v odstavci 1 jsou povinny sestavovat odpisový plán , na jehož podkladě provádějí odpisování majetku v průběhu jeho používání .
- PRON 5: (4) Podpisovým záznamem se rozumí účetní záznam , jehož obsahem je vlastnoruční podpis nebo uznávaný elektronický podpis podle zvláštního právního předpisu , anebo obdobný průkazný účetní záznam v technické formě , který zaručuje průkaznou a jednoznačnou původnost .

Morphology

The form / lemma ratio of PRON is 9.250000 (the average of all parts of speech is 1.766716).
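The reported ratio is consistent with dividing the number of PRON types by the number of PRON lemmas given at the top of the page. A short check, assuming that this is how the ratio is defined:

```python
# Figures reported on this page for PRON in UD_Czech-CLTT.
pron_types = 37
pron_lemmas = 4

# Assumption: form/lemma ratio = distinct forms (types) / distinct lemmas.
ratio = pron_types / pron_lemmas
print(f"{ratio:.6f}")  # 9.250000
```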

The 1st highest number of forms (15) was observed with the lemma “on”: ho, je, jej, jemu, ji, jim, jimi, jí, nich, nim, nimi, ní, ním, ně, něj.

The 2nd highest number of forms (14) was observed with the lemma “jenž”: jehož, jenž, jež, jimiž, jímž, nichž, nimž, niž, nímž, níž, nějž, němuž, němž, něž.

The 3rd highest number of forms (4) was observed with the lemma “se”: se, sebou, si, sobě.

PRON occurs with 10 features: Case (631; 100% instances), PronType (631; 100% instances), Reflex (475; 75% instances), Variant (469; 74% instances), Number (156; 25% instances), PrepCase (93; 15% instances), Gender (90; 14% instances), Person (71; 11% instances), Style (12; 2% instances), Animacy (2; 0% instances)

PRON occurs with 22 feature-value pairs: Animacy=Inan, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Gender=Fem, Gender=Masc, Gender=Masc,Neut, Gender=Neut, Number=Plur, Number=Sing, Person=3, PrepCase=Npr, PrepCase=Pre, PronType=Prs, PronType=Rel, PronType=Tot, Reflex=Yes, Style=Arch, Variant=Short

PRON occurs with 49 feature combinations. The most frequent feature combination is Case=Acc|PronType=Prs|Reflex=Yes|Variant=Short (467 tokens). Examples: se
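A feature combination such as the one above is a `|`-separated list of `Feature=Value` pairs, as stored in the FEATS column of CoNLL-U. A minimal sketch of splitting it into a dictionary follows; the helper name `parse_feats` is hypothetical, and multi-valued features such as `Gender=Masc,Neut` are deliberately left as a single string.

```python
def parse_feats(feats):
    """Split a CoNLL-U FEATS string into a {feature: value} dict.

    An underscore means "no features" and yields an empty dict.
    """
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


feats = parse_feats("Case=Acc|PronType=Prs|Reflex=Yes|Variant=Short")
print(feats["Case"], feats["Reflex"])
```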

Relations

PRON nodes are attached to their parents using 11 different relations: expl:pass (353; 56% instances), expl:pv (113; 18% instances), obl (68; 11% instances), obj (47; 7% instances), nmod (24; 4% instances), obl:arg (9; 1% instances), nsubj (6; 1% instances), acl (4; 1% instances), iobj (4; 1% instances), conj (2; 0% instances), obl:agent (1; 0% instances)

Parents of PRON nodes belong to 6 different parts of speech: VERB (524; 83% instances), ADJ (60; 10% instances), NOUN (36; 6% instances), X (9; 1% instances), ADV (1; 0% instances), DET (1; 0% instances)

543 (86%) PRON nodes are leaves.

80 (13%) PRON nodes have one child.

4 (1%) PRON nodes have two children.

4 (1%) PRON nodes have three or more children.

The highest child degree of a PRON node is 3.
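As a sanity check, the four child-degree counts above should add up to the 631 PRON tokens, and the percentages should follow from rounding to the nearest whole percent. A small sketch, where rounding to the nearest integer is an assumption about how the page was generated:

```python
# Child-degree counts for PRON nodes as reported on this page.
degree_counts = {"0": 543, "1": 80, "2": 4, "3+": 4}

total = sum(degree_counts.values())

# Assumption: percentages are rounded to the nearest whole percent.
percentages = {k: round(100 * v / total) for k, v in degree_counts.items()}

print(total)
print(percentages)
```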

Children of PRON nodes are attached using 7 different relations: case (83; 83% instances), cop (4; 4% instances), nsubj (4; 4% instances), punct (4; 4% instances), cc (2; 2% instances), xcomp (2; 2% instances), advmod (1; 1% instances)

Children of PRON nodes belong to 7 different parts of speech: ADP (83; 83% instances), NOUN (5; 5% instances), AUX (4; 4% instances), PUNCT (4; 4% instances), CCONJ (2; 2% instances), ADJ (1; 1% instances), ADV (1; 1% instances)
