Statistics of PRON in UD


This page pertains to UD version 2.

Treebank Statistics: UD_Czech: POS Tags: `PRON`

There are 56 PRON lemmas (0%), 199 PRON types (0%) and 44925 PRON tokens (3%). Out of 17 observed tags, the rank of PRON is: 10 in number of lemmas, 8 in number of types and 10 in number of tokens.

The 10 most frequent PRON lemmas: se, on, já, jenž, co, kdo, což, nic, něco, ty

The 10 most frequent PRON types: se, si, co, nás, je, nám, nich, kdo, což, mu

The 10 most frequent ambiguous lemmas: on (PRON 7262, ADP 9, PART 1), já (PRON 3373, NOUN 1), jenž (PRON 2201, DET 658), co (PRON 1859, ADV 239, SCONJ 210, PART 21), což (PRON 748, INTJ 3, PART 1), I (NUM 97, PROPN 62, ADJ 17, PRON 16), all (ADV 1, PRON 1, ADJ 1), sa (PRON 4, PROPN 2), ja (PRON 3, PART 1), von (ADP 24, PRON 3)

The 10 most frequent ambiguous types: se (PRON 21370, ADP 1901), si (PRON 3737, AUX 1, VERB 1), co (PRON 1187, ADV 233, SCONJ 207, PART 7), je (AUX 10151, VERB 1986, PRON 887), nám (PRON 740, NOUN 5), což (PRON 631, INTJ 2), jež (PRON 318, PROPN 1), níž (PRON 303, ADV 2), já (PRON 187, NOUN 1), jí (PRON 271, VERB 5)

se
- PRON 21370: Z kukly se vyklubal motýl
- ADP 1901: Mohou zde porovnat svůj vývoj , záměry se světovými trendy .
si
- PRON 3737: Firma , která si je vyžádá , platí pouze náklady na jejich pobyt .
- AUX 1: Bollettieri tenkrát ke mně přišel a povídá : , Porazil si jednoho z nejlepších tenistů budoucnosti . ‘ .
- VERB 1: Nebylo by žádným uměním dospět ke sporu , když bychom si během téže úvahy vykládali chování světelných paprsků různými způsoby .
co
- PRON 1187: Nevíte , co kam započítat ?
- ADV 233: Samozřejmě nejen co do efektu , ale i co do nákladů .
- SCONJ 207: Učinil tak 24 hodin poté , co nabídku zprvu odmítl .
- PART 7: A co když by Maastricht v neděli neprošel ?
je
- AUX 10151: Váš obecně platný dotaz je připraven zodpovědět spolupracovník Profitu .
- VERB 1986: U každého výrobku je krátká charakteristika a kontaktní adresa výrobce .
- PRON 887: Změny jsou citelné , je třeba je lépe prezentovat
nám
- PRON 740: Zdůraznil , že banka bude půjčovat ne firmě , ale nám osobně .
- NOUN 5: Kontakt : CMC , nám . 5 . května 2 , 250 88 Čelákovice , tel . : ( 0202 ) 92151 - 9 , 92237 , FAX : ( 0202 ) 91997 .
což
- PRON 631: Vědí , že by byli považováni za arogantní , což by mohla být pravda .
- INTJ 2: Z naivních přání , která jsme v listopadu měli , se nám toto vyplnilo jen což .
jež
- PRON 318: To je prvé konstatování , jež mi v Netopýru scházelo .
- PROPN 1: ( jež ) ( ČT 2 - 20.10 )
níž
- PRON 303: A podmínka , bez níž to nejde ?
- ADV 2: V roce 1981 pro ni poslední místo ve skupině C v Pekingu ještě nemohlo znamenat sestup níž , poněvadž nebylo kam .
já
- PRON 187: My oba , ty i já , my všichni . “
- NOUN 1: Jenže falešné my a hrozba všemocného oni dokázaly některá já pěkně poznamenat a neochota nebo neschopnost vyprostit se z jejich objetí se občas připisuje jakémusi záhadnému ono : ono je to holt těžký . . .
jí
- PRON 271: Asi patnáctkrát jsem jí třísknul o zem , ale sotva jsem ji poškrábal .
- VERB 5: Převládá však názor , že jde o “ nějaký svátek , kdy se hodně jí a pije “ , či o jakýsi den slávy a posvícení , kdy “ vlaje naše československá vlajka “ .
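
The tag counts in the list above can be turned into a rough measure of how strongly each ambiguous form leans towards PRON. The following is a minimal sketch, not part of the original statistics: the helper name `pron_share` is hypothetical, and only four of the ten forms are copied in to keep the example short.

```python
# Tag counts copied from the list of ambiguous types on this page.
ambiguous_types = {
    "se": {"PRON": 21370, "ADP": 1901},
    "si": {"PRON": 3737, "AUX": 1, "VERB": 1},
    "co": {"PRON": 1187, "ADV": 233, "SCONJ": 207, "PART": 7},
    "je": {"AUX": 10151, "VERB": 1986, "PRON": 887},
}


def pron_share(tag_counts: dict[str, int]) -> float:
    """Fraction of a form's occurrences that are tagged PRON (hypothetical helper)."""
    return tag_counts.get("PRON", 0) / sum(tag_counts.values())


for form, counts in ambiguous_types.items():
    print(f"{form}: {pron_share(counts):.1%} PRON")
```

A form such as "se" is tagged PRON in the large majority of its occurrences, whereas "je" is tagged PRON only in a small minority, so a tagger has to rely on context much more heavily for the latter.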

Morphology

The form / lemma ratio of PRON is 3.553571 (the average of all parts of speech is 2.181792).
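
The ratio can be reproduced from the counts given at the top of this page, assuming it is defined as the number of distinct PRON types divided by the number of distinct PRON lemmas (the page does not state the definition, but this assumption yields the reported figure). A minimal sketch:

```python
# Counts quoted in the overview paragraph of this page.
pron_types = 199   # distinct PRON word forms
pron_lemmas = 56   # distinct PRON lemmas

# Assumed definition: forms (types) per lemma.
form_lemma_ratio = pron_types / pron_lemmas
print(f"{form_lemma_ratio:.6f}")  # prints 3.553571
```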

The 1st highest number of forms (28) was observed with the lemma “on”: ho, je, jeho, jej, jemu, ji, jich, jim, jimi, jí, jím, mu, ni, nich, nim, nimi, ní, ním, ně, něho, něj, něm, němu, on, ona, oni, ono, ony.

The 2nd highest number of forms (22) was observed with the lemma “jenž”: jehož, jejž, jemuž, jenž, jež, jichž, jimiž, jimž, již, jímž, jíž, nichž, nimiž, nimž, niž, nímž, níž, něhož, nějž, němuž, němž, něž.

The 3rd highest number of forms (11) was observed with the lemma “samý”: samou, samá, samé, samého, samém, samému, samí, samý, samých, samým, samými.

PRON occurs with 12 features: PronType (44925; 100% instances), Case (44862; 100% instances), Variant (27181; 61% instances), Reflex (25786; 57% instances), Number (13609; 30% instances), Person (11130; 25% instances), Gender (7932; 18% instances), PrepCase (4925; 11% instances), Animacy (3633; 8% instances), Style (312; 1% instances), Foreign (62; 0% instances), NameType (12; 0% instances)

PRON occurs with 35 feature-value pairs: Animacy=Anim, Animacy=Inan, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Case=Voc, Foreign=Yes, Gender=Fem, Gender=Masc, Gender=Masc,Neut, Gender=Neut, NameType=Com, NameType=Oth, NameType=Pro, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, PrepCase=Npr, PrepCase=Pre, PronType=Ind, PronType=Int,Rel, PronType=Neg, PronType=Prs, PronType=Rel, PronType=Tot, Reflex=Yes, Style=Arch, Style=Coll, Style=Vrnc, Variant=Short

PRON occurs with 215 feature combinations. The most frequent feature combination is Case=Acc|PronType=Prs|Reflex=Yes|Variant=Short (21416 tokens). Examples: se
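
A feature combination such as the one above is written in the CoNLL-U FEATS notation: `Feature=Value` pairs joined by `|`. The sketch below shows one way to split such a string into a mapping; the function name `parse_feats` is hypothetical and not part of any UD tool.

```python
def parse_feats(feats: str) -> dict[str, str]:
    """Split a CoNLL-U FEATS string into a feature -> value mapping."""
    if feats == "_":  # CoNLL-U writes "_" for an empty feature set
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


combo = "Case=Acc|PronType=Prs|Reflex=Yes|Variant=Short"
features = parse_feats(combo)
print(features["Reflex"])  # prints Yes
```

Multi-valued features such as `PronType=Int,Rel` are kept as a single string value in this sketch; a caller that needs the individual values can split on the comma.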

Relations

PRON nodes are attached to their parents using 27 different relations: expl:pv (17180; 38% instances), obj (7390; 16% instances), obl (5587; 12% instances), expl:pass (4906; 11% instances), nsubj (3706; 8% instances), iobj (2021; 4% instances), nmod (1584; 4% instances), obl:arg (1263; 3% instances), conj (275; 1% instances), root (194; 0% instances), nsubj:pass (191; 0% instances), discourse (157; 0% instances), dep (144; 0% instances), advcl (52; 0% instances), orphan (47; 0% instances), acl (44; 0% instances), xcomp (43; 0% instances), flat:foreign (38; 0% instances), ccomp (36; 0% instances), appos (33; 0% instances), obl:agent (20; 0% instances), parataxis (7; 0% instances), cc (2; 0% instances), det (2; 0% instances), csubj (1; 0% instances), csubj:pass (1; 0% instances), fixed (1; 0% instances)

Parents of PRON nodes belong to 14 different parts of speech: VERB (39920; 89% instances), ADJ (2085; 5% instances), NOUN (1794; 4% instances), ADV (300; 1% instances), DET (228; 1% instances), [root, no parent] (194; 0% instances), NUM (183; 0% instances), PRON (142; 0% instances), PROPN (52; 0% instances), SYM (11; 0% instances), PART (10; 0% instances), CCONJ (3; 0% instances), INTJ (2; 0% instances), AUX (1; 0% instances)
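
Counts like the ones above can be gathered by walking a CoNLL-U file and recording, for each PRON node, its relation (DEPREL column) and the UPOS tag of its parent (looked up through the HEAD column). The sketch below illustrates the idea; it is not the script that generated this page. The function name `pron_relation_counts` is hypothetical, and the one-sentence sample is a hand-written illustration of the example "Z kukly se vyklubal motýl", not a record copied from the treebank.

```python
from collections import Counter

# Hand-written illustrative annotation (an assumption, not treebank data).
# Columns: ID FORM LEMMA UPOS XPOS FEATS HEAD DEPREL DEPS MISC
SAMPLE = "\n".join([
    "# text = Z kukly se vyklubal motýl",
    "1\tZ\tz\tADP\t_\t_\t2\tcase\t_\t_",
    "2\tkukly\tkukla\tNOUN\t_\t_\t4\tobl\t_\t_",
    "3\tse\tse\tPRON\t_\t_\t4\texpl:pv\t_\t_",
    "4\tvyklubal\tvyklubat\tVERB\t_\t_\t0\troot\t_\t_",
    "5\tmotýl\tmotýl\tNOUN\t_\t_\t4\tnsubj\t_\t_",
    "",
])


def pron_relation_counts(conllu: str) -> tuple[Counter, Counter]:
    """Count relations of PRON nodes and the UPOS tags of their parents."""
    relations: Counter = Counter()
    parent_tags: Counter = Counter()
    sentence: list[list[str]] = []  # token rows of the current sentence

    def flush() -> None:
        upos_by_id = {row[0]: row[3] for row in sentence}
        for row in sentence:
            if row[3] == "PRON":
                relations[row[7]] += 1
                # HEAD "0" has no token row, so it is reported as ROOT.
                parent_tags[upos_by_id.get(row[6], "ROOT")] += 1
        sentence.clear()

    for line in conllu.splitlines():
        if not line.strip():          # a blank line ends a sentence
            flush()
        elif not line.startswith("#"):
            cols = line.split("\t")
            if cols[0].isdigit():     # skip multiword ranges (1-2) and empty nodes (1.1)
                sentence.append(cols)
    flush()                           # handle a final sentence without trailing blank line
    return relations, parent_tags


relations, parent_tags = pron_relation_counts(SAMPLE)
print(relations, parent_tags)
```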

36837 (82%) PRON nodes are leaves.

6999 (16%) PRON nodes have one child.

607 (1%) PRON nodes have two children.

482 (1%) PRON nodes have three or more children.

The highest child degree of a PRON node is 10.
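
As a consistency check, the four child-degree counts above should add up to the total number of PRON tokens (44925), and the percentages should follow from that total. The sketch below assumes the page rounds to the nearest whole percent, which is not stated but matches the reported values.

```python
total_pron = 44925  # PRON tokens, from the overview paragraph

degree_counts = {
    "0 children": 36837,
    "1 child": 6999,
    "2 children": 607,
    "3 or more": 482,
}

# The four classes partition all PRON nodes.
assert sum(degree_counts.values()) == total_pron

# Assumed rounding rule: nearest whole percent.
percentages = {label: round(100 * n / total_pron) for label, n in degree_counts.items()}
print(percentages)
# {'0 children': 82, '1 child': 16, '2 children': 1, '3 or more': 1}
```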

Children of PRON nodes are attached using 30 different relations: case (6197; 61% instances), punct (604; 6% instances), amod (505; 5% instances), advmod:emph (396; 4% instances), conj (336; 3% instances), xcomp (272; 3% instances), cc (265; 3% instances), cop (208; 2% instances), nmod (191; 2% instances), nsubj (166; 2% instances), acl (135; 1% instances), orphan (129; 1% instances), appos (117; 1% instances), nummod:gov (105; 1% instances), mark (98; 1% instances), advmod (90; 1% instances), dep (73; 1% instances), det (67; 1% instances), flat:foreign (35; 0% instances), obl (34; 0% instances), advcl (28; 0% instances), det:numgov (18; 0% instances), discourse (18; 0% instances), csubj (13; 0% instances), nummod (12; 0% instances), aux (9; 0% instances), parataxis (2; 0% instances), ccomp (1; 0% instances), obj (1; 0% instances), obl:arg (1; 0% instances)

Children of PRON nodes belong to 15 different parts of speech: ADP (6182; 61% instances), NOUN (694; 7% instances), PUNCT (604; 6% instances), ADJ (570; 6% instances), CCONJ (420; 4% instances), ADV (357; 4% instances), DET (292; 3% instances), VERB (235; 2% instances), AUX (217; 2% instances), PRON (142; 1% instances), NUM (125; 1% instances), PROPN (100; 1% instances), SCONJ (97; 1% instances), PART (90; 1% instances), SYM (1; 0% instances)
