
This page still pertains to UD version 1.

PRON: pronoun

Definition

Pronouns are words that substitute for nouns or noun phrases, whose meaning is recoverable from the linguistic or extralinguistic context.

Pronouns under this definition function like nouns. Note that Russian grammar traditionally extends the term pronoun to words that substitute for adjectives. Such words are not tagged PRON under our universal scheme; they are tagged as determiners (DET) so that the same phenomenon is annotated the same way across languages.

For instance, это “this” is traditionally called a pronoun in Russian grammar regardless of context (the notion of determiner does not exist in Russian grammar). To keep the annotation parallel across languages, it should be tagged PRON in Я видел это вчера. “I saw this yesterday.” and DET in Я видел эту машину вчера. “I saw this car yesterday.”
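The contrast can be made concrete with a small hand-annotated sketch. The tuples below are a simplified stand-in for CoNLL-U rows; the lemmas, heads and relations are an illustrative annotation written for this example, not treebank data, and the helper name `upos_of` is hypothetical.

```python
# Simplified rows: (id, form, lemma, upos, head, deprel).
# Hand-written illustration of the two example sentences, not treebank data.
SENT_PRON = [
    (1, "Я", "я", "PRON", 2, "nsubj"),
    (2, "видел", "видеть", "VERB", 0, "root"),
    (3, "это", "это", "PRON", 2, "obj"),      # stands alone as an argument
    (4, "вчера", "вчера", "ADV", 2, "advmod"),
    (5, ".", ".", "PUNCT", 2, "punct"),
]
SENT_DET = [
    (1, "Я", "я", "PRON", 2, "nsubj"),
    (2, "видел", "видеть", "VERB", 0, "root"),
    (3, "эту", "этот", "DET", 4, "det"),      # modifies the noun "машину"
    (4, "машину", "машина", "NOUN", 2, "obj"),
    (5, "вчера", "вчера", "ADV", 2, "advmod"),
    (6, ".", ".", "PUNCT", 2, "punct"),
]


def upos_of(sentence, form):
    """Return the UPOS tags assigned to a given word form in a sentence."""
    return [upos for _, f, _, upos, _, _ in sentence if f == form]
```

Under this annotation the demonstrative is PRON when it fills an argument slot on its own and DET when it attaches to a noun.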

Examples


Treebank Statistics (UD_Russian)

There are 27 PRON lemmas (0%), 90 PRON types (0%) and 1697 PRON tokens (2%). Out of 16 observed tags, the rank of PRON is: 10 in number of lemmas, 10 in number of types and 10 in number of tokens.
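Counts of this kind could be derived from (form, lemma, UPOS) triples roughly as follows. This is a minimal sketch, not the script that produced the figures above: the function names, the toy rows, and the decision to lowercase forms when counting types are all assumptions.

```python
from collections import defaultdict


def tag_inventory(rows):
    """Count distinct lemmas, distinct types and tokens per UPOS tag.

    rows: iterable of (form, lemma, upos) triples.
    Returns {upos: (n_lemmas, n_types, n_tokens)}.
    """
    lemmas, types, tokens = defaultdict(set), defaultdict(set), defaultdict(int)
    for form, lemma, upos in rows:
        lemmas[upos].add(lemma)
        types[upos].add(form.lower())  # assumption: types are case-folded
        tokens[upos] += 1
    return {t: (len(lemmas[t]), len(types[t]), tokens[t]) for t in tokens}


def rank(inventory, tag, column):
    """1-based rank of `tag` when tags are sorted by one count, descending."""
    ordered = sorted(inventory, key=lambda t: inventory[t][column], reverse=True)
    return ordered.index(tag) + 1


# Toy rows, made up for illustration.
ROWS = [
    ("Он", "он", "PRON"), ("его", "он", "PRON"), ("он", "он", "PRON"),
    ("дом", "дом", "NOUN"), ("дома", "дом", "NOUN"),
    ("сад", "сад", "NOUN"), ("видел", "видеть", "VERB"),
]
inventory = tag_inventory(ROWS)
```

Here `column` selects lemmas (0), types (1) or tokens (2); ties are broken by the order in which tags were first seen.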

The 10 most frequent PRON lemmas: ОН, КОТОРЫЙ, ТО, ОНИ, ОНА, ЭТО, СЕБЯ, ЧТО, МЫ, Я

The 10 most frequent PRON types: он, который, они, она, которые, это, его, того, что, которой

The 10 most frequent ambiguous lemmas: ТО (PRON 167, ADV 22, CCONJ 7, ADP 2, SCONJ 2), ЭТО (PRON 128, AUX 25, PART 1), ЧТО (SCONJ 227, PRON 66, DET 12, ADP 10), Я (PRON 26, NOUN 1), ВСЁ (PRON 19, ADV 10, PART 3), Т. (PRON 11, ADV 7, SCONJ 2), I (ADJ 19, NOUN 3, PRON 1), ME (PRON 1, NOUN 1)

The 10 most frequent ambiguous types: это (PRON 46, AUX 22, DET 22, PART 1), его (DET 164, PRON 59), того (PRON 54, DET 15), что (SCONJ 227, PRON 45, DET 11, ADP 10), тем (PRON 33, DET 6, ADV 1), им (PRON 31, ADJ 1), том (PRON 34, DET 34, NOUN 2), то (DET 32, PRON 28, ADV 22, CCONJ 7, ADP 2, SCONJ 2), их (DET 62, PRON 27), этом (PRON 27, DET 18)
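A type counts as ambiguous here when the same word form is observed with more than one tag. One way to collect such forms is sketched below; the helper name and the toy rows are hypothetical, and case-folding the forms is an assumption.

```python
from collections import Counter, defaultdict


def ambiguous_types(rows, tag):
    """Return {form: Counter of UPOS} for forms seen with `tag` and another tag.

    rows: iterable of (form, upos) pairs.
    """
    by_form = defaultdict(Counter)
    for form, upos in rows:
        by_form[form.lower()][upos] += 1
    return {f: c for f, c in by_form.items() if tag in c and len(c) > 1}


# Toy rows, made up for illustration.
ROWS = [
    ("это", "PRON"), ("это", "DET"), ("Это", "PRON"),
    ("он", "PRON"), ("что", "SCONJ"),
]
result = ambiguous_types(ROWS, "PRON")
```

The same function applied to (lemma, UPOS) pairs would give the ambiguous lemmas instead.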

Morphology

The form / lemma ratio of PRON is 3.333333 (the average of all parts of speech is 1.576680).

The 1st highest number of forms (12) was observed with the lemma “КОТОРЫЙ”: которая, которого, которое, которой, котором, которому, которую, которые, который, которым, которыми, которых.

The 2nd highest number of forms (9) was observed with the lemma “ОН”: его, ему, им, него, нем, нему, ним, нём, он.

The 3rd highest number of forms (8) was observed with the lemma “ОНА”: ее, ей, ею, её, нее, ней, неё, она.
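The per-lemma form counts can be reproduced by grouping forms under their lemma. The sketch below uses the forms listed above for ОН and ОНА; the function name is hypothetical. The reported ratio of 3.333333 is consistent with dividing the 90 types by the 27 lemmas, which is an observation about the numbers rather than a documented formula.

```python
from collections import defaultdict


def forms_by_lemma(pairs):
    """Map each lemma to the sorted list of its distinct word forms."""
    forms = defaultdict(set)
    for form, lemma in pairs:
        forms[lemma].add(form)
    return {lemma: sorted(fs) for lemma, fs in forms.items()}


# Forms listed above for two UD_Russian lemmas.
PAIRS = [(f, "ОН") for f in "его ему им него нем нему ним нём он".split()]
PAIRS += [(f, "ОНА") for f in "ее ей ею её нее ней неё она".split()]

table = forms_by_lemma(PAIRS)
# Lemmas ordered by number of distinct forms, largest first.
ranking = sorted(table, key=lambda lemma: len(table[lemma]), reverse=True)

# Assumption: ratio = number of types / number of lemmas.
ratio_ud_russian = round(90 / 27, 6)
```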

PRON occurs with 6 features: ru-feat/Case (1697; 100% instances), ru-feat/Number (1621; 96% instances), ru-feat/Gender (1249; 74% instances), ru-feat/Animacy (818; 48% instances), ru-feat/Person (803; 47% instances), ru-feat/Reflex (76; 4% instances)

PRON occurs with 18 feature-value pairs: Animacy=Anim, Animacy=Inan, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Case=Voc, Gender=Fem, Gender=Masc, Gender=Neut, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Reflex=Yes

PRON occurs with 83 feature combinations. The most frequent feature combination is Case=Nom|Gender=Masc|Number=Sing|Person=3 (237 tokens). Examples: он, He
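Feature statistics like these rest on splitting the CoNLL-U FEATS column into name=value pairs. A minimal sketch follows; the function names are hypothetical and the second and third toy values are made up, while the first is the most frequent combination quoted above.

```python
from collections import Counter


def parse_feats(feats):
    """Parse a FEATS string such as 'Case=Nom|Number=Sing' into a dict."""
    if feats in ("", "_"):  # "_" marks an empty feature set in CoNLL-U
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def feature_coverage(feats_column):
    """Count, for each feature name, how many tokens carry it."""
    counts = Counter()
    for feats in feats_column:
        counts.update(parse_feats(feats).keys())
    return counts


FEATS = [
    "Case=Nom|Gender=Masc|Number=Sing|Person=3",
    "Case=Acc|Reflex=Yes",  # made-up value for illustration
    "_",
]
coverage = feature_coverage(FEATS)

# Share of PRON tokens carrying Number in UD_Russian, from the counts above.
number_share = round(100 * 1621 / 1697)
```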

Relations

PRON nodes are attached to their parents using 18 different relations: ru-dep/nsubj (668; 39% instances), ru-dep/obl (372; 22% instances), ru-dep/nmod (210; 12% instances), ru-dep/obj (190; 11% instances), ru-dep/iobj (156; 9% instances), ru-dep/nsubj:pass (37; 2% instances), ru-dep/advmod (19; 1% instances), ru-dep/mark (11; 1% instances), ru-dep/case (7; 0% instances), ru-dep/det (7; 0% instances), ru-dep/fixed (6; 0% instances), ru-dep/goeswith (4; 0% instances), ru-dep/obl:agent (4; 0% instances), ru-dep/vocative (2; 0% instances), ru-dep/appos (1; 0% instances), ru-dep/conj (1; 0% instances), ru-dep/discourse (1; 0% instances), ru-dep/root (1; 0% instances)

Parents of PRON nodes belong to 11 different parts of speech: VERB (1328; 78% instances), NOUN (206; 12% instances), ADJ (82; 5% instances), ADV (27; 2% instances), ADP (23; 1% instances), NUM (11; 1% instances), PROPN (6; 0% instances), DET (5; 0% instances), PUNCT (5; 0% instances), SYM (3; 0% instances), ROOT (1; 0% instances)
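Both distributions come from following each PRON node's HEAD pointer. The sketch below shows one way to tally them for a single sentence; the function name is hypothetical and the annotation of the sample sentence is illustrative only.

```python
from collections import Counter


def attachment_stats(sentence, tag):
    """Count relations and parent UPOS tags for nodes with the given UPOS.

    sentence: list of (id, upos, head, deprel); head 0 is the artificial root.
    """
    upos_by_id = {node_id: upos for node_id, upos, _, _ in sentence}
    relations, parents = Counter(), Counter()
    for _, upos, head, deprel in sentence:
        if upos == tag:
            relations[deprel] += 1
            # Head 0 has no row of its own, so it is reported as ROOT.
            parents[upos_by_id.get(head, "ROOT")] += 1
    return relations, parents


# "Я видел это вчера." with an illustrative hand annotation.
SENT = [
    (1, "PRON", 2, "nsubj"), (2, "VERB", 0, "root"),
    (3, "PRON", 2, "obj"), (4, "ADV", 2, "advmod"), (5, "PUNCT", 2, "punct"),
]
relations, parents = attachment_stats(SENT, "PRON")
```

Summing the two counters over all sentences of a treebank would give tables of the shape shown above.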

1125 (66%) PRON nodes are leaves.

469 (28%) PRON nodes have one child.

87 (5%) PRON nodes have two children.

16 (1%) PRON nodes have three or more children.

The highest child degree of a PRON node is 11.
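The child-degree breakdown can be obtained by counting how often each node id occurs as a HEAD. A minimal sketch, with a hypothetical function name and an illustrative annotation of Я думал о нём. “I thought about him.”:

```python
from collections import Counter


def child_degree_histogram(sentence, tag):
    """Bucket nodes with the given UPOS by number of children: 0, 1, 2, '3+'.

    sentence: list of (id, upos, head) triples.
    """
    n_children = Counter(head for _, _, head in sentence)
    hist = Counter()
    for node_id, upos, _ in sentence:
        if upos == tag:
            degree = n_children[node_id]
            hist[degree if degree < 3 else "3+"] += 1
    return hist


# "Я думал о нём." : the preposition "о" attaches to the pronoun "нём".
SENT = [
    (1, "PRON", 2), (2, "VERB", 0), (3, "ADP", 4),
    (4, "PRON", 2), (5, "PUNCT", 2),
]
hist = child_degree_histogram(SENT, "PRON")

# The four buckets reported above add up to the PRON token count.
total = 1125 + 469 + 87 + 16
```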

Children of PRON nodes are attached using 21 different relations: ru-dep/case (486; 69% instances), ru-dep/punct (48; 7% instances), ru-dep/fixed (43; 6% instances), ru-dep/acl:relcl (31; 4% instances), ru-dep/discourse (16; 2% instances), ru-dep/amod (13; 2% instances), ru-dep/ccomp (12; 2% instances), ru-dep/det (11; 2% instances), ru-dep/advmod (7; 1% instances), ru-dep/goeswith (6; 1% instances), ru-dep/advcl (5; 1% instances), ru-dep/appos (5; 1% instances), ru-dep/conj (5; 1% instances), ru-dep/nmod (4; 1% instances), ru-dep/acl (3; 0% instances), ru-dep/cc:preconj (1; 0% instances), ru-dep/nsubj (1; 0% instances), ru-dep/nummod (1; 0% instances), ru-dep/nummod:gov (1; 0% instances), ru-dep/obj (1; 0% instances), ru-dep/parataxis (1; 0% instances)

Children of PRON nodes belong to 11 different parts of speech: ADP (485; 69% instances), VERB (52; 7% instances), PUNCT (51; 7% instances), PART (35; 5% instances), ADV (24; 3% instances), NOUN (22; 3% instances), ADJ (15; 2% instances), DET (12; 2% instances), NUM (2; 0% instances), PROPN (2; 0% instances), CCONJ (1; 0% instances)


Treebank Statistics (UD_Russian-SynTagRus)

There are 23 PRON lemmas (0%), 111 PRON types (0%) and 45581 PRON tokens (5%). Out of 18 observed tags, the rank of PRON is: 12 in number of lemmas, 9 in number of types and 7 in number of tokens.

The 10 most frequent PRON lemmas: он, они, это, который, то, я, она, мы, что, все

The 10 most frequent PRON types: он, это, его, я, их, мы, что, они, ее, которые

The 10 most frequent ambiguous lemmas: это (PRON 4795, PART 635), то (PRON 3741, SCONJ 1055, PART 209), что (SCONJ 7148, PRON 2498, PART 1), все (PRON 1847, PART 505), вы (PRON 944, X 1), нечего (PRON 21, NOUN 9, ADV 8), некого (PRON 4, NOUN 2)

The 10 most frequent ambiguous types: это (PRON 2421, PART 616, DET 364), их (PRON 1861, NOUN 1), что (SCONJ 7120, PRON 1457, NOUN 1), то (SCONJ 1042, PRON 862, DET 262, PART 209), все (DET 1036, PRON 886, PART 466), того (PRON 868, DET 154), том (PRON 818, DET 366, NOUN 4), этом (PRON 718, DET 454, PART 1), тем (PRON 521, DET 147, SCONJ 80, NOUN 15), вы (PRON 428, X 1)

Morphology

The form / lemma ratio of PRON is 4.826087 (the average of all parts of speech is 2.644632).

The 1st highest number of forms (12) was observed with the lemma “который”: которая, которого, которое, которой, котором, которому, которую, которые, который, которым, которыми, которых.

The 2nd highest number of forms (11) was observed with the lemma “то”: т, т., т.е, т.е., т.п, т.п., тем, то, того, том, тому.

The 3rd highest number of forms (9) was observed with the lemma “он”: его, ему, им, него, нем, нему, ним, нём, он.

PRON occurs with 5 features: ru-feat/Case (45573; 100% instances), ru-feat/Number (35173; 77% instances), ru-feat/Person (24759; 54% instances), ru-feat/Gender (21515; 47% instances), ru-feat/Animacy (10414; 23% instances)

PRON occurs with 16 feature-value pairs: Animacy=Anim, Animacy=Inan, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Gender=Fem, Gender=Masc, Gender=Neut, Number=Plur, Number=Sing, Person=1, Person=2, Person=3

PRON occurs with 77 feature combinations. The most frequent feature combination is Case=Nom (4312 tokens). Examples: он, это, что, я, которые, мы, они, кто, она, который

Relations

PRON nodes are attached to their parents using 26 different relations: ru-dep/nsubj (18932; 42% instances), ru-dep/obl (9321; 20% instances), ru-dep/nmod (7686; 17% instances), ru-dep/obj (5024; 11% instances), ru-dep/root (857; 2% instances), ru-dep/nsubj:pass (669; 1% instances), ru-dep/conj (479; 1% instances), ru-dep/mark (458; 1% instances), ru-dep/parataxis (366; 1% instances), ru-dep/iobj (348; 1% instances), ru-dep/fixed (271; 1% instances), ru-dep/advmod (222; 0% instances), ru-dep/dep (206; 0% instances), ru-dep/obl:agent (203; 0% instances), ru-dep/advcl (130; 0% instances), ru-dep/discourse (126; 0% instances), ru-dep/orphan (111; 0% instances), ru-dep/acl:relcl (61; 0% instances), ru-dep/expl (34; 0% instances), ru-dep/amod (28; 0% instances), ru-dep/acl (13; 0% instances), ru-dep/appos (12; 0% instances), ru-dep/flat:name (11; 0% instances), ru-dep/ccomp (10; 0% instances), ru-dep/xcomp (2; 0% instances), ru-dep/cc (1; 0% instances)

Parents of PRON nodes belong to 18 different parts of speech: VERB (30839; 68% instances), NOUN (8147; 18% instances), ADJ (3237; 7% instances), ADV (1134; 2% instances), ROOT (857; 2% instances), PRON (504; 1% instances), ADP (237; 1% instances), PROPN (167; 0% instances), NUM (151; 0% instances), DET (116; 0% instances), PART (114; 0% instances), _ (25; 0% instances), PUNCT (23; 0% instances), CCONJ (16; 0% instances), AUX (5; 0% instances), SCONJ (5; 0% instances), INTJ (2; 0% instances), X (2; 0% instances)

28770 (63%) PRON nodes are leaves.

10638 (23%) PRON nodes have one child.

2897 (6%) PRON nodes have two children.

3276 (7%) PRON nodes have three or more children.

The highest child degree of a PRON node is 11.

Children of PRON nodes are attached using 31 different relations: ru-dep/case (9484; 33% instances), ru-dep/punct (7260; 25% instances), ru-dep/advmod (2226; 8% instances), ru-dep/advcl (1671; 6% instances), ru-dep/mark (1511; 5% instances), ru-dep/nsubj (1116; 4% instances), ru-dep/amod (972; 3% instances), ru-dep/fixed (921; 3% instances), ru-dep/cc (891; 3% instances), ru-dep/conj (555; 2% instances), ru-dep/acl (522; 2% instances), ru-dep/cop (476; 2% instances), ru-dep/parataxis (411; 1% instances), ru-dep/nmod (321; 1% instances), ru-dep/acl:relcl (252; 1% instances), ru-dep/nummod:gov (176; 1% instances), ru-dep/appos (144; 0% instances), ru-dep/obl (66; 0% instances), ru-dep/_ (34; 0% instances), ru-dep/orphan (27; 0% instances), ru-dep/aux (14; 0% instances), ru-dep/obj (7; 0% instances), ru-dep/nummod (6; 0% instances), ru-dep/root (6; 0% instances), ru-dep/flat:name (5; 0% instances), ru-dep/iobj (5; 0% instances), ru-dep/discourse (4; 0% instances), ru-dep/dep (2; 0% instances), ru-dep/xcomp (2; 0% instances), ru-dep/ccomp (1; 0% instances), ru-dep/nsubj:pass (1; 0% instances)

Children of PRON nodes belong to 16 different parts of speech: ADP (9450; 32% instances), PUNCT (7260; 25% instances), VERB (3128; 11% instances), PART (1909; 7% instances), SCONJ (1885; 6% instances), NOUN (1679; 6% instances), ADJ (1056; 4% instances), ADV (908; 3% instances), CCONJ (535; 2% instances), DET (492; 2% instances), PRON (431; 1% instances), PROPN (135; 0% instances), AUX (112; 0% instances), NUM (72; 0% instances), _ (34; 0% instances), INTJ (3; 0% instances)


PRON in other languages: [am] [ar] [bg] [bxr] [ca] [ckb] [cop] [cs] [cu] [da] [de] [el] [en] [es] [et] [eu] [fa] [fi] [fo] [fr] [ga] [gl] [got] [grc] [he] [hi] [hr] [hu] [id] [it] [ja] [kk] [kmr] [ko] [la] [lv] [mr] [nl] [no] [pl] [pt] [ro] [ru] [sa] [sk] [sla] [sl] [so] [sr] [sv] [swl] [ta] [tr] [ug] [uk] [u] [urj] [ur] [vi] [yue] [zh]