PRON
: pronoun
Definition
Pronouns are words that substitute for nouns or noun phrases, whose meaning is recoverable from the linguistic or extralinguistic context.
Pronouns under this definition function like nouns. Note that
Russian grammar traditionally extends the term pronoun to words that
substitute for adjectives. Such words are not tagged PRON
under our universal scheme. They are tagged as determiners in
order to annotate the same thing same way across languages.
For instance, ‘это “this” is traditionally called pronoun in
Russian grammar, regardless of context (the notion of determiners does
not exist in Russian grammar). To make the annotation parallel across
languages, it should be now tagged PRON
in Я видел это вчера. “I saw this yesterday.” and DET
in
Я видел эту машину вчера. “I saw this car yesterday.”
Examples
- personal pronouns: я, ты, он, она, оно, мы, вы, они “I, you, he, she, it, we, you, they”
- reflexive pronouns: себе, сам “oneself”
- demonstrative pronouns: это as in Я видел это вчера. “I saw this yesterday.”
- interrogative pronouns: кто, что “who, what” as in Что ты думаешь? “What do you think?”
- relative pronouns: кто, что “who, what” as in Мне интересно, что ты думаешь. “I wonder what you think.”
- indefinite pronouns: кто-то, что-то “somebody, something”
- total pronouns: каждый, все “everybody, all”
- negative pronouns: никто, ничто “nobody, nothing”
Treebank Statistics (UD_Russian)
There are 27 PRON
lemmas (0%), 90 PRON
types (0%) and 1697 PRON
tokens (2%).
Out of 16 observed tags, the rank of PRON
is: 10 in number of lemmas, 10 in number of types and 10 in number of tokens.
The 10 most frequent PRON
lemmas: ОН, КОТОРЫЙ, ТО, ОНИ, ОНА, ЭТО, СЕБЯ, ЧТО, МЫ, Я
The 10 most frequent PRON
types: он, который, они, она, которые, это, его, того, что, которой
The 10 most frequent ambiguous lemmas: ТО (PRON 167, ADV 22, CCONJ 7, ADP 2, SCONJ 2), ЭТО (PRON 128, AUX 25, PART 1), ЧТО (SCONJ 227, PRON 66, DET 12, ADP 10), Я (PRON 26, NOUN 1), ВСЁ (PRON 19, ADV 10, PART 3), Т. (PRON 11, ADV 7, SCONJ 2), I (ADJ 19, NOUN 3, PRON 1), ME (PRON 1, NOUN 1)
The 10 most frequent ambiguous types: это (PRON 46, AUX 22, DET 22, PART 1), его (DET 164, PRON 59), того (PRON 54, DET 15), что (SCONJ 227, PRON 45, DET 11, ADP 10), тем (PRON 33, DET 6, ADV 1), им (PRON 31, ADJ 1), том (PRON 34, DET 34, NOUN 2), то (DET 32, PRON 28, ADV 22, CCONJ 7, ADP 2, SCONJ 2), их (DET 62, PRON 27), этом (PRON 27, DET 18)
- это
- PRON 46: Для обеих японок это стало олимпийским дебютом .
- AUX 22: Рэгги - метал – это музыкальный жанр , сплав рэгги и метала .
- DET 22: До наших дней не сохранилось ни одного представителя это модели .
- PART 1: Первая вышедшая в открытую продажу баночка крема NIVEA была причудливо оформлена в соответствии с веяниями тогдашней моды – это был стиль Art Nouveau .
- его
- того
- что
- SCONJ 227: Возможно , что причиной аварии стало недомогание немца .
- PRON 45: что соответствует следующему разностному уравнению
- DET 11: Поэтому то , что произошло в следующем году , явилось самой настоящей сенсацией .
- ADP 10: Я за возрождение религиозной веры , потому что это не привнесенное извне , это органичное состояние человека , которое сформировано в течение сотен тысяч лет .
- тем
- им
- том
- то
- DET 32: Абсолют делим ( бхеда ) и неделим ( а - бхеда ) в одно и то же время .
- PRON 28: И ты деи его такъжо в то не въвазывал .
- ADV 22: Если диски не попали в корт соперников , то очко ( или 2 ) засчитывается противнику .
- CCONJ 7: Далее больше атаковали хозяева , но римлян выручали то Марко Баллотта , то его защитники .
- ADP 2: Если налог перелагается , то это значит , что он выступает в роли особого ценообразующего фактора .
- SCONJ 2: Возможно , существует проход , который соединяет Темное Озеро с одной из подземных рек , протекающей в Скуллпорте , и если удастся обнаружить этот проход и открывающий его ключ , то это сильно поможет облегчить торговлю между Подземьем и поверхностью .
- их
- этом
Morphology
The form / lemma ratio of PRON
is 3.333333 (the average of all parts of speech is 1.576680).
The 1st highest number of forms (12) was observed with the lemma “КОТОРЫЙ”: которая, которого, которое, которой, котором, которому, которую, которые, который, которым, которыми, которых.
The 2nd highest number of forms (9) was observed with the lemma “ОН”: его, ему, им, него, нем, нему, ним, нём, он.
The 3rd highest number of forms (8) was observed with the lemma “ОНА”: ее, ей, ею, её, нее, ней, неё, она.
PRON
occurs with 6 features: ru-feat/Case (1697; 100% instances), ru-feat/Number (1621; 96% instances), ru-feat/Gender (1249; 74% instances), ru-feat/Animacy (818; 48% instances), ru-feat/Person (803; 47% instances), ru-feat/Reflex (76; 4% instances)
PRON
occurs with 18 feature-value pairs: Animacy=Anim
, Animacy=Inan
, Case=Acc
, Case=Dat
, Case=Gen
, Case=Ins
, Case=Loc
, Case=Nom
, Case=Voc
, Gender=Fem
, Gender=Masc
, Gender=Neut
, Number=Plur
, Number=Sing
, Person=1
, Person=2
, Person=3
, Reflex=Yes
PRON
occurs with 83 feature combinations.
The most frequent feature combination is Case=Nom|Gender=Masc|Number=Sing|Person=3
(237 tokens).
Examples: он, He
Relations
PRON
nodes are attached to their parents using 18 different relations: ru-dep/nsubj (668; 39% instances), ru-dep/obl (372; 22% instances), ru-dep/nmod (210; 12% instances), ru-dep/obj (190; 11% instances), ru-dep/iobj (156; 9% instances), ru-dep/nsubj:pass (37; 2% instances), ru-dep/advmod (19; 1% instances), ru-dep/mark (11; 1% instances), ru-dep/case (7; 0% instances), ru-dep/det (7; 0% instances), ru-dep/fixed (6; 0% instances), ru-dep/goeswith (4; 0% instances), ru-dep/obl:agent (4; 0% instances), ru-dep/vocative (2; 0% instances), ru-dep/appos (1; 0% instances), ru-dep/conj (1; 0% instances), ru-dep/discourse (1; 0% instances), ru-dep/root (1; 0% instances)
Parents of PRON
nodes belong to 11 different parts of speech: VERB (1328; 78% instances), NOUN (206; 12% instances), ADJ (82; 5% instances), ADV (27; 2% instances), ADP (23; 1% instances), NUM (11; 1% instances), PROPN (6; 0% instances), DET (5; 0% instances), PUNCT (5; 0% instances), SYM (3; 0% instances), ROOT (1; 0% instances)
1125 (66%) PRON
nodes are leaves.
469 (28%) PRON
nodes have one child.
87 (5%) PRON
nodes have two children.
16 (1%) PRON
nodes have three or more children.
The highest child degree of a PRON
node is 11.
Children of PRON
nodes are attached using 21 different relations: ru-dep/case (486; 69% instances), ru-dep/punct (48; 7% instances), ru-dep/fixed (43; 6% instances), ru-dep/acl:relcl (31; 4% instances), ru-dep/discourse (16; 2% instances), ru-dep/amod (13; 2% instances), ru-dep/ccomp (12; 2% instances), ru-dep/det (11; 2% instances), ru-dep/advmod (7; 1% instances), ru-dep/goeswith (6; 1% instances), ru-dep/advcl (5; 1% instances), ru-dep/appos (5; 1% instances), ru-dep/conj (5; 1% instances), ru-dep/nmod (4; 1% instances), ru-dep/acl (3; 0% instances), ru-dep/cc:preconj (1; 0% instances), ru-dep/nsubj (1; 0% instances), ru-dep/nummod (1; 0% instances), ru-dep/nummod:gov (1; 0% instances), ru-dep/obj (1; 0% instances), ru-dep/parataxis (1; 0% instances)
Children of PRON
nodes belong to 11 different parts of speech: ADP (485; 69% instances), VERB (52; 7% instances), PUNCT (51; 7% instances), PART (35; 5% instances), ADV (24; 3% instances), NOUN (22; 3% instances), ADJ (15; 2% instances), DET (12; 2% instances), NUM (2; 0% instances), PROPN (2; 0% instances), CCONJ (1; 0% instances)
Treebank Statistics (UD_Russian-SynTagRus)
There are 23 PRON
lemmas (0%), 111 PRON
types (0%) and 45581 PRON
tokens (5%).
Out of 18 observed tags, the rank of PRON
is: 12 in number of lemmas, 9 in number of types and 7 in number of tokens.
The 10 most frequent PRON
lemmas: он, они, это, который, то, я, она, мы, что, все
The 10 most frequent PRON
types: он, это, его, я, их, мы, что, они, ее, которые
The 10 most frequent ambiguous lemmas: это (PRON 4795, PART 635), то (PRON 3741, SCONJ 1055, PART 209), что (SCONJ 7148, PRON 2498, PART 1), все (PRON 1847, PART 505), вы (PRON 944, X 1), нечего (PRON 21, NOUN 9, ADV 8), некого (PRON 4, NOUN 2)
The 10 most frequent ambiguous types: это (PRON 2421, PART 616, DET 364), их (PRON 1861, NOUN 1), что (SCONJ 7120, PRON 1457, NOUN 1), то (SCONJ 1042, PRON 862, DET 262, PART 209), все (DET 1036, PRON 886, PART 466), того (PRON 868, DET 154), том (PRON 818, DET 366, NOUN 4), этом (PRON 718, DET 454, PART 1), тем (PRON 521, DET 147, SCONJ 80, NOUN 15), вы (PRON 428, X 1)
- это
- их
- что
- то
- все
- того
- том
- этом
- тем
- вы
Morphology
The form / lemma ratio of PRON
is 4.826087 (the average of all parts of speech is 2.644632).
The 1st highest number of forms (12) was observed with the lemma “который”: которая, которого, которое, которой, котором, которому, которую, которые, который, которым, которыми, которых.
The 2nd highest number of forms (11) was observed with the lemma “то”: т, т., т.е, т.е., т.п, т.п., тем, то, того, том, тому.
The 3rd highest number of forms (9) was observed with the lemma “он”: его, ему, им, него, нем, нему, ним, нём, он.
PRON
occurs with 5 features: ru-feat/Case (45573; 100% instances), ru-feat/Number (35173; 77% instances), ru-feat/Person (24759; 54% instances), ru-feat/Gender (21515; 47% instances), ru-feat/Animacy (10414; 23% instances)
PRON
occurs with 16 feature-value pairs: Animacy=Anim
, Animacy=Inan
, Case=Acc
, Case=Dat
, Case=Gen
, Case=Ins
, Case=Loc
, Case=Nom
, Gender=Fem
, Gender=Masc
, Gender=Neut
, Number=Plur
, Number=Sing
, Person=1
, Person=2
, Person=3
PRON
occurs with 77 feature combinations.
The most frequent feature combination is Case=Nom
(4312 tokens).
Examples: он, это, что, я, которые, мы, они, кто, она, который
Relations
PRON
nodes are attached to their parents using 26 different relations: ru-dep/nsubj (18932; 42% instances), ru-dep/obl (9321; 20% instances), ru-dep/nmod (7686; 17% instances), ru-dep/obj (5024; 11% instances), ru-dep/root (857; 2% instances), ru-dep/nsubj:pass (669; 1% instances), ru-dep/conj (479; 1% instances), ru-dep/mark (458; 1% instances), ru-dep/parataxis (366; 1% instances), ru-dep/iobj (348; 1% instances), ru-dep/fixed (271; 1% instances), ru-dep/advmod (222; 0% instances), ru-dep/dep (206; 0% instances), ru-dep/obl:agent (203; 0% instances), ru-dep/advcl (130; 0% instances), ru-dep/discourse (126; 0% instances), ru-dep/orphan (111; 0% instances), ru-dep/acl:relcl (61; 0% instances), ru-dep/expl (34; 0% instances), ru-dep/amod (28; 0% instances), ru-dep/acl (13; 0% instances), ru-dep/appos (12; 0% instances), ru-dep/flat:name (11; 0% instances), ru-dep/ccomp (10; 0% instances), ru-dep/xcomp (2; 0% instances), ru-dep/cc (1; 0% instances)
Parents of PRON
nodes belong to 18 different parts of speech: VERB (30839; 68% instances), NOUN (8147; 18% instances), ADJ (3237; 7% instances), ADV (1134; 2% instances), ROOT (857; 2% instances), PRON (504; 1% instances), ADP (237; 1% instances), PROPN (167; 0% instances), NUM (151; 0% instances), DET (116; 0% instances), PART (114; 0% instances), _ (25; 0% instances), PUNCT (23; 0% instances), CCONJ (16; 0% instances), AUX (5; 0% instances), SCONJ (5; 0% instances), INTJ (2; 0% instances), X (2; 0% instances)
28770 (63%) PRON
nodes are leaves.
10638 (23%) PRON
nodes have one child.
2897 (6%) PRON
nodes have two children.
3276 (7%) PRON
nodes have three or more children.
The highest child degree of a PRON
node is 11.
Children of PRON
nodes are attached using 31 different relations: ru-dep/case (9484; 33% instances), ru-dep/punct (7260; 25% instances), ru-dep/advmod (2226; 8% instances), ru-dep/advcl (1671; 6% instances), ru-dep/mark (1511; 5% instances), ru-dep/nsubj (1116; 4% instances), ru-dep/amod (972; 3% instances), ru-dep/fixed (921; 3% instances), ru-dep/cc (891; 3% instances), ru-dep/conj (555; 2% instances), ru-dep/acl (522; 2% instances), ru-dep/cop (476; 2% instances), ru-dep/parataxis (411; 1% instances), ru-dep/nmod (321; 1% instances), ru-dep/acl:relcl (252; 1% instances), ru-dep/nummod:gov (176; 1% instances), ru-dep/appos (144; 0% instances), ru-dep/obl (66; 0% instances), ru-dep/_ (34; 0% instances), ru-dep/orphan (27; 0% instances), ru-dep/aux (14; 0% instances), ru-dep/obj (7; 0% instances), ru-dep/nummod (6; 0% instances), ru-dep/root (6; 0% instances), ru-dep/flat:name (5; 0% instances), ru-dep/iobj (5; 0% instances), ru-dep/discourse (4; 0% instances), ru-dep/dep (2; 0% instances), ru-dep/xcomp (2; 0% instances), ru-dep/ccomp (1; 0% instances), ru-dep/nsubj:pass (1; 0% instances)
Children of PRON
nodes belong to 16 different parts of speech: ADP (9450; 32% instances), PUNCT (7260; 25% instances), VERB (3128; 11% instances), PART (1909; 7% instances), SCONJ (1885; 6% instances), NOUN (1679; 6% instances), ADJ (1056; 4% instances), ADV (908; 3% instances), CCONJ (535; 2% instances), DET (492; 2% instances), PRON (431; 1% instances), PROPN (135; 0% instances), AUX (112; 0% instances), NUM (72; 0% instances), _ (34; 0% instances), INTJ (3; 0% instances)
PRON in other languages: [am] [ar] [bg] [bxr] [ca] [ckb] [cop] [cs] [cu] [da] [de] [el] [en] [es] [et] [eu] [fa] [fi] [fo] [fr] [ga] [gl] [got] [grc] [he] [hi] [hr] [hu] [id] [it] [ja] [kk] [kmr] [ko] [la] [lv] [mr] [nl] [no] [pl] [pt] [ro] [ru] [sa] [sk] [sla] [sl] [so] [sr] [sv] [swl] [ta] [tr] [ug] [uk] [u] [urj] [ur] [vi] [yue] [zh]