home ru/pos edit page issue tracker

This page still pertains to UD version 1.

NOUN: noun

Definition

Nouns are a part of speech typically denoting a person, place, thing, animal or idea.

The NOUN tag is intended for common nouns only. See PROPN for proper nouns and PRON for pronouns.

Russian nouns have the lexical feature ru-feat/Gender. Furthermore, the nouns inflect for ru-feat/Number and ru-feat/Case.

A verbal noun can be derived productively from almost every verb (e.g. есть  “to eat” → поедание  “eating”). While in other languages a corresponding form may be called gerund and tagged VERB, in Russian it is tagged NOUN. It has always the neuter gender and the full number-case inflectional paradigm.

Examples


Treebank Statistics (UD_Russian)

There are 5915 NOUN lemmas (33%), 10526 NOUN types (38%) and 24062 NOUN tokens (27%). Out of 16 observed tags, the rank of NOUN is: 1 in number of lemmas, 1 in number of types and 1 in number of tokens.

The 10 most frequent NOUN lemmas: ГОД, ВРЕМЯ, ГОРОД, ЧЕЛОВЕК, ЧАСТЬ, РАЙОН, ОБЛАСТЬ, СОСТАВ, НАСЕЛЕНИЕ, ВОЙНА

The 10 most frequent NOUN types: года, году, время, области, лет, человек, войны, км, реки, год

The 10 most frequent ambiguous lemmas: МИР (NOUN 78, AUX 1), Г. (NOUN 51, PROPN 1), ЗЕМЛЯ (NOUN 51, PROPN 1), ЧЛЕН (NOUN 50, ADV 1), ОСТРОВ (NOUN 45, PROPN 1), ДОМ (NOUN 42, PROPN 1), ПЕСНЯ (NOUN 39, AUX 1), АВГУСТ (NOUN 37, PROPN 6), ВОСТОК (NOUN 36, PROPN 1), СЛОВО (NOUN 33, PROPN 1)

The 10 most frequent ambiguous types: мм (NOUN 26, ADJ 4), м (NOUN 23, ADJ 12), No (NOUN 21, PART 1), песни (NOUN 17, AUX 1), дома (NOUN 13, ADV 3), основном (NOUN 16, ADJ 1), мир (NOUN 15, AUX 1), б (NOUN 5, ADJ 1), начала (NOUN 10, VERB 5), начало (NOUN 9, VERB 1)

Morphology

The form / lemma ratio of NOUN is 1.779544 (the average of all parts of speech is 1.576680).

The 1st highest number of forms (10) was observed with the lemma “ГОД”: год, года, годам, годами, годах, годов, годом, году, годы, лет.

The 2nd highest number of forms (9) was observed with the lemma “АКТЕР”: актер, актера, актеров, актёр, актёра, актёрами, актёров, актёром, актёры.

The 3rd highest number of forms (9) was observed with the lemma “ЗАКОН”: закон, закона, законам, законами, законе, законов, законом, закону, законы.

NOUN occurs with 4 features: ru-feat/Animacy (24010; 100% instances), ru-feat/Case (24010; 100% instances), ru-feat/Gender (24010; 100% instances), ru-feat/Number (24010; 100% instances)

NOUN occurs with 15 feature-value pairs: Animacy=Anim, Animacy=Inan, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Case=Par, Case=Voc, Gender=Fem, Gender=Masc, Gender=Neut, Number=Plur, Number=Sing

NOUN occurs with 72 feature combinations. The most frequent feature combination is Animacy=Inan|Case=Gen|Gender=Masc|Number=Sing (2654 tokens). Examples: года, города, мира, века, района, декабря, января, марта, сентября, августа

Relations

NOUN nodes are attached to their parents using 31 different relations: ru-dep/nmod (8488; 35% instances), ru-dep/obl (5016; 21% instances), ru-dep/nsubj (2783; 12% instances), ru-dep/obj (1958; 8% instances), ru-dep/conj (1847; 8% instances), ru-dep/appos (950; 4% instances), ru-dep/root (779; 3% instances), ru-dep/iobj (685; 3% instances), ru-dep/nsubj:pass (481; 2% instances), ru-dep/advmod (431; 2% instances), ru-dep/goeswith (189; 1% instances), ru-dep/list (123; 1% instances), ru-dep/parataxis (73; 0% instances), ru-dep/fixed (53; 0% instances), ru-dep/nummod:gov (51; 0% instances), ru-dep/ccomp (30; 0% instances), ru-dep/acl:relcl (29; 0% instances), ru-dep/orphan (24; 0% instances), ru-dep/acl (21; 0% instances), ru-dep/amod (13; 0% instances), ru-dep/advcl (11; 0% instances), ru-dep/xcomp (10; 0% instances), ru-dep/discourse (4; 0% instances), ru-dep/nummod (3; 0% instances), ru-dep/case (2; 0% instances), ru-dep/compound (2; 0% instances), ru-dep/vocative (2; 0% instances), ru-dep/csubj (1; 0% instances), ru-dep/dep (1; 0% instances), ru-dep/flat (1; 0% instances), ru-dep/mark (1; 0% instances)

Parents of NOUN nodes belong to 14 different parts of speech: NOUN (10743; 45% instances), VERB (10495; 44% instances), ROOT (779; 3% instances), ADJ (584; 2% instances), PROPN (400; 2% instances), ADP (385; 2% instances), ADV (338; 1% instances), NUM (189; 1% instances), PUNCT (48; 0% instances), SYM (38; 0% instances), DET (33; 0% instances), PRON (22; 0% instances), AUX (4; 0% instances), CCONJ (4; 0% instances)

3490 (15%) NOUN nodes are leaves.

7952 (33%) NOUN nodes have one child.

7601 (32%) NOUN nodes have two children.

5019 (21%) NOUN nodes have three or more children.

The highest child degree of a NOUN node is 31.

Children of NOUN nodes are attached using 37 different relations: ru-dep/nmod (9338; 22% instances), ru-dep/amod (9128; 22% instances), ru-dep/case (7270; 17% instances), ru-dep/punct (5315; 13% instances), ru-dep/appos (2135; 5% instances), ru-dep/conj (1955; 5% instances), ru-dep/det (1094; 3% instances), ru-dep/cc (1048; 2% instances), ru-dep/acl (910; 2% instances), ru-dep/nummod:gov (764; 2% instances), ru-dep/nsubj (692; 2% instances), ru-dep/nummod (499; 1% instances), ru-dep/acl:relcl (447; 1% instances), ru-dep/advmod (364; 1% instances), ru-dep/cop (321; 1% instances), ru-dep/list (303; 1% instances), ru-dep/discourse (136; 0% instances), ru-dep/parataxis (113; 0% instances), ru-dep/iobj (89; 0% instances), ru-dep/goeswith (64; 0% instances), ru-dep/compound (51; 0% instances), ru-dep/mark (45; 0% instances), ru-dep/fixed (43; 0% instances), ru-dep/cc:preconj (42; 0% instances), ru-dep/advcl (32; 0% instances), ru-dep/orphan (28; 0% instances), ru-dep/obj (14; 0% instances), ru-dep/ccomp (13; 0% instances), ru-dep/csubj (3; 0% instances), ru-dep/dep (3; 0% instances), ru-dep/obl (3; 0% instances), ru-dep/aux (2; 0% instances), ru-dep/aux:pass (2; 0% instances), ru-dep/flat (2; 0% instances), ru-dep/xcomp (2; 0% instances), ru-dep/nsubj:pass (1; 0% instances), ru-dep/vocative (1; 0% instances)

Children of NOUN nodes belong to 16 different parts of speech: NOUN (10743; 25% instances), ADJ (9116; 22% instances), ADP (7493; 18% instances), PUNCT (5308; 13% instances), PROPN (2850; 7% instances), VERB (1506; 4% instances), NUM (1394; 3% instances), DET (1248; 3% instances), CCONJ (1080; 3% instances), ADV (763; 2% instances), AUX (325; 1% instances), PRON (206; 0% instances), PART (130; 0% instances), SYM (63; 0% instances), SCONJ (45; 0% instances), X (2; 0% instances)


Treebank Statistics (UD_Russian-SynTagRus)

There are 15804 NOUN lemmas (38%), 39426 NOUN types (36%) and 243588 NOUN tokens (25%). Out of 18 observed tags, the rank of NOUN is: 1 in number of lemmas, 1 in number of types and 1 in number of tokens.

The 10 most frequent NOUN lemmas: год, человек, время, страна, дело, система, работа, жизнь, власть, проблема

The 10 most frequent NOUN types: время, года, лет, году, %, раз, человек, жизни, люди, власти

The 10 most frequent ambiguous lemmas: год (NOUN 4212, PROPN 10), человек (NOUN 2465, PROPN 11), время (NOUN 1969, PROPN 8), страна (NOUN 1482, PROPN 2, X 1), дело (NOUN 1140, PROPN 2), система (NOUN 1008, PROPN 12), жизнь (NOUN 923, PROPN 6), власть (NOUN 915, PROPN 3), проблема (NOUN 842, PROPN 5), вопрос (NOUN 840, PROPN 2)

The 10 most frequent ambiguous types: раз (NOUN 626, SCONJ 27, ADV 4), ученые (NOUN 190, ADJ 3), что-то (NOUN 194, ADV 7), ученых (NOUN 157, ADJ 7), право (NOUN 145, ADJ 2, ADV 2), начала (NOUN 151, VERB 39), данным (NOUN 142, ADJ 2), целом (NOUN 125, ADJ 5), дома (NOUN 120, ADV 46), права (NOUN 119, ADJ 2)

Morphology

The form / lemma ratio of NOUN is 2.494685 (the average of all parts of speech is 2.644632).

The 1st highest number of forms (15) was observed with the lemma “тоннель”: тоннеле, тоннелей, тоннели, тоннель, тоннелю, тоннеля, тоннелям, тоннелями, тоннелях, туннеле, туннелем, туннель, туннелю, туннеля, туннелями.

The 2nd highest number of forms (14) was observed with the lemma “год”: г, г., гг, гг., год, года, годам, годами, годах, годов, годом, году, годы, лет.

The 3rd highest number of forms (13) was observed with the lemma “век”: в, в., вв, век, века, векам, веками, веках, веке, веков, веком, веку, полвека.

NOUN occurs with 5 features: ru-feat/Animacy (243522; 100% instances), ru-feat/Number (243364; 100% instances), ru-feat/Case (243362; 100% instances), ru-feat/Gender (243108; 100% instances), ru-feat/Foreign (2; 0% instances)

NOUN occurs with 16 feature-value pairs: Animacy=Anim, Animacy=Inan, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Case=Par, Case=Voc, Foreign=Yes, Gender=Fem, Gender=Masc, Gender=Neut, Number=Plur, Number=Sing

NOUN occurs with 93 feature combinations. The most frequent feature combination is Animacy=Inan|Case=Gen|Gender=Fem|Number=Sing (19054 tokens). Examples: жизни, страны, экономики, власти, стороны, работы, системы, науки, войны, партии

Relations

NOUN nodes are attached to their parents using 32 different relations: ru-dep/nmod (77086; 32% instances), ru-dep/obl (53361; 22% instances), ru-dep/nsubj (40926; 17% instances), ru-dep/obj (24149; 10% instances), ru-dep/conj (19485; 8% instances), ru-dep/root (6980; 3% instances), ru-dep/parataxis (5548; 2% instances), ru-dep/fixed (3609; 1% instances), ru-dep/nsubj:pass (3599; 1% instances), ru-dep/appos (2150; 1% instances), ru-dep/advmod (1874; 1% instances), ru-dep/obl:agent (1307; 1% instances), ru-dep/advcl (1019; 0% instances), ru-dep/orphan (898; 0% instances), ru-dep/iobj (808; 0% instances), ru-dep/nummod:gov (199; 0% instances), ru-dep/compound (197; 0% instances), ru-dep/acl:relcl (152; 0% instances), ru-dep/acl (122; 0% instances), ru-dep/nummod:entity (52; 0% instances), ru-dep/amod (13; 0% instances), ru-dep/dep (10; 0% instances), ru-dep/flat:name (10; 0% instances), ru-dep/vocative (10; 0% instances), ru-dep/ccomp (6; 0% instances), ru-dep/nummod (5; 0% instances), ru-dep/xcomp (5; 0% instances), ru-dep/flat (3; 0% instances), ru-dep/cc (2; 0% instances), ru-dep/aux:pass (1; 0% instances), ru-dep/expl (1; 0% instances), ru-dep/mark (1; 0% instances)

Parents of NOUN nodes belong to 19 different parts of speech: VERB (118438; 49% instances), NOUN (94139; 39% instances), ADJ (10027; 4% instances), ROOT (6980; 3% instances), ADV (3227; 1% instances), PROPN (2940; 1% instances), ADP (2615; 1% instances), NUM (1872; 1% instances), PRON (1699; 1% instances), DET (715; 0% instances), PART (224; 0% instances), PUNCT (219; 0% instances), SCONJ (217; 0% instances), _ (169; 0% instances), CCONJ (59; 0% instances), X (33; 0% instances), AUX (9; 0% instances), SYM (4; 0% instances), INTJ (2; 0% instances)

28970 (12%) NOUN nodes are leaves.

76472 (31%) NOUN nodes have one child.

81169 (33%) NOUN nodes have two children.

56977 (23%) NOUN nodes have three or more children.

The highest child degree of a NOUN node is 16.

Children of NOUN nodes are attached using 38 different relations: ru-dep/amod (101261; 23% instances), ru-dep/punct (93633; 21% instances), ru-dep/nmod (84636; 19% instances), ru-dep/case (71170; 16% instances), ru-dep/conj (19262; 4% instances), ru-dep/cc (14418; 3% instances), ru-dep/advmod (9266; 2% instances), ru-dep/nummod (7996; 2% instances), ru-dep/appos (6376; 1% instances), ru-dep/parataxis (6261; 1% instances), ru-dep/nsubj (5579; 1% instances), ru-dep/acl:relcl (4840; 1% instances), ru-dep/nummod:gov (3541; 1% instances), ru-dep/cop (2831; 1% instances), ru-dep/dep (1474; 0% instances), ru-dep/mark (1195; 0% instances), ru-dep/advcl (1031; 0% instances), ru-dep/fixed (876; 0% instances), ru-dep/flat:foreign (840; 0% instances), ru-dep/acl (405; 0% instances), ru-dep/compound (257; 0% instances), ru-dep/orphan (216; 0% instances), ru-dep/obl:agent (193; 0% instances), ru-dep/discourse (112; 0% instances), ru-dep/_ (86; 0% instances), ru-dep/flat:name (81; 0% instances), ru-dep/iobj (80; 0% instances), ru-dep/obl (75; 0% instances), ru-dep/aux (56; 0% instances), ru-dep/nummod:entity (44; 0% instances), ru-dep/obj (33; 0% instances), ru-dep/root (8; 0% instances), ru-dep/xcomp (5; 0% instances), ru-dep/aux:pass (2; 0% instances), ru-dep/vocative (2; 0% instances), ru-dep/ccomp (1; 0% instances), ru-dep/expl (1; 0% instances), ru-dep/nsubj:pass (1; 0% instances)

Children of NOUN nodes belong to 18 different parts of speech: NOUN (93928; 21% instances), PUNCT (93633; 21% instances), ADJ (77282; 18% instances), ADP (71632; 16% instances), VERB (20091; 5% instances), DET (17411; 4% instances), PROPN (16119; 4% instances), CCONJ (12639; 3% instances), NUM (11098; 3% instances), PRON (8149; 2% instances), PART (5992; 1% instances), ADV (5594; 1% instances), SCONJ (2904; 1% instances), AUX (1404; 0% instances), X (160; 0% instances), _ (86; 0% instances), INTJ (18; 0% instances), SYM (4; 0% instances)


NOUN in other languages: [am] [ar] [bg] [bxr] [ca] [ckb] [cop] [cs] [cu] [da] [de] [el] [en] [es] [et] [eu] [fa] [fi] [fo] [fr] [ga] [gl] [got] [grc] [he] [hi] [hr] [hu] [id] [it] [ja] [kk] [kmr] [ko] [la] [lv] [mr] [nl] [no] [pl] [pt] [ro] [ru] [sa] [sk] [sla] [sl] [so] [sr] [sv] [swl] [ta] [tr] [ug] [uk] [u] [urj] [ur] [vi] [yue] [zh]