
This page still pertains to UD version 1.

Gender: gender

Gender is a lexical feature of nouns and an inflectional feature of other parts of speech (adjectives, verbs) that mark agreement with nouns. There are three values of gender: masculine, feminine, and neuter.

See also the related feature of Animacy.
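In CoNLL-U files, morphological features are stored in the sixth column (FEATS) as Name=Value pairs joined by "|". The following is a minimal sketch of reading the Gender value from such a string. The helper name parse_feats and the sample feature strings are illustrative assumptions, not part of any UD tool.

```python
def parse_feats(feats: str) -> dict:
    """Parse a CoNLL-U FEATS string such as 'Case=Nom|Gender=Masc' into a dict."""
    if feats in ("", "_"):  # "_" marks an empty feature set in CoNLL-U
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


# Hypothetical feature strings: a masculine noun and an agreeing past-tense verb.
noun_feats = parse_feats("Animacy=Inan|Case=Nom|Gender=Masc|Number=Sing")
verb_feats = parse_feats("Gender=Masc|Number=Sing|Tense=Past")

print(noun_feats.get("Gender"))         # value for the noun
print(verb_feats.get("Gender"))         # value for the verb
print(parse_feats("_").get("Gender"))   # None when the token has no features
```

Returning a plain dict keeps the lookup of Gender independent of the order in which features appear in the string.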

Masc: masculine gender

Nouns denoting male persons are masculine. Other nouns may also be grammatically masculine, without any relation to sex.

Examples

Fem: feminine gender

Nouns denoting female persons are feminine. Other nouns may also be grammatically feminine, without any relation to sex.

Examples

Neut: neuter gender

This third gender is for nouns that are grammatically neither masculine nor feminine. Nouns whose nominative singular ends in -о or -е (including a large group of deverbative nouns denoting actions) are usually neuter.
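The ending-based tendency can be written as a rough heuristic. This is only a sketch: the function name looks_neuter is hypothetical, the rule checks nothing but the final letter, and it will misfire on exceptions (for instance, кофе is traditionally masculine although it ends in -е).

```python
# Cyrillic endings given as escapes to avoid confusion with Latin look-alikes.
NEUTER_ENDINGS = ("\u043e", "\u0435")  # Cyrillic "о" and "е"


def looks_neuter(nominative_singular: str) -> bool:
    """Heuristic only: True if the nominative singular ends in -о or -е."""
    return nominative_singular.lower().endswith(NEUTER_ENDINGS)


for noun in ["окно", "поле", "решение", "год", "книга"]:
    print(noun, looks_neuter(noun))
```

A real tagger would rely on a lexicon rather than on the ending, since gender is a lexical property of the noun.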

Examples


Treebank Statistics (UD_Russian)

This feature is universal. It occurs with 3 different values: Fem, Masc, Neut.

45156 tokens (51%) have a non-empty value of Gender. 22932 types (84%) occur at least once with a non-empty value of Gender. 14288 lemmas (83%) occur at least once with a non-empty value of Gender. The feature is used with 8 part-of-speech tags: ru-pos/NOUN (24010; 27% of instances), ru-pos/ADJ (8422; 10% of instances), ru-pos/PROPN (6294; 7% of instances), ru-pos/VERB (3283; 4% of instances), ru-pos/PRON (1249; 1% of instances), ru-pos/DET (738; 1% of instances), ru-pos/AUX (623; 1% of instances), ru-pos/NUM (537; 1% of instances).
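Counts of this kind can be reproduced in outline by scanning the UPOS and FEATS columns of a CoNLL-U file. The sketch below uses a tiny hand-made sentence (not taken from the treebank), and the function name gender_counts_by_upos is an assumption; the script that generated the statistics on this page may count differently.

```python
from collections import Counter


def gender_counts_by_upos(conllu_text):
    """Return (total tokens per UPOS, tokens with a Gender value per UPOS)."""
    total = Counter()
    with_gender = Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip sentence breaks and comment lines
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword-token ranges (1-2) and empty nodes (1.1)
        upos, feats = cols[3], cols[5]
        total[upos] += 1
        if any(f.startswith("Gender=") for f in feats.split("|")):
            with_gender[upos] += 1
    return total, with_gender


# Hypothetical sample sentence in the 10-column CoNLL-U layout.
rows = [
    ("1", "Этот", "этот", "DET", "_", "Case=Nom|Gender=Masc|Number=Sing", "2", "det", "_", "_"),
    ("2", "год", "год", "NOUN", "_", "Animacy=Inan|Case=Nom|Gender=Masc|Number=Sing", "4", "nsubj", "_", "_"),
    ("3", "был", "быть", "AUX", "_", "Gender=Masc|Mood=Ind|Number=Sing|Tense=Past|VerbForm=Fin", "4", "cop", "_", "_"),
    ("4", "трудным", "трудный", "ADJ", "_", "Case=Ins|Gender=Masc|Number=Sing", "0", "root", "_", "_"),
    ("5", ".", ".", "PUNCT", "_", "_", "4", "punct", "_", "_"),
]
sample = "\n".join("\t".join(row) for row in rows)

total, with_gender = gender_counts_by_upos(sample)
for upos in sorted(total):
    print(upos, with_gender[upos], "of", total[upos])
```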

NOUN

24010 ru-pos/NOUN tokens (100% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Animacy=Inan (20746; 86%), Number=Sing (18071; 75%).

NOUN tokens may have the following values of Gender:

Paradigm ГОД          Masc              Fem
Case=Acc|Number=Sing  год, года         -
Case=Acc|Number=Plur  годы, лет, годов  годы
Case=Dat|Number=Sing  году              -
Case=Dat|Number=Plur  годам             -
Case=Gen|Number=Sing  года              -
Case=Gen|Number=Plur  лет, годов        -
Case=Ins|Number=Sing  годом             -
Case=Ins|Number=Plur  годами            -
Case=Loc|Number=Sing  году              -
Case=Loc|Number=Plur  годах, годы       -
Case=Nom|Number=Sing  год               -
Case=Nom|Number=Plur  годы              -

Gender seems to be a lexical feature of NOUN. 99% of lemmas (5838) occur with only one value of Gender.
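The "lexical feature" observation rests on how many lemmas occur with a single Gender value. One plausible way to compute such a share is sketched below; the function name single_gender_share and the toy observations are assumptions for illustration, and the actual statistics script may define the measure differently.

```python
from collections import defaultdict


def single_gender_share(observations):
    """observations: iterable of (lemma, gender) pairs.

    Returns (lemmas seen with exactly one Gender value, lemmas seen with any).
    """
    values = defaultdict(set)
    for lemma, gender in observations:
        if gender:  # ignore tokens without a Gender value
            values[lemma].add(gender)
    single = sum(1 for seen in values.values() if len(seen) == 1)
    return single, len(values)


# Toy data: one lemma is observed with two different values.
observations = [
    ("год", "Masc"), ("год", "Masc"), ("год", "Fem"),
    ("книга", "Fem"),
    ("окно", "Neut"), ("окно", "Neut"),
]
single, n_lemmas = single_gender_share(observations)
print(f"{single} of {n_lemmas} lemmas occur with only one value of Gender")
```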

ADJ

8422 ru-pos/ADJ tokens (77% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (8421; 100%), Animacy=Inan (7652; 91%).

ADJ tokens may have the following values of Gender:

Paradigm Й             Masc  Fem  Neut
Animacy=Anim|Case=Gen  го    -    -
Animacy=Anim|Case=Ins  им    -    -
Animacy=Inan|Case=Acc  й     ю    е
Animacy=Inan|Case=Dat  му    й    -
Animacy=Inan|Case=Gen  го    й    го
Animacy=Inan|Case=Ins  м     й    -
Animacy=Inan|Case=Loc  м     й    -
Animacy=Inan|Case=Nom  й     я    е

PROPN

6294 ru-pos/PROPN tokens (100% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (6083; 97%), Animacy=Inan (3285; 52%).

PROPN tokens may have the following values of Gender:

Paradigm ДЕMascFemNeut
Animacy=Anim|Case=Accде
Animacy=Anim|Case=Genде
Animacy=Anim|Case=Insдеде
Animacy=Anim|Case=Locде
Animacy=Anim|Case=Nomде
Animacy=Inan|Case=LocДе
Animacy=Inan|Case=Nomде

Gender seems to be a lexical feature of PROPN. 99% of lemmas (4361) occur with only one value of Gender.

VERB

3283 ru-pos/VERB tokens (45% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Number=Sing (3283; 100%), Person=EMPTY (3283; 100%), Tense=Past (3064; 93%), Variant=EMPTY (2632; 80%), Aspect=Perf (2213; 67%), VerbForm=Fin (2040; 62%), Case=EMPTY (2040; 62%), Mood=Ind (2040; 62%), Animacy=EMPTY (2040; 62%).

VERB tokens may have the following values of Gender:

Paradigm БЫТЬ  Masc  Fem   Neut
-              был   была  было

PRON

1249 ru-pos/PRON tokens (74% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (1249; 100%), Person=EMPTY (685; 55%).

PRON tokens may have the following values of Gender:

Paradigm КОТОРЫЙ       Masc               Fem      Neut
Animacy=Anim|Case=Acc  которого, который  которую  -
Animacy=Anim|Case=Dat  которому           -        -
Animacy=Anim|Case=Gen  которого           которой  -
Animacy=Anim|Case=Ins  которым            которой  -
Animacy=Anim|Case=Nom  который            которая  -
Animacy=Inan|Case=Acc  который            которую  которое, которого
Animacy=Inan|Case=Dat  которому           которой  которому
Animacy=Inan|Case=Gen  которого           которой  которого
Animacy=Inan|Case=Ins  которым            которой  -
Animacy=Inan|Case=Loc  котором            которой  котором
Animacy=Inan|Case=Nom  который            которая  которое

Gender seems to be a lexical feature of PRON. 92% of lemmas (12) occur with only one value of Gender.

DET

738 ru-pos/DET tokens (53% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (738; 100%), Person=EMPTY (705; 96%), Animacy=Inan (667; 90%), Reflex=EMPTY (569; 77%).

DET tokens may have the following values of Gender:

Paradigm ЭТОТ          Masc   Fem        Neut
Animacy=Anim|Case=Acc  этого  -          -
Animacy=Anim|Case=Nom  этот   -          -
Animacy=Inan|Case=Acc  этот   эту        это
Animacy=Inan|Case=Dat  этому  этой       -
Animacy=Inan|Case=Gen  этого  этой, это  этого
Animacy=Inan|Case=Ins  этим   этой       Этим
Animacy=Inan|Case=Loc  этом   этой       этом
Animacy=Inan|Case=Nom  этот   эта        это

AUX

623 ru-pos/AUX tokens (62% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Person=EMPTY (623; 100%), Number=Sing (621; 100%), Tense=Past (617; 99%), Mood=Ind (603; 97%), VerbForm=Fin (603; 97%), Voice=EMPTY (583; 94%), Aspect=Imp (524; 84%).

AUX tokens may have the following values of Gender:

Paradigm БЫТЬ                                  Masc     Fem   Neut
Animacy=Anim|Case=Gen|VerbForm=Part|Voice=Act  бывшего  -     -
Animacy=Anim|Case=Ins|VerbForm=Part|Voice=Act  бывшим   -     -
Mood=Ind|VerbForm=Fin                          был      была  было

NUM

537 ru-pos/NUM tokens (29% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumType=Card (537; 100%), Animacy=Inan (434; 81%), Number=Sing (273; 51%).

NUM tokens may have the following values of Gender:

Paradigm ОДИН          Masc    Fem    Neut
Animacy=Anim|Case=Acc  одного  -      -
Animacy=Anim|Case=Dat  одному  -      -
Animacy=Anim|Case=Gen  одного  -      одного
Animacy=Anim|Case=Ins  одним   одной  -
Animacy=Anim|Case=Nom  один    одна   -
Animacy=Inan|Case=Acc  один    одну   одно, одного
Animacy=Inan|Case=Dat  одному  одной  -
Animacy=Inan|Case=Gen  одного  одной  одного
Animacy=Inan|Case=Ins  одним   одной  одним
Animacy=Inan|Case=Loc  одном   одной  одном
Animacy=Inan|Case=Nom  один    одна   одно

Gender seems to be a lexical feature of NUM. 92% of lemmas (114) occur with only one value of Gender.

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[amod]–> ADJ (6284; 73%), NOUN –[conj]–> NOUN (900; 54%), PROPN –[flat]–> PROPN (855; 100%), NOUN –[appos]–> PROPN (730; 67%), NOUN –[det]–> DET (564; 52%), NOUN –[acl]–> VERB (463; 53%), NOUN –[appos]–> NOUN (406; 52%), VERB –[nsubj]–> PROPN (393; 68%), PROPN –[conj]–> PROPN (379; 75%), VERB –[aux:pass]–> AUX (353; 96%).
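Agreement statistics of this kind compare the Gender value of each node with that of its parent in the dependency tree. The following sketch counts agreeing pairs per (parent UPOS, relation, child UPOS) triple on a small invented sentence. The token dictionaries, the function names, and the choice of denominator (pairs in which both nodes carry a Gender value) are assumptions; the percentages reported on this page may be computed against a different base.

```python
from collections import Counter


def gender_of(feats):
    """Return the Gender value from a FEATS string, or None if absent."""
    for pair in feats.split("|"):
        name, _, value = pair.partition("=")
        if name == "Gender":
            return value
    return None


def agreement_counts(sentence):
    """Count parent/child pairs that both have Gender, and those that agree."""
    by_id = {tok["id"]: tok for tok in sentence}
    agree, both = Counter(), Counter()
    for tok in sentence:
        parent = by_id.get(tok["head"])
        if parent is None:
            continue  # the root has head 0 and no parent token
        g_child, g_parent = gender_of(tok["feats"]), gender_of(parent["feats"])
        if g_child is None or g_parent is None:
            continue
        key = (parent["upos"], tok["deprel"], tok["upos"])
        both[key] += 1
        if g_child == g_parent:
            agree[key] += 1
    return agree, both


# Hypothetical three-token sentence with amod and nsubj relations.
sentence = [
    {"id": 1, "form": "новая", "upos": "ADJ", "feats": "Case=Nom|Gender=Fem|Number=Sing", "head": 2, "deprel": "amod"},
    {"id": 2, "form": "книга", "upos": "NOUN", "feats": "Animacy=Inan|Case=Nom|Gender=Fem|Number=Sing", "head": 3, "deprel": "nsubj"},
    {"id": 3, "form": "лежала", "upos": "VERB", "feats": "Gender=Fem|Number=Sing|Tense=Past", "head": 0, "deprel": "root"},
]
agree, both = agreement_counts(sentence)
for (parent, rel, child), n in both.items():
    print(f"{parent} -[{rel}]-> {child}: {agree[(parent, rel, child)]} of {n} agree")
```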


Treebank Statistics (UD_Russian-SynTagRus)

This feature is universal. It occurs with 3 different values: Fem, Masc, Neut.

417908 tokens (42%) have a non-empty value of Gender. 83600 types (78%) occur at least once with a non-empty value of Gender. 31823 lemmas (80%) occur at least once with a non-empty value of Gender. The feature is used with 8 part-of-speech tags: ru-pos/NOUN (243108; 25% of instances), ru-pos/ADJ (66634; 7% of instances), ru-pos/PROPN (33696; 3% of instances), ru-pos/VERB (33560; 3% of instances), ru-pos/PRON (21515; 2% of instances), ru-pos/DET (13062; 1% of instances), ru-pos/AUX (3662; 0% of instances), ru-pos/NUM (2671; 0% of instances).

NOUN

243108 ru-pos/NOUN tokens (100% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Animacy=Inan (210311; 87%), Number=Sing (170107; 70%).

NOUN tokens may have the following values of Gender:

Paradigm спецпитаниеMascFemNeut
Case=Accспецпитание
Case=Genспецпитанияспецпитания

Gender seems to be a lexical feature of NOUN. 100% of lemmas (15697) occur with only one value of Gender.

ADJ

66634 ru-pos/ADJ tokens (66% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (66634; 100%), Degree=Pos (66282; 99%).

ADJ tokens may have the following values of Gender:

Paradigm другой        Masc     Fem             Neut
Animacy=Anim|Case=Acc  другого  -               -
Animacy=Inan|Case=Acc  другой   -               -
Case=Acc               -        другую, другой  другое
Case=Dat               другому  другой          другому
Case=Gen               другого  другой          другого
Case=Ins               другим   другой          другим
Case=Loc               другом   другой          другом
Case=Nom               другой   другая, другой  другое, др.

PROPN

33696 ru-pos/PROPN tokens (93% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (32606; 97%), Animacy=Inan (17652; 52%).

PROPN tokens may have the following values of Gender:

Paradigm gongoMascFemNeut
Case=Gen|Number=SingGONGO
Case=Ins|Number=PlurGONGO
Case=Nom|Number=PlurGONGO

Gender seems to be a lexical feature of PROPN. 98% of lemmas (6952) occur with only one value of Gender.

VERB

33560 ru-pos/VERB tokens (30% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Number=Sing (33560; 100%), Person=EMPTY (33560; 100%), Tense=Past (30982; 92%), Case=EMPTY (26995; 80%), Mood=Ind (23675; 71%), VerbForm=Fin (23675; 71%), Aspect=Perf (21111; 63%), Voice=Act (19516; 58%).

VERB tokens may have the following values of Gender:

Paradigm мочь                                 Masc  Fem      Neut
Aspect=Imp|Case=Acc|Tense=Pres|VerbForm=Part  -     могущую  -
Aspect=Imp|Mood=Ind|Tense=Past|VerbForm=Fin   мог   могла    могло
Aspect=Perf|Mood=Ind|Tense=Past|VerbForm=Fin  смог  смогла   смогло

PRON

21515 ru-pos/PRON tokens (47% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (21512; 100%), Person=3 (11556; 54%), Animacy=EMPTY (11556; 54%).

PRON tokens may have the following values of Gender:

Paradigm тоMascFemNeut
Case=Accтомто
Case=Datтому, т.п., т.п, т.
Case=Genтоготого
Case=Insтемтем
Case=Locтом
Case=Nomто, т.е., т.е, т., т

DET

13062 ru-pos/DET tokens (65% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (13060; 100%).

DET tokens may have the following values of Gender:

Paradigm этот  Masc         Fem   Neut
Case=Acc       этот, этого  эту   это
Case=Dat       этому        этой  этому
Case=Gen       этого        этой  этого
Case=Ins       этим         этой  этим
Case=Loc       этом         этой  этом
Case=Nom       этот         эта   это

AUX

3662 ru-pos/AUX tokens (50% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Voice=Act (3662; 100%), Person=EMPTY (3662; 100%), Tense=Past (3662; 100%), Number=Sing (3662; 100%), Aspect=Imp (3662; 100%), VerbForm=Fin (3659; 100%), Mood=Ind (3659; 100%).

AUX tokens may have the following values of Gender:

Paradigm бытьMascFemNeut
Case=Loc|VerbForm=Partбывшем
Case=Nom|VerbForm=Partбывший
Mood=Ind|VerbForm=Finбылбылабыло

NUM

2671 ru-pos/NUM tokens (18% of all NUM tokens) have a non-empty value of Gender.

NUM tokens may have the following values of Gender:

Paradigm один          Masc    Fem    Neut
Animacy=Anim|Case=Acc  одного  -      -
Animacy=Inan|Case=Acc  один    -      -
Case=Acc               -       одну   одно
Case=Dat               одному  одной  одному
Case=Gen               одного  одной  одного
Case=Ins               одним   одной  одним
Case=Loc               одном   одной  одном
Case=Nom               один    одна   одно

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[amod]–> ADJ (48848; 66%), NOUN –[amod]–> DET (11375; 67%), NOUN –[amod]–> VERB (5284; 56%), PROPN –[flat:name]–> PROPN (4546; 99%), NOUN –[appos]–> PROPN (3744; 81%), VERB –[conj]–> VERB (2919; 54%), ADJ –[nsubj]–> NOUN (2695; 63%), ADJ –[conj]–> ADJ (2305; 94%), VERB –[nsubj]–> PROPN (2167; 58%), PROPN –[amod]–> ADJ (1666; 89%).


Gender in other languages: [am] [ar] [bg] [bxr] [ca] [ckb] [cop] [cs] [cu] [da] [de] [el] [en] [es] [et] [eu] [fa] [fo] [fr] [ga] [gl] [got] [grc] [he] [hi] [hr] [hu] [id] [it] [ja] [kk] [kmr] [ko] [la] [lv] [mr] [nl] [no] [pl] [pt] [ro] [ru] [sa] [sk] [sla] [sl] [so] [sr] [sv] [swl] [ta] [tr] [u] [ug] [uk] [ur] [vi] [yue] [zh]