home uk/feat edit page issue tracker

This page still pertains to UD version 1.

Gender: gender

Gender is a lexical feature of nouns and inflectional feature of other parts of speech (adjectives, verbs) that mark agreement with nouns. There are three values of gender: masculine, feminine, and neuter.

See also the related feature of Animacy.

Masc: masculine gender

Nouns denoting male persons are masculine. Other nouns may be also grammatically masculine, without any relation to sex.

Examples

Note that the last two nouns above can also function as feminine (technically these are two different lemmas), depending on whether these functions designate men or women, with exactly the same (feminine in this case) morphological paradigm and agreeing with adjectivals and verbal forms in the feminine form, respectively. (Historically they are feminine too, with the typical endings -а  or -я .)

Fem: feminine gender

Nouns denoting female persons are feminine. Other nouns may be also grammatically feminine, without any relation to sex.

Examples

Neut: neuter gender

This third gender is for nouns that are neither masculine nor feminine (grammatically). Nouns whose nominative suffix is -о  or -е  (including a large group of deverbative nouns denoting actions) are usually neuter.

Examples


Treebank Statistics (UD_Ukrainian)

This feature is universal. It occurs with 3 different values: Fem, Masc, Neut.

5356 tokens (42%) have a non-empty value of Gender. 3754 types (70%) occur at least once with a non-empty value of Gender. 2630 lemmas (68%) occur at least once with a non-empty value of Gender. The feature is used with 8 part-of-speech tags: NOUN (2896; 23% instances), ADJ (752; 6% instances), VERB (581; 5% instances), PROPN (343; 3% instances), PRON (327; 3% instances), DET (317; 2% instances), AUX (103; 1% instances), NUM (37; 0% instances).

NOUN

2896 NOUN tokens (98% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Animacy=Inan (2401; 83%), Number=EMPTY (2045; 71%).

NOUN tokens may have the following values of Gender:

Paradigm малийMascNeut
Animacy=Anim|Case=Nomмалий
Animacy=Inan|Case=Accмале

Gender seems to be lexical feature of NOUN. 100% lemmas (1556) occur only with one value of Gender.

ADJ

752 ADJ tokens (67% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=EMPTY (752; 100%), Animacy=EMPTY (681; 91%), Aspect=EMPTY (679; 90%), VerbForm=EMPTY (679; 90%), Voice=EMPTY (679; 90%), Degree=EMPTY (509; 68%).

ADJ tokens may have the following values of Gender:

Paradigm цілийMascFemNeut
Animacy=Inan|Case=Accцілий
Case=Accцілуціле
Case=Genцілогоцілої
Case=Nomцілийцілаціле

VERB

581 VERB tokens (37% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Number=EMPTY (581; 100%), Person=EMPTY (581; 100%), Tense=Past (581; 100%), Mood=Ind (581; 100%), VerbForm=Fin (581; 100%), Aspect=Imp (332; 57%).

VERB tokens may have the following values of Gender:

Paradigm бутиMascFemNeut
бувбулабуло

PROPN

343 PROPN tokens (94% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Animacy=Anim (222; 65%).

PROPN tokens may have the following values of Gender:

Paradigm ПроскурняMascFem
Case=AccПроскурню
Case=GenПроскурні

Gender seems to be lexical feature of PROPN. 99% lemmas (166) occur only with one value of Gender.

PRON

327 PRON tokens (45% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Number=EMPTY (327; 100%), PronType=Prs (174; 53%), Person=3 (173; 53%), Animacy=EMPTY (173; 53%), Case=Nom (167; 51%).

PRON tokens may have the following values of Gender:

Paradigm тойMascFem
Case=Datтій
Case=Genтого
Case=Nomта

Gender seems to be lexical feature of PRON. 94% lemmas (17) occur only with one value of Gender.

DET

317 DET tokens (61% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Number=EMPTY (317; 100%), Animacy=EMPTY (274; 86%), Person=EMPTY (242; 76%), Poss=EMPTY (215; 68%).

DET tokens may have the following values of Gender:

Paradigm якийMascFemNeut
Animacy=Anim|Case=Accякого
Animacy=Inan|Case=Accякий
Case=Accякуяке
Case=Datякому
Case=Genякогоякої
Case=Insякимякою
Case=Locякомуякій
Case=Nomякийяка

AUX

103 AUX tokens (60% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Person=EMPTY (103; 100%), Number=EMPTY (103; 100%), VerbForm=Fin (103; 100%), Aspect=Imp (103; 100%), Mood=Ind (103; 100%), Tense=Past (103; 100%).

AUX tokens may have the following values of Gender:

Paradigm бутиMascFemNeut
бувбулабуло

NUM

37 NUM tokens (31% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: Number=EMPTY (37; 100%), NumType=Card (37; 100%), Case=Acc (22; 59%).

NUM tokens may have the following values of Gender:

Paradigm дваMascFemNeut
Case=Accдвадві, двохдва
Case=Genдвохдвох
Case=Insдвома
Case=Nomдва

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[amod]–> ADJ (597; 68%), NOUN –[det]–> DET (217; 63%), VERB –[conj]–> VERB (88; 70%), VERB –[nsubj]–> PROPN (60; 83%), NOUN –[appos]–> PROPN (44; 77%), PROPN –[flat:name]–> PROPN (40; 87%), ADJ –[conj]–> ADJ (28; 100%), ADJ –[cop]–> AUX (20; 69%), ADJ –[nsubj]–> NOUN (17; 59%), VERB –[nsubj]–> DET (14; 70%).


Gender in other languages: [am] [ar] [bg] [bxr] [ca] [ckb] [cop] [cs] [cu] [da] [de] [el] [en] [es] [et] [eu] [fa] [fo] [fr] [ga] [gl] [got] [grc] [he] [hi] [hr] [hu] [id] [it] [ja] [kk] [kmr] [ko] [la] [lv] [mr] [nl] [no] [pl] [pt] [ro] [ru] [sa] [sk] [sla] [sl] [so] [sr] [sv] [swl] [ta] [tr] [u] [ug] [uk] [ur] [vi] [yue] [zh]