
This page still pertains to UD version 1.

Gender: gender

Gender is a lexical feature of nouns and proper nouns, and an inflectional feature of the other parts of speech (adjectives, verbs, auxiliaries, pronouns, determiners and numerals) that mark agreement with nouns.

Masc: masculine gender

Examples

Fem: feminine gender

Examples

Neut: neuter gender

Examples

Conversion from JOS

All tokens with the JOS feature Gender=masculine are converted to Gender=Masc, all tokens with Gender=feminine to Gender=Fem, and all tokens with Gender=neuter to Gender=Neut.
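The conversion rule above amounts to a three-entry lookup table. The following is a minimal sketch, assuming the JOS features of a token are available as a plain dictionary; the names `JOS_TO_UD_GENDER` and `convert_gender` are hypothetical and not part of any existing conversion tool.

```python
# Hypothetical lookup table; the value names come from the rule stated above.
JOS_TO_UD_GENDER = {"masculine": "Masc", "feminine": "Fem", "neuter": "Neut"}

def convert_gender(jos_features):
    """Return the UD Gender feature for a dict of JOS features.

    Only Gender is handled here; tokens without a known JOS gender
    value yield an empty dict.
    """
    value = jos_features.get("Gender")
    if value in JOS_TO_UD_GENDER:
        return {"Gender": JOS_TO_UD_GENDER[value]}
    return {}

converted = convert_gender({"Gender": "feminine", "Number": "singular"})
```

Here `converted` holds only the Gender feature; other features such as Number would need their own conversion rules.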


Treebank Statistics (UD_Slovenian)

This feature is universal. It occurs with 3 different values: Fem, Masc, Neut.

This is a layered feature with the following layers: Gender, Gender[psor].
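In the CoNLL-U FEATS column, a layered feature is written with the layer in square brackets, so Gender and Gender[psor] can appear on the same token. The sketch below shows one way to separate the layers; the FEATS string is a made-up illustration rather than a token from the treebank, and the helper names are hypothetical.

```python
def parse_feats(feats):
    """Parse a CoNLL-U FEATS string into a dict; '_' means no features."""
    if not feats or feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))

def split_layer(name):
    """Split 'Gender[psor]' into ('Gender', 'psor'); the default layer is None."""
    if name.endswith("]") and "[" in name:
        base, layer = name[:-1].split("[", 1)
        return base, layer
    return name, None

# Illustrative FEATS value (an assumption, not copied from the data).
feats = parse_feats("Case=Nom|Gender=Fem|Gender[psor]=Masc|Number=Sing")
gender_layers = {
    split_layer(name)[1]: value
    for name, value in feats.items()
    if split_layer(name)[0] == "Gender"
}
```

After running this, `gender_layers` maps `None` (the default layer) to `Fem` and `psor` to `Masc`.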

58331 tokens (46%) have a non-empty value of Gender. 26978 types (92%) occur at least once with a non-empty value of Gender. 13677 lemmas (86%) occur at least once with a non-empty value of Gender. The feature is used with 8 part-of-speech tags: sl-pos/NOUN (27150; 21% of instances), sl-pos/ADJ (13529; 11% of instances), sl-pos/VERB (6228; 5% of instances), sl-pos/PROPN (4298; 3% of instances), sl-pos/DET (3994; 3% of instances), sl-pos/PRON (2031; 2% of instances), sl-pos/AUX (657; 1% of instances), sl-pos/NUM (444; 0% of instances).
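Counts of this kind can be approximated from a CoNLL-U file with a few lines of Python. The sketch below is not the script that generated this page: it assumes that only the default Gender layer is counted and that multiword-token ranges and empty nodes are skipped, and it runs on a made-up three-token sample.

```python
from collections import Counter

def gender_counts_by_upos(conllu_text):
    """Count tokens per UPOS tag that carry Gender on the default layer."""
    counts = Counter()
    total = 0
    for line in conllu_text.splitlines():
        cols = line.split("\t")
        # Keep only regular token lines: 10 columns and a plain integer ID.
        # This skips comments, blank lines, ranges (1-2) and empty nodes (1.1).
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        total += 1
        if any(f.startswith("Gender=") for f in cols[5].split("|")):
            counts[cols[3]] += 1
    return counts, total

# Made-up sample rows (ID, FORM, LEMMA, UPOS, XPOS, FEATS, HEAD, DEPREL, DEPS, MISC).
rows = [
    ("1", "druga", "drug", "ADJ", "_", "Case=Nom|Gender=Fem|Number=Sing", "2", "amod", "_", "_"),
    ("2", "pot", "pot", "NOUN", "_", "Case=Nom|Gender=Fem|Number=Sing", "0", "root", "_", "_"),
    ("3", ".", ".", "PUNCT", "_", "_", "2", "punct", "_", "_"),
]
sample = "\n".join("\t".join(row) for row in rows)
counts, total = gender_counts_by_upos(sample)
share = sum(counts.values()) / total  # fraction of tokens with Gender
```

On the sample, two of the three tokens carry Gender, one ADJ and one NOUN.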

NOUN

27150 sl-pos/NOUN tokens (100% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (19228; 71%).

NOUN tokens may have the following values of Gender:

Paradigm pot: Masc; Fem
Case=Acc|Number=Sing: pot
Case=Acc|Number=Plur: poti
Case=Dat|Number=Sing: poti
Case=Gen|Number=Sing: pota; poti
Case=Ins|Number=Sing: potjo
Case=Ins|Number=Plur: potmi
Case=Loc|Number=Sing: poti
Case=Loc|Number=Plur: poteh
Case=Nom|Number=Sing: pot

Gender seems to be a lexical feature of NOUN. 100% of lemmas (6005) occur with only one value of Gender.
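The "lexical feature" observation can be checked by collecting, for each lemma, the set of Gender values it occurs with. The following is a minimal sketch under the assumption that tokens are given as (lemma, UPOS, Gender) triples; the function name and the sample data are illustrative only.

```python
from collections import defaultdict

def single_gender_share(tokens, upos):
    """Return (count, share) of lemmas of one UPOS seen with a single Gender value."""
    values = defaultdict(set)
    for lemma, tag, gender in tokens:
        if tag == upos and gender is not None:
            values[lemma].add(gender)
    if not values:
        return 0, 0.0
    single = sum(1 for seen in values.values() if len(seen) == 1)
    return single, single / len(values)

# Made-up tokens: "pot" appears with two genders, "hiša" with one.
tokens = [
    ("pot", "NOUN", "Fem"),
    ("pot", "NOUN", "Masc"),
    ("hiša", "NOUN", "Fem"),
    ("hiša", "NOUN", "Fem"),
    ("drug", "ADJ", "Masc"),
]
single, share = single_gender_share(tokens, "NOUN")
```

In this toy sample one of the two noun lemmas has a single Gender value, so the share is 0.5; a share close to 1 is what suggests a lexical rather than inflectional feature.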

ADJ

13529 sl-pos/ADJ tokens (100% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Degree=Pos (12388; 92%), VerbForm=EMPTY (11801; 87%), Definite=EMPTY (11664; 86%), Number=Sing (9120; 67%).
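A value of EMPTY in these lists means that the other feature is absent from the token. The sketch below illustrates how such co-occurrence counts could be computed; it assumes features are already parsed into dictionaries and that only values found on more than half of the Gender-bearing tokens are reported, which matches the percentages shown here but is an assumption about the generating script.

```python
from collections import Counter

def cooccurring_values(feature_dicts, feature_names, threshold=0.5):
    """Count values of other features among tokens that have Gender.

    A feature missing from a token is counted under the value 'EMPTY'.
    Only (feature, value) pairs above the threshold share are returned.
    """
    with_gender = [feats for feats in feature_dicts if "Gender" in feats]
    counts = Counter()
    for feats in with_gender:
        for name in feature_names:
            counts[(name, feats.get(name, "EMPTY"))] += 1
    n = len(with_gender)
    return {key: c for key, c in counts.items() if n and c / n > threshold}

# Made-up feature sets for four adjective tokens; the last one has no Gender.
adjectives = [
    {"Gender": "Fem", "Degree": "Pos", "Number": "Sing"},
    {"Gender": "Masc", "Degree": "Pos", "Number": "Plur"},
    {"Gender": "Neut", "Degree": "Cmp", "Number": "Sing"},
    {"Degree": "Pos"},
]
frequent = cooccurring_values(adjectives, ["Degree", "Number", "VerbForm"])
```

With this sample, `frequent` contains Degree=Pos and Number=Sing (two of three tokens each) and VerbForm=EMPTY (all three tokens).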

ADJ tokens may have the following values of Gender:

Paradigm drug: Masc; Fem; Neut
Case=Acc|Definite=Def|Number=Sing: drugi
Case=Acc|Definite=Ind|Number=Sing: drug
Case=Acc|Number=Sing: drugega; drugo; drugo
Case=Acc|Number=Plur: druge; druge; druga
Case=Dat|Number=Sing: drugemu; drugi
Case=Dat|Number=Plur: drugim
Case=Gen|Number=Sing: drugega; druge; drugega
Case=Gen|Number=Plur: drugih; drugih
Case=Ins|Number=Sing: drugim; drugo; drugim
Case=Ins|Number=Plur: drugimi; drugimi
Case=Loc|Number=Sing: drugem; drugi; drugem
Case=Loc|Number=Dual: drugih
Case=Loc|Number=Plur: drugih; drugih; drugih
Case=Nom|Definite=Def|Number=Sing: drugi
Case=Nom|Definite=Ind|Number=Sing: drug
Case=Nom|Number=Sing: druga; drugo
Case=Nom|Number=Plur: drugi; druge; druga

VERB

6228 sl-pos/VERB tokens (48% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Person=EMPTY (6228; 100%), Tense=EMPTY (6228; 100%), Mood=EMPTY (6228; 100%), VerbForm=Part (6228; 100%), Number=Sing (4049; 65%), Aspect=Perf (3767; 60%).

VERB tokens may have the following values of Gender:

Paradigm biti: Masc; Fem; Neut
Number=Sing: bil; bila; bilo, blo
Number=Dual: bila, bla; bili
Number=Plur: bili; bile

PROPN

4298 sl-pos/PROPN tokens (100% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (4046; 94%), Case=Nom (2204; 51%).

PROPN tokens may have the following values of Gender:

Paradigm EU: Masc; Fem
Case=Acc: EU
Case=Gen: EU
Case=Loc: EU
Case=Nom: EU; EU

Gender seems to be a lexical feature of PROPN. 99% of lemmas (2395) occur with only one value of Gender.

DET

3994 sl-pos/DET tokens (85% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Person=EMPTY (3262; 82%), Number[psor]=EMPTY (3262; 82%), Number=Sing (2834; 71%), Poss=EMPTY (2833; 71%).

DET tokens may have the following values of Gender:

Paradigm ta: Masc; Fem; Neut
Case=Acc|Number=Sing: ta, tega; to; to
Case=Acc|Number=Dual: ti
Case=Acc|Number=Plur: te; te; ta
Case=Dat|Number=Sing: temu; tej; temu
Case=Dat|Number=Plur: tem; tem
Case=Gen|Number=Sing: tega; te; tega
Case=Gen|Number=Dual: teh
Case=Gen|Number=Plur: teh; teh; teh
Case=Ins|Number=Sing: tem; to; tem
Case=Ins|Number=Plur: temi; temi
Case=Loc|Number=Sing: tem; tej; tem
Case=Loc|Number=Plur: teh; teh; teh
Case=Nom|Number=Sing: ta; ta; to
Case=Nom|Number=Dual: ta
Case=Nom|Number=Plur: ti; te; ta

PRON

2031 sl-pos/PRON tokens (42% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Reflex=EMPTY (2031; 100%), Number=Sing (1546; 76%), PronType=Prs (1496; 74%), Person=3 (1474; 73%), Variant=Short (1129; 56%), Case=Acc (1041; 51%).

PRON tokens may have the following values of Gender:

Paradigm on: Masc; Fem; Neut
Case=Acc|Number=Sing: njega; njo
Case=Acc|Number=Sing|Variant=Short: ga; jo; ga
Case=Acc|Number=Dual: njiju
Case=Acc|Number=Dual|Variant=Short: ju; ju; ju
Case=Acc|Number=Plur: njih, nje
Case=Acc|Number=Plur|Variant=Short: jih; jih; jih
Case=Dat|Number=Sing: njemu; njej
Case=Dat|Number=Sing|Variant=Short: mu; ji; mu
Case=Dat|Number=Dual: njima
Case=Dat|Number=Dual|Variant=Short: jima; jima
Case=Dat|Number=Plur: njim; njim
Case=Dat|Number=Plur|Variant=Short: jim; jim; jim
Case=Gen|Number=Sing: njega; nje
Case=Gen|Number=Sing|Variant=Short: ga; je; ga
Case=Gen|Number=Dual: njiju
Case=Gen|Number=Dual|Variant=Short: ju
Case=Gen|Number=Plur: njih; njih; njih
Case=Gen|Number=Plur|Variant=Short: jih; jih; jih
Case=Ins|Number=Sing: njim; njo; njim
Case=Ins|Number=Dual: njima; njima
Case=Ins|Number=Plur: njimi; njimi; njimi
Case=Loc|Number=Sing: njem; njej; njem
Case=Loc|Number=Dual: njiju
Case=Loc|Number=Plur: njih; njih
Case=Nom|Number=Sing: on; ona
Case=Nom|Number=Dual: onadva
Case=Nom|Number=Plur: oni

AUX

657 sl-pos/AUX tokens (7% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Mood=EMPTY (657; 100%), Tense=EMPTY (657; 100%), VerbForm=Part (657; 100%), Person=EMPTY (657; 100%), Polarity=EMPTY (657; 100%), Number=Sing (500; 76%).

AUX tokens may have the following values of Gender:

Paradigm biti: Masc; Fem; Neut
Number=Sing: bil; bila; bilo
Number=Dual: bila; bili
Number=Plur: bili; bile; bila

NUM

444 sl-pos/NUM tokens (25% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumForm=Word (444; 100%), NumType=Card (440; 99%).

NUM tokens may have the following values of Gender:

Paradigm en: Masc; Fem; Neut
Case=Acc|Number=Sing: en, enega; eno; eno
Case=Dat|Number=Sing: enemu; eni
Case=Gen|Number=Sing: enega; ene; enega
Case=Ins|Number=Sing: enim; eno; enim
Case=Loc|Number=Sing: enem; eni; enem
Case=Loc|Number=Plur: enih
Case=Nom|Number=Sing: en; ena; eno
Case=Nom|Number=Plur: eni

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[amod]–> ADJ (10142; 99%), NOUN –[det]–> DET (2551; 87%), ADJ –[nsubj]–> NOUN (790; 98%), NOUN –[nmod]–> PROPN (756; 55%), PROPN –[flat:name]–> PROPN (624; 100%), ADJ –[conj]–> ADJ (559; 93%), VERB –[nsubj]–> PROPN (506; 72%), VERB –[conj]–> VERB (500; 70%), PROPN –[amod]–> ADJ (232; 100%), PROPN –[conj]–> PROPN (216; 71%).
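Agreement figures like these can be derived by comparing the Gender value of each node with that of its head. The sketch below assumes the percentage is the share of agreeing pairs among all parent-child pairs of that type in which both nodes have Gender; this is an interpretation of the numbers, not a documented definition. The token dictionaries and the deliberately mismatched determiner are made up for illustration.

```python
from collections import Counter

def gender_agreement(tokens):
    """Per (parent UPOS, relation, child UPOS): agreeing count and share.

    tokens is a list of dicts for one sentence with the keys
    id, head, upos, deprel and gender (None when Gender is absent).
    """
    by_id = {tok["id"]: tok for tok in tokens}
    agree, both = Counter(), Counter()
    for child in tokens:
        parent = by_id.get(child["head"])  # None for the root (head 0)
        if parent is None or child["gender"] is None or parent["gender"] is None:
            continue
        key = (parent["upos"], child["deprel"], child["upos"])
        both[key] += 1
        if parent["gender"] == child["gender"]:
            agree[key] += 1
    return {key: (agree[key], agree[key] / both[key]) for key in both}

# Made-up sentence fragment; the DET is given a mismatched gender on purpose.
sentence = [
    {"id": 1, "head": 3, "upos": "DET", "deprel": "det", "gender": "Masc"},
    {"id": 2, "head": 3, "upos": "ADJ", "deprel": "amod", "gender": "Fem"},
    {"id": 3, "head": 0, "upos": "NOUN", "deprel": "root", "gender": "Fem"},
]
stats = gender_agreement(sentence)
```

For a whole treebank the two counters would be accumulated across sentences before the shares are computed.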


Treebank Statistics (UD_Slovenian-SST)

This feature is universal. It occurs with 3 different values: Fem, Masc, Neut.

This is a layered feature with the following layers: Gender, Gender[psor].

6337 tokens (33%) have a non-empty value of Gender. 3262 types (72%) occur at least once with a non-empty value of Gender. 2285 lemmas (74%) occur at least once with a non-empty value of Gender. The feature is used with 8 part-of-speech tags: sl-pos/NOUN (2374; 12% of instances), sl-pos/ADJ (1093; 6% of instances), sl-pos/DET (1051; 5% of instances), sl-pos/VERB (776; 4% of instances), sl-pos/PRON (470; 2% of instances), sl-pos/PROPN (307; 2% of instances), sl-pos/NUM (194; 1% of instances), sl-pos/AUX (72; 0% of instances).

NOUN

2374 sl-pos/NOUN tokens (100% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Animacy=EMPTY (2135; 90%), Number=Sing (1785; 75%).

NOUN tokens may have the following values of Gender:

Gender seems to be a lexical feature of NOUN. 100% of lemmas (1183) occur with only one value of Gender.

ADJ

1093 sl-pos/ADJ tokens (100% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: VerbForm=EMPTY (965; 88%), Degree=Pos (944; 86%), Definite=EMPTY (888; 81%), Number=Sing (820; 75%), Case=Nom (572; 52%).

ADJ tokens may have the following values of Gender:

Paradigm drug: Masc; Fem; Neut
Case=Acc|Definite=Def|Number=Sing: drugi
Case=Acc|Number=Sing: drugo; drugo
Case=Acc|Number=Plur: druge; druge
Case=Dat|Number=Sing: drugemu
Case=Gen|Number=Sing: drugega; druge; drugega
Case=Gen|Number=Plur: drugih; drugih
Case=Ins|Number=Sing: drugo; drugim
Case=Ins|Number=Plur: drugimi
Case=Loc|Number=Sing: drugi; drugem
Case=Loc|Number=Dual: drugih
Case=Nom|Definite=Def|Number=Sing: drugi
Case=Nom|Definite=Ind|Number=Sing: drug
Case=Nom|Number=Sing: druga; drugo
Case=Nom|Number=Plur: drugi

DET

1051 sl-pos/DET tokens (87% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (871; 83%), PronType=Dem (687; 65%).

DET tokens may have the following values of Gender:

Paradigm ta: Masc; Fem; Neut
Case=Acc|Number=Sing: ta, tega; to; to
Case=Acc|Number=Plur: te; te; ta
Case=Dat|Number=Sing: temu; tej; temu
Case=Dat|Number=Plur: tem; tem; tem
Case=Gen|Number=Sing: tega; te; tega
Case=Gen|Number=Plur: teh; teh; teh
Case=Ins|Number=Sing: tem; to; tem
Case=Ins|Number=Plur: temi; temi
Case=Loc|Number=Sing: tem; tej; tem
Case=Loc|Number=Plur: teh; teh
Case=Nom|Number=Sing: ta; ta; to
Case=Nom|Number=Dual: ti
Case=Nom|Number=Plur: ti; te

VERB

776 sl-pos/VERB tokens (30% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Person=EMPTY (776; 100%), Polarity=EMPTY (776; 100%), Tense=EMPTY (776; 100%), Mood=EMPTY (776; 100%), VerbForm=Part (776; 100%), Number=Sing (521; 67%).

VERB tokens may have the following values of Gender:

Paradigm biti: Masc; Fem; Neut
Aspect=Imp|Number=Sing: bil
Number=Sing: bil; bila; bilo
Number=Dual: bila
Number=Plur: bili; bile

PRON

470 sl-pos/PRON tokens (43% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (342; 73%), Variant=EMPTY (333; 71%), PronType=Prs (263; 56%).

PRON tokens may have the following values of Gender:

Paradigm on: Masc; Fem; Neut
Case=Acc|Number=Sing: njega
Case=Acc|Number=Sing|Variant=Short: ga; jo; ga
Case=Acc|Number=Plur: njih
Case=Acc|Number=Plur|Variant=Short: jih; jih; jih
Case=Dat|Number=Sing: njemu
Case=Dat|Number=Sing|Variant=Short: mu; ji
Case=Dat|Number=Plur: njim
Case=Dat|Number=Plur|Variant=Short: jim; jim
Case=Gen|Number=Sing: njega; nje
Case=Gen|Number=Sing|Variant=Short: ga; je
Case=Gen|Number=Plur|Variant=Short: jih; jih
Case=Ins|Number=Sing: njim; njo
Case=Ins|Number=Plur: njimi; njimi
Case=Loc|Number=Sing: njej
Case=Loc|Number=Plur: njih; njih
Case=Nom|Number=Sing: on; ona
Case=Nom|Number=Dual: onadva
Case=Nom|Number=Plur: oni; one

PROPN

307 sl-pos/PROPN tokens (62% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (280; 91%).

PROPN tokens may have the following values of Gender:

Gender seems to be a lexical feature of PROPN. 100% of lemmas (233) occur with only one value of Gender.

NUM

194 sl-pos/NUM tokens (55% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumForm=Word (194; 100%), NumType=Card (193; 99%), Number=Sing (105; 54%), Case=Acc (98; 51%).

NUM tokens may have the following values of Gender:

Paradigm en: Masc; Fem; Neut
Case=Acc|Number=Sing: en, enega; eno; eno
Case=Acc|Number=Plur: ene
Case=Dat|Number=Sing: enemu
Case=Gen|Number=Sing: enega; ene
Case=Gen|Number=Plur: enih
Case=Ins|Number=Sing: enim; eno; enim
Case=Loc|Number=Sing: eni
Case=Nom|Number=Sing: en; ena; eno
Case=Nom|Number=Plur: eni; ena

AUX

72 sl-pos/AUX tokens (6% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Person=EMPTY (72; 100%), Polarity=EMPTY (72; 100%), Mood=EMPTY (72; 100%), Tense=EMPTY (72; 100%), VerbForm=Part (72; 100%), Number=Sing (56; 78%).

AUX tokens may have the following values of Gender:

Paradigm biti: Masc; Fem; Neut
Aspect=Imp|Number=Sing: bil; bilo
Number=Sing: bil; bila; bilo
Number=Dual: bila
Number=Plur: bili; bila

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[amod]–> ADJ (605; 99%), NOUN –[det]–> DET (389; 90%), NOUN –[nummod]–> NUM (102; 55%), NOUN –[conj]–> NOUN (71; 57%), PROPN –[flat:name]–> PROPN (52; 100%), ADJ –[nsubj]–> NOUN (48; 96%), ADJ –[conj]–> ADJ (36; 90%), ADJ –[det]–> DET (21; 95%), ADJ –[nsubj]–> DET (19; 90%), NOUN –[appos]–> NOUN (19; 59%).

