home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Czech-CAC: Features: Gender

This feature is universal. It occurs with 3 different values: Fem, Masc, Neut. Some words have combined values of the feature; 3 combinations have been observed: Fem|Masc, Fem|Neut, Masc|Neut.

This is a layered feature with the following layers: Gender, Gender[psor].

252494 tokens (51%) have a non-empty value of Gender. 58315 types (93%) occur at least once with a non-empty value of Gender. 25144 lemmas (88%) occur at least once with a non-empty value of Gender. The feature is used with 8 part-of-speech tags: NOUN (136143; 28% instances), ADJ (73917; 15% instances), DET (15571; 3% instances), VERB (10563; 2% instances), PROPN (9803; 2% instances), PRON (2848; 1% instances), AUX (2450; 0% instances), NUM (1199; 0% instances).

NOUN

136143 NOUN tokens (100% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Polarity=Pos (135949; 100%), Number=Sing (95308; 70%), Animacy=EMPTY (79760; 59%).

NOUN tokens may have the following values of Gender:

Paradigm rokMascNeut
Animacy=Inan|Case=Acc|Number=Singrok
Animacy=Inan|Case=Acc|Number=Plurroky
Animacy=Inan|Case=Dat|Number=Singroku
Animacy=Inan|Case=Gen|Number=Singroku, roka
Animacy=Inan|Case=Gen|Number=Plurroků
Animacy=Inan|Case=Ins|Number=Singrokem
Animacy=Inan|Case=Ins|Number=Plurroky
Animacy=Inan|Case=Loc|Number=Singroce
Animacy=Inan|Case=Nom|Number=Singrok
Animacy=Inan|Case=Nom|Number=Plurroky
Case=Gen|Number=Plurlet
Case=Ins|Number=Plurlety
Case=Loc|Number=Plurletech

Gender seems to be lexical feature of NOUN. 100% lemmas (11076) occur only with one value of Gender.

ADJ

73917 ADJ tokens (100% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Polarity=Pos (71070; 96%), Degree=Pos (62554; 85%), Number=Sing (47115; 64%), Animacy=EMPTY (44952; 61%).

ADJ tokens may have the following values of Gender:

Paradigm uvedenýFem,MascFem,NeutMascFemNeut
Animacy=Anim|Case=Gen|Degree=Pos|Number=Plur|Polarity=Posuvedených
Animacy=Anim|Case=Nom|Degree=Pos|Number=Sing|Polarity=Posuvedený
Animacy=Anim|Case=Nom|Degree=Pos|Number=Plur|Polarity=Posuvedení
Animacy=Anim|Number=Plur|Polarity=Pos|Variant=Short|VerbForm=Part|Voice=Passuvedeni
Animacy=Inan|Case=Acc|Degree=Pos|Number=Sing|Polarity=Posuvedený
Animacy=Inan|Case=Acc|Degree=Pos|Number=Plur|Polarity=Negneuvedené
Animacy=Inan|Case=Acc|Degree=Pos|Number=Plur|Polarity=Posuvedené
Animacy=Inan|Case=Dat|Degree=Pos|Number=Plur|Polarity=Posuvedeným
Animacy=Inan|Case=Gen|Degree=Pos|Number=Sing|Polarity=Posuvedeného
Animacy=Inan|Case=Gen|Degree=Pos|Number=Plur|Polarity=Posuvedených
Animacy=Inan|Case=Ins|Degree=Pos|Number=Sing|Polarity=Posuvedeným
Animacy=Inan|Case=Ins|Degree=Pos|Number=Plur|Polarity=Posuvedenými
Animacy=Inan|Case=Loc|Degree=Pos|Number=Sing|Polarity=Posuvedeném
Animacy=Inan|Case=Loc|Degree=Pos|Number=Plur|Polarity=Posuvedených
Animacy=Inan|Case=Nom|Degree=Pos|Number=Sing|Polarity=Posuvedený
Animacy=Inan|Case=Nom|Degree=Pos|Number=Plur|Polarity=Posuvedené
Animacy=Inan|Number=Plur|Polarity=Pos|Variant=Short|VerbForm=Part|Voice=Passuvedeny
Case=Acc|Degree=Pos|Number=Sing|Polarity=Posuvedenouuvedené
Case=Acc|Degree=Pos|Number=Plur|Polarity=PosuvedenéUvedená
Case=Dat|Degree=Pos|Number=Sing|Polarity=Posuvedenéuvedenému
Case=Dat|Degree=Pos|Number=Plur|Polarity=Posuvedeným
Case=Gen|Degree=Pos|Number=Sing|Polarity=Posuvedenéuvedeného
Case=Gen|Degree=Pos|Number=Plur|Polarity=Posuvedenýchuvedených
Case=Ins|Degree=Pos|Number=Sing|Polarity=Posuvedenouuvedeným
Case=Ins|Degree=Pos|Number=Plur|Polarity=Posuvedenýmiuvedenými
Case=Loc|Degree=Pos|Number=Sing|Polarity=Posuvedenéuvedeném
Case=Loc|Degree=Pos|Number=Plur|Polarity=Posuvedenýchuvedených
Case=Nom|Degree=Pos|Number=Sing|Polarity=Posuvedenáuvedené
Case=Nom|Degree=Pos|Number=Plur|Polarity=Posuvedenéuvedená
Number=Sing|Polarity=Pos|Variant=Short|VerbForm=Part|Voice=Passuvedenuvedeno
Number=Plur,Sing|Polarity=Pos|Variant=Short|VerbForm=Part|Voice=Passuvedena

DET

15571 DET tokens (78% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Person=EMPTY (14047; 90%), Number[psor]=EMPTY (14047; 90%), Animacy=EMPTY (13146; 84%), Poss=EMPTY (12859; 83%), Number=Sing (12438; 80%).

DET tokens may have the following values of Gender:

Paradigm můjFem,NeutMascMasc,NeutFemNeut
Animacy=Anim|Case=Acc|Number=Sing|Number[psor]=Plurnašeho
Animacy=Anim|Case=Nom|Number=Plur|Number[psor]=Singmoji
Animacy=Anim|Case=Nom|Number=Plur|Number[psor]=Plurnaši
Animacy=Inan|Case=Acc|Number=Sing|Number[psor]=Singmůj
Animacy=Inan|Case=Acc|Number=Sing|Number[psor]=Plurnáš
Animacy=Inan|Case=Nom|Number=Plur|Number[psor]=Plurnaše
Case=Acc|Number=Sing|Number[psor]=Singmoumoje
Case=Acc|Number=Sing|Number[psor]=Plurnašinaše
Case=Acc|Number=Plur|Number[psor]=Sing
Case=Dat|Number=Sing|Number[psor]=Singmému
Case=Dat|Number=Sing|Number[psor]=Plurnašemunaší
Case=Gen|Number=Sing|Number[psor]=Singméhomé, mojí
Case=Gen|Number=Sing|Number[psor]=Plurnašehonaší
Case=Ins|Number=Sing|Number[psor]=Singmýmmou, mojí
Case=Ins|Number=Sing|Number[psor]=Plurnašímnaší
Case=Ins|Number=Dual|Number[psor]=Singmýma
Case=Ins|Number=Dual|Number[psor]=Plurnašima
Case=Loc|Number=Sing|Number[psor]=Singmém
Case=Loc|Number=Sing|Number[psor]=Plurnašemnaší
Case=Nom|Number=Sing|Number[psor]=Singmojemůj
Case=Nom|Number=Sing|Number[psor]=Plurnašenáš
Case=Nom|Number=Plur|Number[psor]=Singmoje
Case=Nom|Number=Plur|Number[psor]=Plurnaše

VERB

10563 VERB tokens (26% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Person=EMPTY (10563; 100%), Mood=EMPTY (10563; 100%), Voice=Act (10563; 100%), Tense=Past (10530; 100%), VerbForm=Part (10529; 100%), Polarity=Pos (9746; 92%).

VERB tokens may have the following values of Gender:

Paradigm mítFem,MascFem,NeutMascNeut
Animacy=Anim|Number=Plur|Polarity=Negneměli
Animacy=Anim|Number=Plur|Polarity=Posměli
Animacy=Inan|Number=Plur|Polarity=Negneměly
Animacy=Inan|Number=Plur|Polarity=Posměly
Number=Sing|Polarity=Negnemělnemělo
Number=Sing|Polarity=Posmělmělo
Number=Plur,Sing|Polarity=Negneměla
Number=Plur,Sing|Polarity=Posměla

PROPN

9803 PROPN tokens (100% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Polarity=Pos (9803; 100%), Abbr=EMPTY (7931; 81%), Number=Sing (7187; 73%).

PROPN tokens may have the following values of Gender:

Paradigm KSČMascFem
Animacy=InanKSČ
KSČ

Gender seems to be lexical feature of PROPN. 99% lemmas (3427) occur only with one value of Gender.

PRON

2848 PRON tokens (18% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Reflex=EMPTY (2848; 100%), Variant=EMPTY (2536; 89%), Number=Sing (2103; 74%), PrepCase=EMPTY (1914; 67%), Person=EMPTY (1576; 55%).

PRON tokens may have the following values of Gender:

Paradigm onMascMasc,NeutFemNeut
Animacy=Anim|Case=Nom|Number=Pluroni
Case=Acc|Number=Sing|PrepCase=Preněj, něhoni
Case=Acc|Number=Singjehojije
Case=Acc|Number=Sing|Style=Archjej
Case=Acc|Number=Sing|Variant=Shortho
Case=Dat|Number=Sing|PrepCase=Preněmu
Case=Dat|Number=Singjemu
Case=Dat|Number=Sing|Variant=Shortmu
Case=Gen|Number=Sing|PrepCase=Preněho, něj
Case=Gen|Number=Singjehojej
Case=Ins|Number=Sing|PrepCase=Prením
Case=Ins|Number=Singjím
Case=Loc|Number=Sing|PrepCase=Preněm
Case=Nom|Number=Singononaono
Case=Nom|Number=Plurony

AUX

2450 AUX tokens (17% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Person=EMPTY (2450; 100%), Mood=EMPTY (2450; 100%), Voice=Act (2450; 100%), Tense=Past (2449; 100%), VerbForm=Part (2449; 100%), Polarity=Pos (2266; 92%), Number=Sing (1313; 54%).

AUX tokens may have the following values of Gender:

Paradigm býtFem,MascFem,NeutMascNeut
Animacy=Anim|Number=Plur|Polarity=Neg|Tense=Past|VerbForm=Partnebyli
Animacy=Anim|Number=Plur|Polarity=Pos|Tense=Past|VerbForm=Partbyli
Animacy=Inan|Number=Plur|Polarity=Neg|Tense=Past|VerbForm=Partnebyly
Animacy=Inan|Number=Plur|Polarity=Pos|Tense=Past|VerbForm=Partbyly
Aspect=Imp|Number=Sing|Polarity=Pos|Tense=Pres|VerbForm=Convjsouc
Number=Sing|Polarity=Neg|Tense=Past|VerbForm=Partnebylnebylo
Number=Sing|Polarity=Pos|Tense=Past|VerbForm=Partbylbylo
Number=Plur,Sing|Polarity=Neg|Tense=Past|VerbForm=Partnebyla
Number=Plur,Sing|Polarity=Pos|Tense=Past|VerbForm=Partbyla

NUM

1199 NUM tokens (16% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumValue=1,2,3 (1139; 95%), NumForm=Word (1139; 95%), NumType=Card (1139; 95%), Number=Sing (798; 67%).

NUM tokens may have the following values of Gender:

Paradigm jedenMascMasc,NeutFemNeut
Animacy=Anim|Case=Accjednoho
Animacy=Inan|Case=Accjeden
Case=Accjednujedno
Case=Datjednomujedné
Case=Genjednohojedné
Case=Insjednímjednou
Case=Locjednomjedné
Case=Nomjedenjednajedno

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[amod]–> ADJ (59331; 99%), NOUN –[conj]–> NOUN (7050; 50%), ADJ –[conj]–> ADJ (3642; 92%), ADJ –[nsubj]–> NOUN (1912; 77%), VERB –[conj]–> VERB (1111; 59%), PROPN –[flat]–> PROPN (838; 99%), PROPN –[nmod]–> NOUN (754; 85%), PROPN –[conj]–> PROPN (746; 65%), VERB –[nsubj]–> PROPN (733; 54%), NOUN –[appos]–> NOUN (692; 50%).