Gender: gender
Gender is usually a lexical feature of nouns and an inflectional feature of other parts of speech (adjectives, verbs) that mark agreement with nouns. In Bulgarian, gender is grammatical.
There are three genders: masculine (m), feminine (f), and neuter (n).
Masc: masculine gender
Nouns denoting male persons are masculine. Other nouns may also be grammatically masculine, without any relation to sex.
Example: [bg] замък / zamak “castle”
Fem: feminine gender
Nouns denoting female persons are feminine. Other nouns may also be grammatically feminine, without any relation to sex.
Example: [bg] маса / masa “table”
Neut: neuter gender
Nouns that are grammatically neither masculine nor feminine are neuter.
Example: [bg] дете / dete “child”
Treebank Statistics (UD_Bulgarian)
This feature is universal.
It occurs with 3 different values: Fem, Masc, Neut.
52997 tokens (38%) have a non-empty value of Gender.
18352 types (74%) occur at least once with a non-empty value of Gender.
10734 lemmas (76%) occur at least once with a non-empty value of Gender.
The feature is used with 9 part-of-speech tags: bg-pos/NOUN (30163; 21% instances), bg-pos/ADJ (8547; 6% instances), bg-pos/PROPN (7546; 5% instances), bg-pos/PRON (2934; 2% instances), bg-pos/VERB (1663; 1% instances), bg-pos/DET (1525; 1% instances), bg-pos/NUM (464; 0% instances), bg-pos/AUX (154; 0% instances), bg-pos/ADP (1; 0% instances).
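Counts like those above can be recomputed from a treebank's CoNLL-U files. The following is a minimal sketch, not the script that generated this page: the helper names (`parse_feats`, `read_tokens`, `gender_coverage`, `row`) and the sample sentence are illustrative assumptions, and only the standard 10-column CoNLL-U layout is relied on.

```python
from collections import Counter


def parse_feats(feats):
    """Turn a CoNLL-U FEATS string such as 'Gender=Fem|Number=Sing' into a dict."""
    if feats in ("_", ""):
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def read_tokens(lines):
    """Yield (form, lemma, upos, feats) for every regular token line."""
    for line in lines:
        cols = line.rstrip("\n").split("\t")
        # Comments, blank lines, multiword ranges (1-2) and empty nodes (1.1) are skipped.
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        yield cols[1], cols[2], cols[3], parse_feats(cols[5])


def gender_coverage(lines):
    """Map each UPOS tag to (tokens with Gender, all tokens)."""
    total, with_gender = Counter(), Counter()
    for _form, _lemma, upos, feats in read_tokens(lines):
        total[upos] += 1
        if "Gender" in feats:
            with_gender[upos] += 1
    return {upos: (with_gender[upos], total[upos]) for upos in total}


def row(idx, form, lemma, upos, feats):
    """Hypothetical helper: build one CoNLL-U line, unused columns left as '_'."""
    return "\t".join([str(idx), form, lemma, upos, "_", feats, "_", "_", "_", "_"])


# Made-up sample sentence: "Новата маса е голяма ." ("The new table is big.")
SAMPLE = [
    "# text = Новата маса е голяма .",
    row(1, "Новата", "нов", "ADJ", "Definite=Def|Degree=Pos|Gender=Fem|Number=Sing"),
    row(2, "маса", "маса", "NOUN", "Definite=Ind|Gender=Fem|Number=Sing"),
    row(3, "е", "съм", "AUX", "Mood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin"),
    row(4, "голяма", "голям", "ADJ", "Definite=Ind|Degree=Pos|Gender=Fem|Number=Sing"),
    row(5, ".", ".", "PUNCT", "_"),
    "",
]

for upos, (n, total) in sorted(gender_coverage(SAMPLE).items()):
    print(f"{upos}: {n}/{total} tokens with a non-empty Gender")
```

For a real treebank, an open file handle over any CoNLL-U file can be passed in place of `SAMPLE`, since the functions only iterate over lines.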
NOUN
30163 bg-pos/NOUN tokens (98% of all NOUN tokens) have a non-empty value of Gender.
The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (21455; 71%), Definite=Ind (18466; 61%).
NOUN tokens may have the following values of Gender:
Fem (10836; 36% of non-empty Gender): г., година, години, част, страната, страна, политика, страни, пари, работа
Masc (12736; 42% of non-empty Gender): %, лв., млн., президентът, път, съвет, края, човек, министър, начин
Neut (6591; 22% of non-empty Gender): време, събрание, решение, правителството, право, място, началото, времето, решението, справяне
EMPTY (503): хората, хора, души, преговори, преговорите, глава, партия, собственост, финансите, интерес
Paradigm глава | Masc | Fem |
---|---|---|
Definite=Def\|Number=Sing | главата | |
Definite=Def\|Number=Plur | главите | |
Definite=Ind\|Number=Sing | глава | глава |
Definite=Ind\|Number=Plur | глави | |
Gender seems to be a lexical feature of NOUN. 100% of lemmas (5259) occur with only one value of Gender.
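Whether Gender behaves as a lexical feature of a part of speech can be checked by collecting, for each lemma, the set of Gender values it occurs with. A minimal sketch under assumptions: the names `single_gender_share` and `row` and the sample rows are hypothetical, and the exact counting rule behind the percentage on this page is not stated here, so the denominator used below (lemmas seen at least once with Gender) is a guess.

```python
from collections import defaultdict


def parse_feats(feats):
    """Turn a CoNLL-U FEATS string into a dict; '_' means no features."""
    if feats in ("_", ""):
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def single_gender_share(lines, upos):
    """Return (lemmas with exactly one Gender value, lemmas with any Gender value)."""
    values = defaultdict(set)
    for line in lines:
        cols = line.rstrip("\n").split("\t")
        # Keep only regular token lines of the requested part of speech.
        if len(cols) != 10 or not cols[0].isdigit() or cols[3] != upos:
            continue
        gender = parse_feats(cols[5]).get("Gender")
        if gender is not None:
            values[cols[2]].add(gender)
    single = sum(1 for seen in values.values() if len(seen) == 1)
    return single, len(values)


def row(idx, form, lemma, upos, feats):
    """Hypothetical helper: build one CoNLL-U line, unused columns left as '_'."""
    return "\t".join([str(idx), form, lemma, upos, "_", feats, "_", "_", "_", "_"])


# Made-up rows; 'глава' is given two genders, mirroring the paradigm table above.
SAMPLE = [
    row(1, "маса", "маса", "NOUN", "Definite=Ind|Gender=Fem|Number=Sing"),
    row(2, "масата", "маса", "NOUN", "Definite=Def|Gender=Fem|Number=Sing"),
    row(3, "замък", "замък", "NOUN", "Definite=Ind|Gender=Masc|Number=Sing"),
    row(4, "глава", "глава", "NOUN", "Definite=Ind|Gender=Fem|Number=Sing"),
    row(5, "глава", "глава", "NOUN", "Definite=Ind|Gender=Masc|Number=Sing"),
    row(6, "хора", "хора", "NOUN", "Definite=Ind|Number=Plur"),  # no Gender at all
]

single, total = single_gender_share(SAMPLE, "NOUN")
print(f"{single} of {total} NOUN lemmas occur with only one value of Gender")
```

A share close to 100% suggests a lexical feature; a low share suggests an inflectional one, as expected for adjectives.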
ADJ
8547 bg-pos/ADJ tokens (70% of all ADJ tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (8547; 100%), Degree=Pos (8086; 95%), Voice=EMPTY (7783; 91%), Aspect=EMPTY (7783; 91%), VerbForm=EMPTY (7783; 91%), Definite=Ind (4678; 55%).
ADJ tokens may have the following values of Gender:
Fem (3436; 40% of non-empty Gender): българската, нова, 2001, европейската, голяма, националната, цялата, 2000, миналата, новата
Masc (3289; 38% of non-empty Gender): друг, новия, европейския, 1, българския, нов, първи, българският, втори, новият
Neut (1822; 21% of non-empty Gender): народното, същото, цялото, друго, ново, новото, българското, народно, голямо, политическо
EMPTY (3667): други, другите, последните, нови, новите, различни, първите, българските, големи, български
Paradigm нов | Masc | Fem | Neut |
---|---|---|---|
Case=Voc\|Degree=Pos | Нови | | |
Definite=Def\|Degree=Pos | новия, новият | новата | новото |
Definite=Def\|Degree=Sup | най-новият | най-новата | Най-новото |
Definite=Ind\|Degree=Pos | нов | нова | ново |
PROPN
7546 bg-pos/PROPN tokens (99% of all PROPN tokens) have a non-empty value of Gender.
The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (7432; 98%), Definite=Ind (7281; 96%).
PROPN tokens may have the following values of Gender:
Fem (2487; 33% of non-empty Gender): България, София, Европа, Турция, Русия, Югославия, БСП, Румъния, Франция, Германия
Masc (4709; 62% of non-empty Gender): Иван, ЕС, СДС, Петър, Стоянов, Костов, Георги, САЩ, ЦСКА, Йордан
Neut (350; 5% of non-empty Gender): МВР, Косово, ДПС, БНР, Русе, НС, Би, Панчарево, Ауди, Белене
EMPTY (84): де, ван, -, 2000, Кремиковци, Р-300, ал, дела, ди, дьо
Paradigm бел | Masc | Fem | Neut |
---|---|---|---|
| | БЕЛ | БЕЛ | БЕЛ |
Gender seems to be a lexical feature of PROPN. 99% of lemmas (2712) occur with only one value of Gender.
PRON
2934 bg-pos/PRON tokens (32% of all PRON tokens) have a non-empty value of Gender.
The most frequent other feature values with which PRON and Gender co-occurred: Poss=EMPTY (2934; 100%), Number=Sing (2934; 100%), Reflex=EMPTY (2934; 100%), Case=Nom (2028; 69%), PronType=Prs (1609; 55%), Person=3 (1609; 55%).
PRON tokens may have the following values of Gender:
Fem (619; 21% of non-empty Gender): тя, която, я, нея, й, коя, Едната, Тази
Masc (1404; 48% of non-empty Gender): той, го, който, му, него, кой, никой, кого, някой, всеки
Neut (911; 31% of non-empty Gender): това, което, го, то, всичко, нищо, нещо, него, кое, всичкото
EMPTY (6179): се, си, които, му, ни, те, им, ми, аз, ти
Paradigm аз | Masc | Fem | Neut |
---|---|---|---|
Case=Acc | го, него | я, нея | го, него |
Case=Dat | му, нему | й | му |
Case=Nom | той | тя | то |
й |
VERB
1663 bg-pos/VERB tokens (11% of all VERB tokens) have a non-empty value of Gender.
The most frequent other feature values with which VERB and Gender co-occurred: Person=EMPTY (1663; 100%), VerbForm=Part (1663; 100%), Number=Sing (1663; 100%), Definite=Ind (1662; 100%), Mood=EMPTY (1650; 99%), Aspect=Perf (1248; 75%), Voice=Act (1030; 62%), Tense=Past (870; 52%).
VERB tokens may have the following values of Gender:
Fem (444; 27% of non-empty Gender): била, могла, можела, получила, представена, започнала, поставена, приета, включена, дошла
Masc (899; 54% of non-empty Gender): дал, заминал, искал, направил, имал, станал, дошъл, видял, избран, казал
Neut (320; 19% of non-empty Gender): имало, трябвало, станало, направено, нямало, било, налагало, могло, свързано, извършено
EMPTY (13829): има, може, няма, трябва, е, каза, могат, съобщи, заяви, стана
Paradigm мога | Masc | Fem | Neut |
---|---|---|---|
Tense=Imp | можел | можела | можело |
Tense=Past | могъл | могла | могло |
DET
1525 bg-pos/DET tokens (71% of all DET tokens) have a non-empty value of Gender.
The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (1525; 100%), Person=EMPTY (1293; 85%), Poss=EMPTY (1220; 80%), Definite=EMPTY (987; 65%), Case=EMPTY (912; 60%).
DET tokens may have the following values of Gender:
Fem (532; 35% of non-empty Gender): тази, една, всяка, каква, нашата, такава, тая, неговата, своята, някаква
Masc (549; 36% of non-empty Gender): този, един, всеки, своя, такъв, какъв, тоя, никакъв, някакъв, някой
Neut (444; 29% of non-empty Gender): това, какво, едно, всяко, такова, своето, тяхното, нашето, негово, неговото
EMPTY (635): тези, всички, нашите, някои, какви, своите, такива, наши, техните, тия
Paradigm този | Masc | Fem | Neut |
---|---|---|---|
Case=Nom | тази, тая, онази, тeзи | това, онова, туй | |
този, тоя, оня, онзи |
NUM
464 bg-pos/NUM tokens (25% of all NUM tokens) have a non-empty value of Gender.
The most frequent other feature values with which NUM and Gender co-occurred: NumType=Card (464; 100%), Definite=Ind (403; 87%), Number=Plur (254; 55%).
NUM tokens may have the following values of Gender:
Fem (179; 39% of non-empty Gender): две, една, двете, 2, 1, 22, 0, 52, 42, 0.00
Masc (191; 41% of non-empty Gender): един, два, 2, двата, 1, 22, Единият, 32, 4162, 72
Neut (94; 20% of non-empty Gender): едно, 1, две, двете, едното, 42
EMPTY (1419): 3, три, 10, двамата, 20, 000, 4, 15, 30, 6
Paradigm два | Masc | Fem | Neut |
---|---|---|---|
Definite=Def | двата | двете | двете |
Definite=Ind | два, 2 | две, 2 | две |
AUX
154 bg-pos/AUX tokens (2% of all AUX tokens) have a non-empty value of Gender.
The most frequent other feature values with which AUX and Gender co-occurred: Person=EMPTY (154; 100%), Tense=EMPTY (154; 100%), VerbForm=Part (154; 100%), Number=Sing (154; 100%), Mood=Ind (154; 100%), Voice=Act (154; 100%), Aspect=Imp (154; 100%).
AUX tokens may have the following values of Gender:
Fem (51; 33% of non-empty Gender): била
Masc (73; 47% of non-empty Gender): бил
Neut (30; 19% of non-empty Gender): било
EMPTY (7748): да, е, ще, са, бе, бъде, беше, бяха, съм, бъдат
Paradigm съм | Masc | Fem | Neut |
---|---|---|---|
| | бил | била | било |
ADP
1 bg-pos/ADP token (0% of all ADP tokens) has a non-empty value of Gender.
ADP tokens may have the following values of Gender:
Neut (1; 100% of non-empty Gender): сравнение
EMPTY (19858): на, в, за, от, с, по, до, след, като, през
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
NOUN –[amod]–> ADJ (7143; 70%),
NOUN –[nmod]–> PROPN (1608; 55%),
PROPN –[flat]–> PROPN (1404; 95%),
NOUN –[det]–> DET (1209; 69%),
PROPN –[conj]–> PROPN (391; 72%),
ADJ –[nsubj]–> NOUN (248; 73%),
ADJ –[conj]–> ADJ (219; 97%),
PROPN –[amod]–> ADJ (216; 83%),
PROPN –[nmod]–> PROPN (215; 70%),
PROPN –[nmod]–> NOUN (208; 67%).
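An agreement table of this kind can be approximated by walking every dependency edge and comparing the Gender values of parent and child. This is a rough sketch, not the procedure used to build the list above: the names `read_sentences`, `gender_agreement` and `row` and the two sample sentences are hypothetical, and the percentage printed here assumes the denominator is the number of edges of that type where both nodes carry Gender, which may differ from what this page reports.

```python
from collections import Counter


def parse_feats(feats):
    """Turn a CoNLL-U FEATS string into a dict; '_' means no features."""
    if feats in ("_", ""):
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def read_sentences(lines):
    """Yield one {token id: token info} dict per sentence."""
    sentence = {}
    for line in lines:
        line = line.rstrip("\n")
        if not line.strip():  # a blank line closes the current sentence
            if sentence:
                yield sentence
                sentence = {}
            continue
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # comments, multiword ranges, empty nodes
        sentence[int(cols[0])] = {
            "upos": cols[3],
            "feats": parse_feats(cols[5]),
            "head": int(cols[6]) if cols[6].isdigit() else 0,
            "deprel": cols[7],
        }
    if sentence:
        yield sentence


def gender_agreement(lines):
    """Count edges where both nodes have Gender, and those where the values match."""
    agree, comparable = Counter(), Counter()
    for sentence in read_sentences(lines):
        for tok in sentence.values():
            parent = sentence.get(tok["head"])  # None for the root (head 0)
            if parent is None:
                continue
            child_gender = tok["feats"].get("Gender")
            parent_gender = parent["feats"].get("Gender")
            if child_gender is None or parent_gender is None:
                continue
            key = (parent["upos"], tok["deprel"], tok["upos"])
            comparable[key] += 1
            if child_gender == parent_gender:
                agree[key] += 1
    return agree, comparable


def row(idx, form, lemma, upos, feats, head, deprel):
    """Hypothetical helper: build one CoNLL-U line, unused columns left as '_'."""
    return "\t".join([str(idx), form, lemma, upos, "_", feats, str(head), deprel, "_", "_"])


# Made-up sentences: "Новата маса" and "Замъкът на България".
SAMPLE = [
    row(1, "Новата", "нов", "ADJ", "Gender=Fem|Number=Sing", 2, "amod"),
    row(2, "маса", "маса", "NOUN", "Gender=Fem|Number=Sing", 0, "root"),
    "",
    row(1, "Замъкът", "замък", "NOUN", "Gender=Masc|Number=Sing", 0, "root"),
    row(2, "на", "на", "ADP", "_", 3, "case"),
    row(3, "България", "България", "PROPN", "Gender=Fem|Number=Sing", 1, "nmod"),
    "",
]

agree, comparable = gender_agreement(SAMPLE)
for (parent, deprel, child), n in agree.most_common(10):
    share = 100 * n // comparable[(parent, deprel, child)]
    print(f"{parent} -[{deprel}]-> {child} ({n}; {share}%)")
```

Edges such as NOUN -[nmod]-> PROPN illustrate why agreement rates below 100% are expected: a nominal modifier keeps its own lexical gender regardless of its head.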
Gender in other languages: [am] [ar] [bg] [bxr] [ca] [ckb] [cop] [cs] [cu] [da] [de] [el] [en] [es] [et] [eu] [fa] [fo] [fr] [ga] [gl] [got] [grc] [he] [hi] [hr] [hu] [id] [it] [ja] [kk] [kmr] [ko] [la] [lv] [mr] [nl] [no] [pl] [pt] [ro] [ru] [sa] [sk] [sla] [sl] [so] [sr] [sv] [swl] [ta] [tr] [u] [ug] [uk] [ur] [vi] [yue] [zh]