Number
: number
Number is an inflectional feature of nouns, adjectives, verbs. In the tagset it is encoded as: singular (s), plural (p), count (c), pluralia tantum (l). Singularia tantum is not encoded.
Sing: singular number
A singular noun denotes one person, animal or thing.
Examples: [bg] молив / moliv (pencil)
Plur: plural number
A plural noun denotes several persons, animals or things.
Examples: [bg] моливи / molivi (pencils)
Count: count plural form
A form that is used as plural for masculine non-person nouns after numerals. This is a remnant of the dual form.
Examples: [bg] 2 молива / (2) moliva (2 pencils-count)
Ptan: plurale tantum
Some nouns appear only in the plural form even though they denote one thing (semantic singular); some tagsets mark this distinction.
Examples: [bg] финанси, дънки / finansi, danki (finances, jeans)
Coll: collective / mass / singulare tantum
Collective or mass or singulare tantum is a special case of singular. It applies to words that use grammatical singular to describe sets of objects, i.e. semantic plural.
Examples: [bg] човечество / chovechestvo (mankind)
Treebank Statistics (UD_Bulgarian)
This feature is universal but the values Count
are language-specific.
It occurs with 4 different values: Count
, Plur
, Ptan
, Sing
.
78884 tokens (56%) have a non-empty value of Number
.
26015 types (105%) occur at least once with a non-empty value of Number
.
13377 lemmas (94%) occur at least once with a non-empty value of Number
.
The feature is used with 10 part-of-speech tags: bg-pos/NOUN (30458; 22% instances), bg-pos/VERB (15492; 11% instances), bg-pos/ADJ (12133; 9% instances), bg-pos/PROPN (7566; 5% instances), bg-pos/PRON (4827; 3% instances), bg-pos/AUX (3916; 3% instances), bg-pos/DET (2160; 2% instances), bg-pos/NUM (1880; 1% instances), bg-pos/ADV (451; 0% instances), bg-pos/ADP (1; 0% instances).
NOUN
30458 bg-pos/NOUN tokens (99% of all NOUN
tokens) have a non-empty value of Number
.
The most frequent other feature values with which NOUN
and Number
co-occurred: Definite=Ind (18626; 61%).
NOUN
tokens may have the following values of Number
:
Count
(798; 3% of non-emptyNumber
): %, лв., млн., $, месеца, дни, лева, млрд., пъти, часаPlur
(7910; 26% of non-emptyNumber
): г., години, страни, пари, проблеми, представители, сили, промени, парите, фирмиPtan
(295; 1% of non-emptyNumber
): хората, хора, души, преговори, преговорите, финансите, боеприпаси, книжа, белезници, гащиSing
(21455; 70% of non-emptyNumber
): г., време, година, част, страната, президентът, път, събрание, страна, съветEMPTY
(208): глава, партия, собственост, интерес, президент, въпрос, съюз, училище, въстание, ТЕЦ
Paradigm лев | Sing | Plur | Count |
---|---|---|---|
Definite=Def | лева, Левът | ||
Definite=Ind | лев | левове | |
лв., лева |
VERB
15492 bg-pos/VERB tokens (100% of all VERB
tokens) have a non-empty value of Number
.
The most frequent other feature values with which VERB
and Number
co-occurred: Voice=Act (14262; 92%), Gender=EMPTY (13829; 89%), VerbForm=Fin (13068; 84%), Definite=EMPTY (13068; 84%), Mood=Ind (12832; 83%), Person=3 (10650; 69%), Tense=Pres (8879; 57%), Aspect=Imp (8147; 53%).
VERB
tokens may have the following values of Number
:
Plur
(4650; 30% of non-emptyNumber
): могат, имат, съобщиха, са, можем, работят, имаме, вземат, искат, няматSing
(10842; 70% of non-emptyNumber
): има, може, няма, трябва, е, каза, съобщи, заяви, стана, обяви
Paradigm мога | Sing | Plur |
---|---|---|
Definite=Ind|Gender=Masc|Tense=Imp|VerbForm=Part | можел | |
Definite=Ind|Gender=Masc|Tense=Past|VerbForm=Part | могъл | |
Definite=Ind|Gender=Fem|Tense=Imp|VerbForm=Part | можела | |
Definite=Ind|Gender=Fem|Tense=Past|VerbForm=Part | могла | |
Definite=Ind|Gender=Neut|Tense=Imp|VerbForm=Part | можело | |
Definite=Ind|Gender=Neut|Tense=Past|VerbForm=Part | могло | |
Definite=Ind|Tense=Imp|VerbForm=Part | можели | |
Definite=Ind|Tense=Past|VerbForm=Part | могли | |
Mood=Ind|Person=1|Tense=Imp|VerbForm=Fin | можех | Можехме |
Mood=Ind|Person=1|Tense=Past|VerbForm=Fin | можах | |
Mood=Ind|Person=1|Tense=Pres|VerbForm=Fin | мога | можем |
Mood=Ind|Person=2|Tense=Pres|VerbForm=Fin | можеш | можете |
Mood=Ind|Person=3|Tense=Imp|VerbForm=Fin | можеше | можеха |
Mood=Ind|Person=3|Tense=Past|VerbForm=Fin | можа | можаха |
Mood=Ind|Person=3|Tense=Pres|VerbForm=Fin | може | могат |
ADJ
12133 bg-pos/ADJ tokens (99% of all ADJ
tokens) have a non-empty value of Number
.
The most frequent other feature values with which ADJ
and Number
co-occurred: Degree=Pos (11492; 95%), Aspect=EMPTY (10798; 89%), VerbForm=EMPTY (10798; 89%), Voice=EMPTY (10798; 89%), Definite=Ind (6695; 55%).
ADJ
tokens may have the following values of Number
:
Plur
(3586; 30% of non-emptyNumber
): други, другите, последните, нови, новите, различни, първите, българските, големи, българскиSing
(8547; 70% of non-emptyNumber
): народното, българската, нова, 2001, друг, европейската, голяма, националната, цялата, 2000EMPTY
(81): т.нар., US, жп, държавен, народна, политически, важен, военноморско, навигационен, нов
Paradigm нов | Sing | Plur |
---|---|---|
Case=Voc|Degree=Pos|Gender=Masc | Нови | |
Definite=Def|Degree=Pos | новите | |
Definite=Def|Degree=Pos|Gender=Masc | новия, новият | |
Definite=Def|Degree=Pos|Gender=Fem | новата | |
Definite=Def|Degree=Pos|Gender=Neut | новото | |
Definite=Def|Degree=Sup|Gender=Masc | най-новият | |
Definite=Def|Degree=Sup|Gender=Fem | най-новата | |
Definite=Def|Degree=Sup|Gender=Neut | Най-новото | |
Definite=Ind|Degree=Pos | нови | |
Definite=Ind|Degree=Pos|Gender=Masc | нов | |
Definite=Ind|Degree=Pos|Gender=Fem | нова | |
Definite=Ind|Degree=Pos|Gender=Neut | ново | |
Definite=Ind|Degree=Sup | най-нови |
PROPN
7566 bg-pos/PROPN tokens (99% of all PROPN
tokens) have a non-empty value of Number
.
The most frequent other feature values with which PROPN
and Number
co-occurred: Definite=Ind (7301; 96%), Gender=Masc (4709; 62%).
PROPN
tokens may have the following values of Number
:
Plur
(126; 2% of non-emptyNumber
): САЩ, Балканите, БДЖ, ОДС, DM, Гласове, Полимери, Балкани, КЕШ, КлинтънPtan
(8; 0% of non-emptyNumber
): Кремиковци, ОАЕ, Брадвари, ДрагалевциSing
(7432; 98% of non-emptyNumber
): България, София, Иван, ЕС, Европа, СДС, Петър, Стоянов, Костов, ГеоргиEMPTY
(64): де, ван, -, 2000, Р-300, ал, дела, ди, дьо, 173
Paradigm сдс | Sing | Plur |
---|---|---|
Definite=Def|Gender=Masc | СДС | |
Definite=Def|Gender=Neut | СДС-та | |
Definite=Ind|Gender=Masc | СДС |
Number
seems to be lexical feature of PROPN
. 100% lemmas (2754) occur only with one value of Number
.
PRON
4827 bg-pos/PRON tokens (53% of all PRON
tokens) have a non-empty value of Number
.
The most frequent other feature values with which PRON
and Number
co-occurred: Reflex=EMPTY (4827; 100%), Poss=EMPTY (4827; 100%), PronType=Prs (3048; 63%), Case=Nom (2983; 62%).
PRON
tokens may have the following values of Number
:
Plur
(1274; 26% of non-emptyNumber
): които, те, ги, тях, нас, ни, ние, им, всички, виSing
(3553; 74% of non-emptyNumber
): това, той, го, който, тя, която, му, него, което, азEMPTY
(4286): се, си, му, ни, й, им, ми, себе, ви, ти
Paradigm аз | Sing | Plur |
---|---|---|
Case=Acc|Gender=Masc|Person=3 | го, него | |
Case=Acc|Gender=Fem|Person=3 | я, нея | |
Case=Acc|Gender=Neut|Person=3 | го, него | |
Case=Acc|Person=1 | ме, мен, мене | нас, ни |
Case=Acc|Person=2 | те, тебе, ви, вас, теб | вас, ви |
Case=Acc|Person=3 | ги, тях | |
Case=Dat|Gender=Masc|Person=3 | му, нему | |
Case=Dat|Gender=Fem|Person=3 | й | |
Case=Dat|Gender=Neut|Person=3 | му | |
Case=Dat|Person=1 | ми, мен, мене | ни |
Case=Dat|Person=2 | ти, ви | ви |
Case=Dat|Person=3 | им, тям | |
Case=Nom|Gender=Masc|Person=3 | той | |
Case=Nom|Gender=Fem|Person=3 | тя | |
Case=Nom|Gender=Neut|Person=3 | то | |
Case=Nom|Person=1 | аз | ние, ний |
Case=Nom|Person=2 | ти, вие | вие |
Case=Nom|Person=3 | те | |
Gender=Fem|Person=3 | й | |
Person=1 | ми | ни |
Person=2 | ти, ви | ви |
Person=3 | им |
AUX
3916 bg-pos/AUX tokens (50% of all AUX
tokens) have a non-empty value of Number
.
The most frequent other feature values with which AUX
and Number
co-occurred: Mood=Ind (3816; 97%), Voice=Act (3816; 97%), VerbForm=Fin (3705; 95%), Aspect=Imp (3586; 92%), Person=3 (3342; 85%), Tense=Pres (3029; 77%).
AUX
tokens may have the following values of Number
:
Plur
(1121; 29% of non-emptyNumber
): са, бяха, бъдат, сме, били, сте, бъдем, биха, бихте, бяхтеSing
(2795; 71% of non-emptyNumber
): е, бе, бъде, беше, съм, бил, си, би, била, бихEMPTY
(3986): да, ще, е, бъдат, са, беше, бъде, съм
Paradigm съм | Sing | Plur |
---|---|---|
Definite=Ind|Gender=Masc|Mood=Ind|VerbForm=Part|Voice=Act | бил | |
Definite=Ind|Gender=Fem|Mood=Ind|VerbForm=Part|Voice=Act | била | |
Definite=Ind|Gender=Neut|Mood=Ind|VerbForm=Part|Voice=Act | било | |
Definite=Ind|Mood=Ind|VerbForm=Part|Voice=Act | били | |
Mood=Cnd|Person=1|VerbForm=Fin | бих | бихме |
Mood=Cnd|Person=2|VerbForm=Fin | Би | бихте |
Mood=Cnd|Person=3|Tense=Past|VerbForm=Fin | би | |
Mood=Cnd|Person=3|VerbForm=Fin | биха | |
Mood=Ind|Person=1|Tense=Past|VerbForm=Fin|Voice=Act | бях | бяхме |
Mood=Ind|Person=1|Tense=Pres|VerbForm=Fin|Voice=Act | съм | сме |
Mood=Ind|Person=2|Tense=Past|VerbForm=Fin|Voice=Act | беше | |
Mood=Ind|Person=2|Tense=Pres|VerbForm=Fin|Voice=Act | си | сте |
Mood=Ind|Person=2|VerbForm=Fin|Voice=Act | бяхте | |
Mood=Ind|Person=3|Tense=Past|VerbForm=Fin|Voice=Act | бе, беше | бяха |
Mood=Ind|Person=3|Tense=Pres|VerbForm=Fin|Voice=Act | е | са |
DET
2160 bg-pos/DET tokens (100% of all DET
tokens) have a non-empty value of Number
.
The most frequent other feature values with which DET
and Number
co-occurred: Person=EMPTY (1808; 84%), Poss=EMPTY (1695; 78%), Definite=EMPTY (1446; 67%), Case=EMPTY (1363; 63%).
DET
tokens may have the following values of Number
:
Plur
(634; 29% of non-emptyNumber
): тези, всички, нашите, някои, какви, своите, такива, наши, техните, тияSing
(1526; 71% of non-emptyNumber
): тази, този, това, един, какво, една, всеки, едно, всяка, своя
Paradigm този | Sing | Plur |
---|---|---|
Case=Nom|Gender=Fem | тази, тая, онази, тeзи | |
Case=Nom|Gender=Neut | това, онова, туй | |
Gender=Masc | този, тоя, оня, онзи | |
тези, тия, ония, онези |
NUM
1880 bg-pos/NUM tokens (100% of all NUM
tokens) have a non-empty value of Number
.
The most frequent other feature values with which NUM
and Number
co-occurred: NumType=Card (1880; 100%), Definite=Ind (1748; 93%), Gender=EMPTY (1416; 75%).
NUM
tokens may have the following values of Number
:
Plur
(1664; 89% of non-emptyNumber
): две, 2, два, 3, три, 10, двамата, двете, 20, 000Sing
(216; 11% of non-emptyNumber
): един, една, 1, едно, половин, 0, Единият, едното, 0,1, 0.00EMPTY
(3): 02, 08, 2000
Number
seems to be lexical feature of NUM
. 100% lemmas (383) occur only with one value of Number
.
ADV
451 bg-pos/ADV tokens (8% of all ADV
tokens) have a non-empty value of Number
.
The most frequent other feature values with which ADV
and Number
co-occurred: PronType=EMPTY (451; 100%), Degree=Pos (418; 93%).
ADV
tokens may have the following values of Number
:
Plur
(451; 100% of non-emptyNumber
): много, повече, малко, повечето, по-малко, най-много, най-малко, малкото, Многая, Най-малкотоEMPTY
(5436): още, вчера, само, вече, когато, защото, обаче, как, сега, така
ADP
1 bg-pos/ADP tokens (0% of all ADP
tokens) have a non-empty value of Number
.
ADP
tokens may have the following values of Number
:
Sing
(1; 100% of non-emptyNumber
): сравнениеEMPTY
(19858): на, в, за, от, с, по, до, след, като, през
Relations with Agreement in Number
The 10 most frequent relations where parent and child node agree in Number
:
NOUN –[amod]–> ADJ (10028; 97%),
NOUN –[nmod]–> NOUN (5279; 61%),
VERB –[nsubj]–> NOUN (3943; 93%),
VERB –[obj]–> NOUN (2490; 57%),
VERB –[obl]–> NOUN (2420; 59%),
NOUN –[nmod]–> PROPN (2384; 82%),
VERB –[nsubj]–> PRON (1747; 98%),
NOUN –[det]–> DET (1729; 97%),
NOUN –[conj]–> NOUN (1549; 78%),
PROPN –[flat]–> PROPN (1419; 96%).
Number in other languages: [am] [ar] [bg] [bxr] [ca] [ckb] [cop] [cs] [cu] [da] [de] [el] [en] [es] [et] [eu] [fa] [fi] [fo] [fr] [ga] [gl] [got] [grc] [he] [hi] [hr] [hu] [id] [it] [ja] [kk] [kmr] [ko] [la] [lv] [mr] [nl] [no] [pl] [pt] [ro] [ru] [sa] [sk] [sla] [sl] [so] [sr] [sv] [swl] [ta] [tr] [u] [ug] [uk] [ur] [urj] [vi] [yue] [zh]