home bg/feat edit page issue tracker

This page still pertains to UD version 1.

Number: number

Number is an inflectional feature of nouns, adjectives, verbs. In the tagset it is encoded as: singular (s), plural (p), count (c), pluralia tantum (l). Singularia tantum is not encoded.

Sing: singular number

A singular noun denotes one person, animal or thing.

Examples: [bg] молив / moliv (pencil)

Plur: plural number

A plural noun denotes several persons, animals or things.

Examples: [bg] моливи / molivi (pencils)

Count: count plural form

A form that is used as plural for masculine non-person nouns after numerals. This is a remnant of the dual form.

Examples: [bg] 2 молива / (2) moliva (2 pencils-count)

Ptan: plurale tantum

Some nouns appear only in the plural form even though they denote one thing (semantic singular); some tagsets mark this distinction.

Examples: [bg] финанси, дънки / finansi, danki (finances, jeans)

Coll: collective / mass / singulare tantum

Collective or mass or singulare tantum is a special case of singular. It applies to words that use grammatical singular to describe sets of objects, i.e. semantic plural.

Examples: [bg] човечество / chovechestvo (mankind)


Treebank Statistics (UD_Bulgarian)

This feature is universal but the values Count are language-specific. It occurs with 4 different values: Count, Plur, Ptan, Sing.

78884 tokens (56%) have a non-empty value of Number. 26015 types (105%) occur at least once with a non-empty value of Number. 13377 lemmas (94%) occur at least once with a non-empty value of Number. The feature is used with 10 part-of-speech tags: bg-pos/NOUN (30458; 22% instances), bg-pos/VERB (15492; 11% instances), bg-pos/ADJ (12133; 9% instances), bg-pos/PROPN (7566; 5% instances), bg-pos/PRON (4827; 3% instances), bg-pos/AUX (3916; 3% instances), bg-pos/DET (2160; 2% instances), bg-pos/NUM (1880; 1% instances), bg-pos/ADV (451; 0% instances), bg-pos/ADP (1; 0% instances).

NOUN

30458 bg-pos/NOUN tokens (99% of all NOUN tokens) have a non-empty value of Number.

The most frequent other feature values with which NOUN and Number co-occurred: Definite=Ind (18626; 61%).

NOUN tokens may have the following values of Number:

Paradigm левSingPlurCount
Definite=Defлева, Левът
Definite=Indлевлевове
лв., лева

VERB

15492 bg-pos/VERB tokens (100% of all VERB tokens) have a non-empty value of Number.

The most frequent other feature values with which VERB and Number co-occurred: Voice=Act (14262; 92%), Gender=EMPTY (13829; 89%), VerbForm=Fin (13068; 84%), Definite=EMPTY (13068; 84%), Mood=Ind (12832; 83%), Person=3 (10650; 69%), Tense=Pres (8879; 57%), Aspect=Imp (8147; 53%).

VERB tokens may have the following values of Number:

Paradigm могаSingPlur
Definite=Ind|Gender=Masc|Tense=Imp|VerbForm=Partможел
Definite=Ind|Gender=Masc|Tense=Past|VerbForm=Partмогъл
Definite=Ind|Gender=Fem|Tense=Imp|VerbForm=Partможела
Definite=Ind|Gender=Fem|Tense=Past|VerbForm=Partмогла
Definite=Ind|Gender=Neut|Tense=Imp|VerbForm=Partможело
Definite=Ind|Gender=Neut|Tense=Past|VerbForm=Partмогло
Definite=Ind|Tense=Imp|VerbForm=Partможели
Definite=Ind|Tense=Past|VerbForm=Partмогли
Mood=Ind|Person=1|Tense=Imp|VerbForm=FinможехМожехме
Mood=Ind|Person=1|Tense=Past|VerbForm=Finможах
Mood=Ind|Person=1|Tense=Pres|VerbForm=Finмогаможем
Mood=Ind|Person=2|Tense=Pres|VerbForm=Finможешможете
Mood=Ind|Person=3|Tense=Imp|VerbForm=Finможешеможеха
Mood=Ind|Person=3|Tense=Past|VerbForm=Finможаможаха
Mood=Ind|Person=3|Tense=Pres|VerbForm=Finможемогат

ADJ

12133 bg-pos/ADJ tokens (99% of all ADJ tokens) have a non-empty value of Number.

The most frequent other feature values with which ADJ and Number co-occurred: Degree=Pos (11492; 95%), Aspect=EMPTY (10798; 89%), VerbForm=EMPTY (10798; 89%), Voice=EMPTY (10798; 89%), Definite=Ind (6695; 55%).

ADJ tokens may have the following values of Number:

Paradigm новSingPlur
Case=Voc|Degree=Pos|Gender=MascНови
Definite=Def|Degree=Posновите
Definite=Def|Degree=Pos|Gender=Mascновия, новият
Definite=Def|Degree=Pos|Gender=Femновата
Definite=Def|Degree=Pos|Gender=Neutновото
Definite=Def|Degree=Sup|Gender=Mascнай-новият
Definite=Def|Degree=Sup|Gender=Femнай-новата
Definite=Def|Degree=Sup|Gender=NeutНай-новото
Definite=Ind|Degree=Posнови
Definite=Ind|Degree=Pos|Gender=Mascнов
Definite=Ind|Degree=Pos|Gender=Femнова
Definite=Ind|Degree=Pos|Gender=Neutново
Definite=Ind|Degree=Supнай-нови

PROPN

7566 bg-pos/PROPN tokens (99% of all PROPN tokens) have a non-empty value of Number.

The most frequent other feature values with which PROPN and Number co-occurred: Definite=Ind (7301; 96%), Gender=Masc (4709; 62%).

PROPN tokens may have the following values of Number:

Paradigm сдсSingPlur
Definite=Def|Gender=MascСДС
Definite=Def|Gender=NeutСДС-та
Definite=Ind|Gender=MascСДС

Number seems to be lexical feature of PROPN. 100% lemmas (2754) occur only with one value of Number.

PRON

4827 bg-pos/PRON tokens (53% of all PRON tokens) have a non-empty value of Number.

The most frequent other feature values with which PRON and Number co-occurred: Reflex=EMPTY (4827; 100%), Poss=EMPTY (4827; 100%), PronType=Prs (3048; 63%), Case=Nom (2983; 62%).

PRON tokens may have the following values of Number:

Paradigm азSingPlur
Case=Acc|Gender=Masc|Person=3го, него
Case=Acc|Gender=Fem|Person=3я, нея
Case=Acc|Gender=Neut|Person=3го, него
Case=Acc|Person=1ме, мен, мененас, ни
Case=Acc|Person=2те, тебе, ви, вас, тебвас, ви
Case=Acc|Person=3ги, тях
Case=Dat|Gender=Masc|Person=3му, нему
Case=Dat|Gender=Fem|Person=3й
Case=Dat|Gender=Neut|Person=3му
Case=Dat|Person=1ми, мен, менени
Case=Dat|Person=2ти, виви
Case=Dat|Person=3им, тям
Case=Nom|Gender=Masc|Person=3той
Case=Nom|Gender=Fem|Person=3тя
Case=Nom|Gender=Neut|Person=3то
Case=Nom|Person=1азние, ний
Case=Nom|Person=2ти, виевие
Case=Nom|Person=3те
Gender=Fem|Person=3й
Person=1мини
Person=2ти, виви
Person=3им

AUX

3916 bg-pos/AUX tokens (50% of all AUX tokens) have a non-empty value of Number.

The most frequent other feature values with which AUX and Number co-occurred: Mood=Ind (3816; 97%), Voice=Act (3816; 97%), VerbForm=Fin (3705; 95%), Aspect=Imp (3586; 92%), Person=3 (3342; 85%), Tense=Pres (3029; 77%).

AUX tokens may have the following values of Number:

Paradigm съмSingPlur
Definite=Ind|Gender=Masc|Mood=Ind|VerbForm=Part|Voice=Actбил
Definite=Ind|Gender=Fem|Mood=Ind|VerbForm=Part|Voice=Actбила
Definite=Ind|Gender=Neut|Mood=Ind|VerbForm=Part|Voice=Actбило
Definite=Ind|Mood=Ind|VerbForm=Part|Voice=Actбили
Mood=Cnd|Person=1|VerbForm=Finбихбихме
Mood=Cnd|Person=2|VerbForm=FinБибихте
Mood=Cnd|Person=3|Tense=Past|VerbForm=Finби
Mood=Cnd|Person=3|VerbForm=Finбиха
Mood=Ind|Person=1|Tense=Past|VerbForm=Fin|Voice=Actбяхбяхме
Mood=Ind|Person=1|Tense=Pres|VerbForm=Fin|Voice=Actсъмсме
Mood=Ind|Person=2|Tense=Past|VerbForm=Fin|Voice=Actбеше
Mood=Ind|Person=2|Tense=Pres|VerbForm=Fin|Voice=Actсисте
Mood=Ind|Person=2|VerbForm=Fin|Voice=Actбяхте
Mood=Ind|Person=3|Tense=Past|VerbForm=Fin|Voice=Actбе, бешебяха
Mood=Ind|Person=3|Tense=Pres|VerbForm=Fin|Voice=Actеса

DET

2160 bg-pos/DET tokens (100% of all DET tokens) have a non-empty value of Number.

The most frequent other feature values with which DET and Number co-occurred: Person=EMPTY (1808; 84%), Poss=EMPTY (1695; 78%), Definite=EMPTY (1446; 67%), Case=EMPTY (1363; 63%).

DET tokens may have the following values of Number:

Paradigm тозиSingPlur
Case=Nom|Gender=Femтази, тая, онази, тeзи
Case=Nom|Gender=Neutтова, онова, туй
Gender=Mascтози, тоя, оня, онзи
тези, тия, ония, онези

NUM

1880 bg-pos/NUM tokens (100% of all NUM tokens) have a non-empty value of Number.

The most frequent other feature values with which NUM and Number co-occurred: NumType=Card (1880; 100%), Definite=Ind (1748; 93%), Gender=EMPTY (1416; 75%).

NUM tokens may have the following values of Number:

Number seems to be lexical feature of NUM. 100% lemmas (383) occur only with one value of Number.

ADV

451 bg-pos/ADV tokens (8% of all ADV tokens) have a non-empty value of Number.

The most frequent other feature values with which ADV and Number co-occurred: PronType=EMPTY (451; 100%), Degree=Pos (418; 93%).

ADV tokens may have the following values of Number:

ADP

1 bg-pos/ADP tokens (0% of all ADP tokens) have a non-empty value of Number.

ADP tokens may have the following values of Number:

Relations with Agreement in Number

The 10 most frequent relations where parent and child node agree in Number: NOUN –[amod]–> ADJ (10028; 97%), NOUN –[nmod]–> NOUN (5279; 61%), VERB –[nsubj]–> NOUN (3943; 93%), VERB –[obj]–> NOUN (2490; 57%), VERB –[obl]–> NOUN (2420; 59%), NOUN –[nmod]–> PROPN (2384; 82%), VERB –[nsubj]–> PRON (1747; 98%), NOUN –[det]–> DET (1729; 97%), NOUN –[conj]–> NOUN (1549; 78%), PROPN –[flat]–> PROPN (1419; 96%).


Number in other languages: [am] [ar] [bg] [bxr] [ca] [ckb] [cop] [cs] [cu] [da] [de] [el] [en] [es] [et] [eu] [fa] [fi] [fo] [fr] [ga] [gl] [got] [grc] [he] [hi] [hr] [hu] [id] [it] [ja] [kk] [kmr] [ko] [la] [lv] [mr] [nl] [no] [pl] [pt] [ro] [ru] [sa] [sk] [sla] [sl] [so] [sr] [sv] [swl] [ta] [tr] [u] [ug] [uk] [ur] [urj] [vi] [yue] [zh]