home bg/feat edit page issue tracker

This page still pertains to UD version 1.

Definite: definiteness or state

Definiteness is typically a feature of nouns, adjectives and articles. Its value distinguishes whether we are talking about something known and concrete, or something general or unknown. It can be marked on definite and indefinite articles, or directly on nouns, adjectives etc.

In Bulgarian there are definite and indefinite articles. The definite article is part of the word, in postposition (жената / zhenata ‘woman-the’ (the woman))). The indefinite articles can be: the form един / edin (one) or the zero marker.

However, when added to a nominal phrase, the articles become phrasal affixes, i.e. Bulgarian does not have agreement is definiteness. For example, хубавата висока руса жена / hubavata visoka rusa zhena ‘pretty-the tall blond woman’ (the pretty tall blond woman).

Ind: indefinite

Examples

Def: definite

Examples


Treebank Statistics (UD_Bulgarian)

This feature is universal. It occurs with 2 different values: Def, Ind.

55105 tokens (39%) have a non-empty value of Definite. 20921 types (84%) occur at least once with a non-empty value of Definite. 11797 lemmas (83%) occur at least once with a non-empty value of Definite. The feature is used with 10 part-of-speech tags: bg-pos/NOUN (29645; 21% instances), bg-pos/ADJ (12110; 9% instances), bg-pos/PROPN (7561; 5% instances), bg-pos/VERB (2424; 2% instances), bg-pos/NUM (1880; 1% instances), bg-pos/DET (714; 1% instances), bg-pos/ADV (451; 0% instances), bg-pos/AUX (211; 0% instances), bg-pos/PRON (108; 0% instances), bg-pos/ADP (1; 0% instances).

NOUN

29645 bg-pos/NOUN tokens (97% of all NOUN tokens) have a non-empty value of Definite.

The most frequent other feature values with which NOUN and Definite co-occurred: Number=Sing (21440; 72%).

NOUN tokens may have the following values of Definite:

Paradigm годинаIndDef
Number=Singг., годинагодината
Number=Plurг., годинигодините

ADJ

12110 bg-pos/ADJ tokens (99% of all ADJ tokens) have a non-empty value of Definite.

The most frequent other feature values with which ADJ and Definite co-occurred: Degree=Pos (11469; 95%), Voice=EMPTY (10775; 89%), VerbForm=EMPTY (10775; 89%), Aspect=EMPTY (10775; 89%), Number=Sing (8524; 70%).

ADJ tokens may have the following values of Definite:

Paradigm новIndDef
Degree=Pos|Gender=Masc|Number=Singновновия, новият
Degree=Pos|Gender=Fem|Number=Singновановата
Degree=Pos|Gender=Neut|Number=Singновоновото
Degree=Pos|Number=Plurновиновите
Degree=Sup|Gender=Masc|Number=Singнай-новият
Degree=Sup|Gender=Fem|Number=Singнай-новата
Degree=Sup|Gender=Neut|Number=SingНай-новото
Degree=Sup|Number=Plurнай-нови

PROPN

7561 bg-pos/PROPN tokens (99% of all PROPN tokens) have a non-empty value of Definite.

The most frequent other feature values with which PROPN and Definite co-occurred: Number=Sing (7427; 98%), Gender=Masc (4705; 62%).

PROPN tokens may have the following values of Definite:

Paradigm европаIndDef
ЕвропаЕвропата

Definite seems to be lexical feature of PROPN. 98% lemmas (2716) occur only with one value of Definite.

VERB

2424 bg-pos/VERB tokens (16% of all VERB tokens) have a non-empty value of Definite.

The most frequent other feature values with which VERB and Definite co-occurred: VerbForm=Part (2424; 100%), Person=EMPTY (2424; 100%), Mood=EMPTY (2409; 99%), Aspect=Perf (1842; 76%), Number=Sing (1663; 69%), Voice=Act (1445; 60%), Tense=Past (1223; 50%).

VERB tokens may have the following values of Definite:

Paradigm останаIndDef
Degree=Pos|Number=Plurостаналите
Gender=Masc|Number=Singостанал
Gender=Fem|Number=Singостанала
Gender=Neut|Number=SingОстаналото
Number=PlurостаналиОСТАНАЛИТЕ

Definite seems to be lexical feature of VERB. 100% lemmas (937) occur only with one value of Definite.

NUM

1880 bg-pos/NUM tokens (100% of all NUM tokens) have a non-empty value of Definite.

The most frequent other feature values with which NUM and Definite co-occurred: NumType=Card (1880; 100%), Number=Plur (1664; 89%), Gender=EMPTY (1416; 75%).

NUM tokens may have the following values of Definite:

Paradigm дваIndDef
Gender=Mascдва, 2двата
Gender=Femдве, 2двете
Gender=Neutдведвете
2, дведвете

Definite seems to be lexical feature of NUM. 97% lemmas (372) occur only with one value of Definite.

DET

714 bg-pos/DET tokens (33% of all DET tokens) have a non-empty value of Definite.

The most frequent other feature values with which DET and Definite co-occurred: Case=EMPTY (556; 78%), Number=Sing (538; 75%), Poss=Yes (465; 65%), PronType=Prs (465; 65%), Person=EMPTY (362; 51%).

DET tokens may have the following values of Definite:

Paradigm единIndDef
Case=Nom|Gender=Masc|Number=Singединединият
Case=Nom|Gender=Fem|Number=Singедната
Case=Nom|Gender=Neut|Number=Singедно
Case=Nom|Number=Plurедните
Gender=Masc|Number=Singединия
Gender=Fem|Number=Singедна, един
Number=Plurедни

ADV

451 bg-pos/ADV tokens (8% of all ADV tokens) have a non-empty value of Definite.

The most frequent other feature values with which ADV and Definite co-occurred: PronType=EMPTY (451; 100%), Degree=Pos (418; 93%).

ADV tokens may have the following values of Definite:

Paradigm многоIndDef
Degree=Posмного, Многаямногото
Degree=Supнай-много

AUX

211 bg-pos/AUX tokens (3% of all AUX tokens) have a non-empty value of Definite.

The most frequent other feature values with which AUX and Definite co-occurred: Mood=Ind (211; 100%), Voice=Act (211; 100%), Aspect=Imp (211; 100%), Person=EMPTY (211; 100%), Tense=EMPTY (211; 100%), VerbForm=Part (211; 100%), Number=Sing (154; 73%).

AUX tokens may have the following values of Definite:

PRON

108 bg-pos/PRON tokens (1% of all PRON tokens) have a non-empty value of Definite.

The most frequent other feature values with which PRON and Definite co-occurred: Poss=EMPTY (108; 100%), Person=EMPTY (108; 100%), Reflex=EMPTY (108; 100%), Case=Nom (101; 94%), Number=Sing (84; 78%), Gender=Neut (80; 74%), PronType=Ind (65; 60%).

PRON tokens may have the following values of Definite:

Paradigm никойIndDef
нищонищото

ADP

1 bg-pos/ADP tokens (0% of all ADP tokens) have a non-empty value of Definite.

ADP tokens may have the following values of Definite:

Relations with Agreement in Definite

The 10 most frequent relations where parent and child node agree in Definite: NOUN –[amod]–> ADJ (5207; 51%), NOUN –[nmod]–> NOUN (4375; 51%), NOUN –[conj]–> NOUN (1640; 83%), NOUN –[nmod]–> PROPN (1554; 53%), PROPN –[flat]–> PROPN (1418; 96%), PROPN –[conj]–> PROPN (539; 99%), ADJ –[obl]–> NOUN (340; 55%), PROPN –[nmod]–> PROPN (296; 97%), ADJ –[conj]–> ADJ (296; 91%), PROPN –[nmod]–> NOUN (277; 90%).


Definite in other languages: [am] [ar] [bg] [bxr] [ca] [ckb] [cop] [cu] [da] [de] [el] [en] [es] [et] [eu] [fa] [fo] [fr] [ga] [gl] [got] [grc] [he] [hi] [hr] [hu] [id] [it] [ja] [kk] [kmr] [ko] [la] [lv] [mr] [nl] [no] [pl] [pt] [ro] [sa] [sk] [sla] [sl] [so] [sr] [sv] [swl] [ta] [tr] [u] [ug] [uk] [ur] [urj] [vi] [yue] [zh]