Definite
: definiteness or state
Definiteness is typically a feature of nouns, adjectives and articles. Its value distinguishes whether we are talking about something known and concrete, or something general or unknown. It can be marked on definite and indefinite articles, or directly on nouns, adjectives etc.
In Bulgarian there are definite and indefinite articles. The definite article is part of the word, in postposition (жената / zhenata ‘woman-the’ (the woman))). The indefinite articles can be: the form един / edin (one) or the zero marker.
However, when added to a nominal phrase, the articles become phrasal affixes, i.e. Bulgarian does not have agreement is definiteness. For example, хубавата висока руса жена / hubavata visoka rusa zhena ‘pretty-the tall blond woman’ (the pretty tall blond woman).
Ind
: indefinite
Examples
- Видях една жена да минава по улицата / Vidyah edna zhena da minava po ulitsata “I saw a woman walking on the street”
- Видях жена да минава по улицата / Vidyah zhena da minava po ulitsata “I saw a woman walking on the street”
Def
: definite
Examples
- Видях жената да минава по улицата / Vidyah zhenata da minava po ulitsata “I saw the woman walking on the street”
Treebank Statistics (UD_Bulgarian)
This feature is universal.
It occurs with 2 different values: Def
, Ind
.
55105 tokens (39%) have a non-empty value of Definite
.
20921 types (84%) occur at least once with a non-empty value of Definite
.
11797 lemmas (83%) occur at least once with a non-empty value of Definite
.
The feature is used with 10 part-of-speech tags: bg-pos/NOUN (29645; 21% instances), bg-pos/ADJ (12110; 9% instances), bg-pos/PROPN (7561; 5% instances), bg-pos/VERB (2424; 2% instances), bg-pos/NUM (1880; 1% instances), bg-pos/DET (714; 1% instances), bg-pos/ADV (451; 0% instances), bg-pos/AUX (211; 0% instances), bg-pos/PRON (108; 0% instances), bg-pos/ADP (1; 0% instances).
NOUN
29645 bg-pos/NOUN tokens (97% of all NOUN
tokens) have a non-empty value of Definite
.
The most frequent other feature values with which NOUN
and Definite
co-occurred: Number=Sing (21440; 72%).
NOUN
tokens may have the following values of Definite
:
Def
(11019; 37% of non-emptyDefinite
): хората, страната, президентът, края, правителството, държавата, закона, шефът, началото, парламентаInd
(18626; 63% of non-emptyDefinite
): г., време, година, години, част, хора, път, събрание, страна, съветEMPTY
(1021): %, лв., млн., $, месеца, дни, лева, млрд., пъти, часа
Paradigm година | Ind | Def |
---|---|---|
Number=Sing | г., година | годината |
Number=Plur | г., години | годините |
ADJ
12110 bg-pos/ADJ tokens (99% of all ADJ
tokens) have a non-empty value of Definite
.
The most frequent other feature values with which ADJ
and Definite
co-occurred: Degree=Pos (11469; 95%), Voice=EMPTY (10775; 89%), VerbForm=EMPTY (10775; 89%), Aspect=EMPTY (10775; 89%), Number=Sing (8524; 70%).
ADJ
tokens may have the following values of Definite
:
Def
(5415; 45% of non-emptyDefinite
): народното, българската, другите, последните, европейската, националната, цялата, миналата, новия, новитеInd
(6695; 55% of non-emptyDefinite
): други, нова, 2001, нови, друг, български, голяма, 2000, различни, 1EMPTY
(104): т.нар., US, Нови, уважаеми, жп, държавен, народна, политически, важен, военноморско
Paradigm нов | Ind | Def |
---|---|---|
Degree=Pos|Gender=Masc|Number=Sing | нов | новия, новият |
Degree=Pos|Gender=Fem|Number=Sing | нова | новата |
Degree=Pos|Gender=Neut|Number=Sing | ново | новото |
Degree=Pos|Number=Plur | нови | новите |
Degree=Sup|Gender=Masc|Number=Sing | най-новият | |
Degree=Sup|Gender=Fem|Number=Sing | най-новата | |
Degree=Sup|Gender=Neut|Number=Sing | Най-новото | |
Degree=Sup|Number=Plur | най-нови |
PROPN
7561 bg-pos/PROPN tokens (99% of all PROPN
tokens) have a non-empty value of Definite
.
The most frequent other feature values with which PROPN
and Definite
co-occurred: Number=Sing (7427; 98%), Gender=Masc (4705; 62%).
PROPN
tokens may have the following values of Definite
:
Def
(260; 3% of non-emptyDefinite
): Балканите, СДС, ЕС, Конституцията, НИС, Дяволът, Евросъюза, ЦСКА, Евролевицата, САЩInd
(7301; 97% of non-emptyDefinite
): България, София, Иван, Европа, Петър, ЕС, Стоянов, СДС, Костов, ГеоргиEMPTY
(69): де, ван, -, 2000, Р-300, ал, дела, ди, дьо, 173
Paradigm европа | Ind | Def |
---|---|---|
Европа | Европата |
Definite
seems to be lexical feature of PROPN
. 98% lemmas (2716) occur only with one value of Definite
.
VERB
2424 bg-pos/VERB tokens (16% of all VERB
tokens) have a non-empty value of Definite
.
The most frequent other feature values with which VERB
and Definite
co-occurred: VerbForm=Part (2424; 100%), Person=EMPTY (2424; 100%), Mood=EMPTY (2409; 99%), Aspect=Perf (1842; 76%), Number=Sing (1663; 69%), Voice=Act (1445; 60%), Tense=Past (1223; 50%).
VERB
tokens may have the following values of Definite
:
Def
(5; 0% of non-emptyDefinite
): останалите, Останалото, Уволнените, отличенитеInd
(2419; 100% of non-emptyDefinite
): имало, дал, заминал, искал, направил, направили, трябвало, имал, станал, станалоEMPTY
(13068): има, може, няма, трябва, е, каза, могат, съобщи, заяви, стана
Paradigm остана | Ind | Def |
---|---|---|
Degree=Pos|Number=Plur | останалите | |
Gender=Masc|Number=Sing | останал | |
Gender=Fem|Number=Sing | останала | |
Gender=Neut|Number=Sing | Останалото | |
Number=Plur | останали | ОСТАНАЛИТЕ |
Definite
seems to be lexical feature of VERB
. 100% lemmas (937) occur only with one value of Definite
.
NUM
1880 bg-pos/NUM tokens (100% of all NUM
tokens) have a non-empty value of Definite
.
The most frequent other feature values with which NUM
and Definite
co-occurred: NumType=Card (1880; 100%), Number=Plur (1664; 89%), Gender=EMPTY (1416; 75%).
NUM
tokens may have the following values of Definite
:
Def
(132; 7% of non-emptyDefinite
): двамата, двете, двата, тримата, трите, четиримата, Единият, четирите, десетте, еднотоInd
(1748; 93% of non-emptyDefinite
): две, един, 2, два, една, 1, 3, три, едно, 10EMPTY
(3): 02, 08, 2000
Paradigm два | Ind | Def |
---|---|---|
Gender=Masc | два, 2 | двата |
Gender=Fem | две, 2 | двете |
Gender=Neut | две | двете |
2, две | двете |
Definite
seems to be lexical feature of NUM
. 97% lemmas (372) occur only with one value of Definite
.
DET
714 bg-pos/DET tokens (33% of all DET
tokens) have a non-empty value of Definite
.
The most frequent other feature values with which DET
and Definite
co-occurred: Case=EMPTY (556; 78%), Number=Sing (538; 75%), Poss=Yes (465; 65%), PronType=Prs (465; 65%), Person=EMPTY (362; 51%).
DET
tokens may have the following values of Definite
:
Def
(355; 50% of non-emptyDefinite
): нашите, своите, нашата, своя, техните, неговата, своята, своето, неговия, тяхнотоInd
(359; 50% of non-emptyDefinite
): един, една, едно, наши, негово, нищо, своя, нещо, свой, неговEMPTY
(1446): тази, този, тези, това, всички, какво, всеки, всяка, някои, какви
Paradigm един | Ind | Def |
---|---|---|
Case=Nom|Gender=Masc|Number=Sing | един | единият |
Case=Nom|Gender=Fem|Number=Sing | едната | |
Case=Nom|Gender=Neut|Number=Sing | едно | |
Case=Nom|Number=Plur | едните | |
Gender=Masc|Number=Sing | единия | |
Gender=Fem|Number=Sing | една, един | |
Number=Plur | едни |
ADV
451 bg-pos/ADV tokens (8% of all ADV
tokens) have a non-empty value of Definite
.
The most frequent other feature values with which ADV
and Definite
co-occurred: PronType=EMPTY (451; 100%), Degree=Pos (418; 93%).
ADV
tokens may have the following values of Definite
:
Def
(39; 9% of non-emptyDefinite
): повечето, малкото, Най-малкото, многото, по-малкотоInd
(412; 91% of non-emptyDefinite
): много, повече, малко, по-малко, най-много, най-малко, Многая, Повечко, немалкоEMPTY
(5436): още, вчера, само, вече, когато, защото, обаче, как, сега, така
Paradigm много | Ind | Def |
---|---|---|
Degree=Pos | много, Многая | многото |
Degree=Sup | най-много |
AUX
211 bg-pos/AUX tokens (3% of all AUX
tokens) have a non-empty value of Definite
.
The most frequent other feature values with which AUX
and Definite
co-occurred: Mood=Ind (211; 100%), Voice=Act (211; 100%), Aspect=Imp (211; 100%), Person=EMPTY (211; 100%), Tense=EMPTY (211; 100%), VerbForm=Part (211; 100%), Number=Sing (154; 73%).
AUX
tokens may have the following values of Definite
:
Ind
(211; 100% of non-emptyDefinite
): бил, били, била, билоEMPTY
(7691): да, е, ще, са, бе, бъде, беше, бяха, съм, бъдат
PRON
108 bg-pos/PRON tokens (1% of all PRON
tokens) have a non-empty value of Definite
.
The most frequent other feature values with which PRON
and Definite
co-occurred: Poss=EMPTY (108; 100%), Person=EMPTY (108; 100%), Reflex=EMPTY (108; 100%), Case=Nom (101; 94%), Number=Sing (84; 78%), Gender=Neut (80; 74%), PronType=Ind (65; 60%).
PRON
tokens may have the following values of Definite
:
Def
(24; 22% of non-emptyDefinite
): нещата, всичкото, Единият, Едната, нищотоInd
(84; 78% of non-emptyDefinite
): нищо, нещо, няколко, един, едно, неколцинаEMPTY
(9005): се, си, това, той, му, които, го, ни, те, който
Paradigm никой | Ind | Def |
---|---|---|
нищо | нищото |
ADP
1 bg-pos/ADP tokens (0% of all ADP
tokens) have a non-empty value of Definite
.
ADP
tokens may have the following values of Definite
:
Ind
(1; 100% of non-emptyDefinite
): сравнениеEMPTY
(19858): на, в, за, от, с, по, до, след, като, през
Relations with Agreement in Definite
The 10 most frequent relations where parent and child node agree in Definite
:
NOUN –[amod]–> ADJ (5207; 51%),
NOUN –[nmod]–> NOUN (4375; 51%),
NOUN –[conj]–> NOUN (1640; 83%),
NOUN –[nmod]–> PROPN (1554; 53%),
PROPN –[flat]–> PROPN (1418; 96%),
PROPN –[conj]–> PROPN (539; 99%),
ADJ –[obl]–> NOUN (340; 55%),
PROPN –[nmod]–> PROPN (296; 97%),
ADJ –[conj]–> ADJ (296; 91%),
PROPN –[nmod]–> NOUN (277; 90%).
Definite in other languages: [am] [ar] [bg] [bxr] [ca] [ckb] [cop] [cu] [da] [de] [el] [en] [es] [et] [eu] [fa] [fo] [fr] [ga] [gl] [got] [grc] [he] [hi] [hr] [hu] [id] [it] [ja] [kk] [kmr] [ko] [la] [lv] [mr] [nl] [no] [pl] [pt] [ro] [sa] [sk] [sla] [sl] [so] [sr] [sv] [swl] [ta] [tr] [u] [ug] [uk] [ur] [urj] [vi] [yue] [zh]