Definite: definiteness or state
Definiteness is typically a feature of nouns, adjectives and articles. Its value distinguishes whether we are talking about something known and concrete, or something general or unknown. It can be marked on definite and indefinite articles, or directly on nouns, adjectives etc.
In Bulgarian there are definite and indefinite articles. The definite article is part of the word, in postposition (жената / zhenata ‘woman-the’ (the woman))). The indefinite articles can be: the form един / edin (one) or the zero marker.
However, when added to a nominal phrase, the articles become phrasal affixes, i.e. Bulgarian does not have agreement is definiteness. For example, хубавата висока руса жена / hubavata visoka rusa zhena ‘pretty-the tall blond woman’ (the pretty tall blond woman).
Ind: indefinite
Examples
- Видях една жена да минава по улицата / Vidyah edna zhena da minava po ulitsata “I saw a woman walking on the street”
- Видях жена да минава по улицата / Vidyah zhena da minava po ulitsata “I saw a woman walking on the street”
Def: definite
Examples
- Видях жената да минава по улицата / Vidyah zhenata da minava po ulitsata “I saw the woman walking on the street”
Treebank Statistics (UD_Bulgarian)
This feature is universal.
It occurs with 2 different values: Def, Ind.
55105 tokens (39%) have a non-empty value of Definite.
20921 types (84%) occur at least once with a non-empty value of Definite.
11797 lemmas (83%) occur at least once with a non-empty value of Definite.
The feature is used with 10 part-of-speech tags: bg-pos/NOUN (29645; 21% instances), bg-pos/ADJ (12110; 9% instances), bg-pos/PROPN (7561; 5% instances), bg-pos/VERB (2424; 2% instances), bg-pos/NUM (1880; 1% instances), bg-pos/DET (714; 1% instances), bg-pos/ADV (451; 0% instances), bg-pos/AUX (211; 0% instances), bg-pos/PRON (108; 0% instances), bg-pos/ADP (1; 0% instances).
NOUN
29645 bg-pos/NOUN tokens (97% of all NOUN tokens) have a non-empty value of Definite.
The most frequent other feature values with which NOUN and Definite co-occurred: Number=Sing (21440; 72%).
NOUN tokens may have the following values of Definite:
Def(11019; 37% of non-emptyDefinite): хората, страната, президентът, края, правителството, държавата, закона, шефът, началото, парламентаInd(18626; 63% of non-emptyDefinite): г., време, година, години, част, хора, път, събрание, страна, съветEMPTY(1021): %, лв., млн., $, месеца, дни, лева, млрд., пъти, часа
| Paradigm година | Ind | Def |
|---|---|---|
| Number=Sing | г., година | годината |
| Number=Plur | г., години | годините |
ADJ
12110 bg-pos/ADJ tokens (99% of all ADJ tokens) have a non-empty value of Definite.
The most frequent other feature values with which ADJ and Definite co-occurred: Degree=Pos (11469; 95%), Voice=EMPTY (10775; 89%), VerbForm=EMPTY (10775; 89%), Aspect=EMPTY (10775; 89%), Number=Sing (8524; 70%).
ADJ tokens may have the following values of Definite:
Def(5415; 45% of non-emptyDefinite): народното, българската, другите, последните, европейската, националната, цялата, миналата, новия, новитеInd(6695; 55% of non-emptyDefinite): други, нова, 2001, нови, друг, български, голяма, 2000, различни, 1EMPTY(104): т.нар., US, Нови, уважаеми, жп, държавен, народна, политически, важен, военноморско
| Paradigm нов | Ind | Def |
|---|---|---|
| Degree=Pos|Gender=Masc|Number=Sing | нов | новия, новият |
| Degree=Pos|Gender=Fem|Number=Sing | нова | новата |
| Degree=Pos|Gender=Neut|Number=Sing | ново | новото |
| Degree=Pos|Number=Plur | нови | новите |
| Degree=Sup|Gender=Masc|Number=Sing | най-новият | |
| Degree=Sup|Gender=Fem|Number=Sing | най-новата | |
| Degree=Sup|Gender=Neut|Number=Sing | Най-новото | |
| Degree=Sup|Number=Plur | най-нови |
PROPN
7561 bg-pos/PROPN tokens (99% of all PROPN tokens) have a non-empty value of Definite.
The most frequent other feature values with which PROPN and Definite co-occurred: Number=Sing (7427; 98%), Gender=Masc (4705; 62%).
PROPN tokens may have the following values of Definite:
Def(260; 3% of non-emptyDefinite): Балканите, СДС, ЕС, Конституцията, НИС, Дяволът, Евросъюза, ЦСКА, Евролевицата, САЩInd(7301; 97% of non-emptyDefinite): България, София, Иван, Европа, Петър, ЕС, Стоянов, СДС, Костов, ГеоргиEMPTY(69): де, ван, -, 2000, Р-300, ал, дела, ди, дьо, 173
| Paradigm европа | Ind | Def |
|---|---|---|
| Европа | Европата |
Definite seems to be lexical feature of PROPN. 98% lemmas (2716) occur only with one value of Definite.
VERB
2424 bg-pos/VERB tokens (16% of all VERB tokens) have a non-empty value of Definite.
The most frequent other feature values with which VERB and Definite co-occurred: VerbForm=Part (2424; 100%), Person=EMPTY (2424; 100%), Mood=EMPTY (2409; 99%), Aspect=Perf (1842; 76%), Number=Sing (1663; 69%), Voice=Act (1445; 60%), Tense=Past (1223; 50%).
VERB tokens may have the following values of Definite:
Def(5; 0% of non-emptyDefinite): останалите, Останалото, Уволнените, отличенитеInd(2419; 100% of non-emptyDefinite): имало, дал, заминал, искал, направил, направили, трябвало, имал, станал, станалоEMPTY(13068): има, може, няма, трябва, е, каза, могат, съобщи, заяви, стана
| Paradigm остана | Ind | Def |
|---|---|---|
| Degree=Pos|Number=Plur | останалите | |
| Gender=Masc|Number=Sing | останал | |
| Gender=Fem|Number=Sing | останала | |
| Gender=Neut|Number=Sing | Останалото | |
| Number=Plur | останали | ОСТАНАЛИТЕ |
Definite seems to be lexical feature of VERB. 100% lemmas (937) occur only with one value of Definite.
NUM
1880 bg-pos/NUM tokens (100% of all NUM tokens) have a non-empty value of Definite.
The most frequent other feature values with which NUM and Definite co-occurred: NumType=Card (1880; 100%), Number=Plur (1664; 89%), Gender=EMPTY (1416; 75%).
NUM tokens may have the following values of Definite:
Def(132; 7% of non-emptyDefinite): двамата, двете, двата, тримата, трите, четиримата, Единият, четирите, десетте, еднотоInd(1748; 93% of non-emptyDefinite): две, един, 2, два, една, 1, 3, три, едно, 10EMPTY(3): 02, 08, 2000
| Paradigm два | Ind | Def |
|---|---|---|
| Gender=Masc | два, 2 | двата |
| Gender=Fem | две, 2 | двете |
| Gender=Neut | две | двете |
| 2, две | двете |
Definite seems to be lexical feature of NUM. 97% lemmas (372) occur only with one value of Definite.
DET
714 bg-pos/DET tokens (33% of all DET tokens) have a non-empty value of Definite.
The most frequent other feature values with which DET and Definite co-occurred: Case=EMPTY (556; 78%), Number=Sing (538; 75%), Poss=Yes (465; 65%), PronType=Prs (465; 65%), Person=EMPTY (362; 51%).
DET tokens may have the following values of Definite:
Def(355; 50% of non-emptyDefinite): нашите, своите, нашата, своя, техните, неговата, своята, своето, неговия, тяхнотоInd(359; 50% of non-emptyDefinite): един, една, едно, наши, негово, нищо, своя, нещо, свой, неговEMPTY(1446): тази, този, тези, това, всички, какво, всеки, всяка, някои, какви
| Paradigm един | Ind | Def |
|---|---|---|
| Case=Nom|Gender=Masc|Number=Sing | един | единият |
| Case=Nom|Gender=Fem|Number=Sing | едната | |
| Case=Nom|Gender=Neut|Number=Sing | едно | |
| Case=Nom|Number=Plur | едните | |
| Gender=Masc|Number=Sing | единия | |
| Gender=Fem|Number=Sing | една, един | |
| Number=Plur | едни |
ADV
451 bg-pos/ADV tokens (8% of all ADV tokens) have a non-empty value of Definite.
The most frequent other feature values with which ADV and Definite co-occurred: PronType=EMPTY (451; 100%), Degree=Pos (418; 93%).
ADV tokens may have the following values of Definite:
Def(39; 9% of non-emptyDefinite): повечето, малкото, Най-малкото, многото, по-малкотоInd(412; 91% of non-emptyDefinite): много, повече, малко, по-малко, най-много, най-малко, Многая, Повечко, немалкоEMPTY(5436): още, вчера, само, вече, когато, защото, обаче, как, сега, така
| Paradigm много | Ind | Def |
|---|---|---|
| Degree=Pos | много, Многая | многото |
| Degree=Sup | най-много |
AUX
211 bg-pos/AUX tokens (3% of all AUX tokens) have a non-empty value of Definite.
The most frequent other feature values with which AUX and Definite co-occurred: Mood=Ind (211; 100%), Voice=Act (211; 100%), Aspect=Imp (211; 100%), Person=EMPTY (211; 100%), Tense=EMPTY (211; 100%), VerbForm=Part (211; 100%), Number=Sing (154; 73%).
AUX tokens may have the following values of Definite:
Ind(211; 100% of non-emptyDefinite): бил, били, била, билоEMPTY(7691): да, е, ще, са, бе, бъде, беше, бяха, съм, бъдат
PRON
108 bg-pos/PRON tokens (1% of all PRON tokens) have a non-empty value of Definite.
The most frequent other feature values with which PRON and Definite co-occurred: Poss=EMPTY (108; 100%), Person=EMPTY (108; 100%), Reflex=EMPTY (108; 100%), Case=Nom (101; 94%), Number=Sing (84; 78%), Gender=Neut (80; 74%), PronType=Ind (65; 60%).
PRON tokens may have the following values of Definite:
Def(24; 22% of non-emptyDefinite): нещата, всичкото, Единият, Едната, нищотоInd(84; 78% of non-emptyDefinite): нищо, нещо, няколко, един, едно, неколцинаEMPTY(9005): се, си, това, той, му, които, го, ни, те, който
| Paradigm никой | Ind | Def |
|---|---|---|
| нищо | нищото |
ADP
1 bg-pos/ADP tokens (0% of all ADP tokens) have a non-empty value of Definite.
ADP tokens may have the following values of Definite:
Ind(1; 100% of non-emptyDefinite): сравнениеEMPTY(19858): на, в, за, от, с, по, до, след, като, през
Relations with Agreement in Definite
The 10 most frequent relations where parent and child node agree in Definite:
NOUN –[amod]–> ADJ (5207; 51%),
NOUN –[nmod]–> NOUN (4375; 51%),
NOUN –[conj]–> NOUN (1640; 83%),
NOUN –[nmod]–> PROPN (1554; 53%),
PROPN –[flat]–> PROPN (1418; 96%),
PROPN –[conj]–> PROPN (539; 99%),
ADJ –[obl]–> NOUN (340; 55%),
PROPN –[nmod]–> PROPN (296; 97%),
ADJ –[conj]–> ADJ (296; 91%),
PROPN –[nmod]–> NOUN (277; 90%).
Definite in other languages: [am] [ar] [bg] [bxr] [ca] [ckb] [cop] [cu] [da] [de] [el] [en] [es] [et] [eu] [fa] [fo] [fr] [ga] [gl] [got] [grc] [he] [hi] [hr] [hu] [id] [it] [ja] [kk] [kmr] [ko] [la] [lv] [mr] [nl] [no] [pl] [pt] [ro] [sa] [sk] [sla] [sl] [so] [sr] [sv] [swl] [ta] [tr] [u] [ug] [uk] [ur] [urj] [vi] [yue] [zh]