
This page still pertains to UD version 1.

ADJ: adjective

Definition

Adjectives are words that typically modify nouns and specify their properties or attributes. They may also function as predicates, as in:

Example: [bg] Колата е зелена / Kolata e zelena (The car is green.)

The ADJ tag is intended for ordinary adjectives only. See DET for determiners and NUM for numerals.

In Bulgarian, the following kinds of words from the BulTreeBank tagset map to the ADJ tag:

Example: [bg] добър / dobar (good), 7-годишен / 7-godishen (seven-year-old)

Example: [bg] Иванова книга / Ivanova kniga (Ivan’s book)

Example: [bg] втори / vtori (second)

Example: [bg] идващ / idvasht (coming)

Example: [bg] намерен / nameren (found)

Example: [bg] направил / napravil (made)

Note that the symbol '#', used in the Universal POS section, is a placeholder for an arbitrary number of features. These features are suppressed in the respective tag because they are irrelevant when the BulTreeBank tagset is mapped to the Universal one.


Treebank Statistics (UD_Bulgarian)

There are 2958 ADJ lemmas (20%), 5880 ADJ types (23%) and 12214 ADJ tokens (9%). Out of 16 observed tags, the rank of ADJ is: 2 in number of lemmas, 3 in number of types and 5 in number of tokens.
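The lemma, type and token counts above can be reproduced in principle from a CoNLL-U file. The following is a minimal sketch, not the script that generated this page. It assumes that a "type" is a distinct, case-sensitive word form, and that multiword ranges and empty nodes are skipped. The function names `count_upos` and `row` and the three sample sentences are hypothetical and hand-made for illustration; they are not taken from UD_Bulgarian.

```python
def count_upos(conllu_text, upos="ADJ"):
    """Return (n_lemmas, n_types, n_tokens) for one UPOS tag."""
    lemmas, types, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        if not line.strip() or line.startswith("#"):
            continue  # blank sentence separator or sentence-level comment
        cols = line.split("\t")
        # Skip malformed rows, multiword ranges ("1-2") and empty nodes ("1.1").
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        form, lemma, tag = cols[1], cols[2], cols[3]
        if tag == upos:
            lemmas.add(lemma)
            types.add(form)  # assumption: types are case-sensitive forms
            tokens += 1
    return len(lemmas), len(types), tokens


def row(idx, form, lemma, upos, head, deprel):
    """Build one 10-column CoNLL-U row; unused columns are left as '_'."""
    return "\t".join(
        [str(idx), form, lemma, upos, "_", "_", str(head), deprel, "_", "_"]
    )


# Hand-made sample (illustrative only, not treebank data).
SAMPLE = "\n".join([
    "# text = Колата е зелена .",
    row(1, "Колата", "кола", "NOUN", 3, "nsubj"),
    row(2, "е", "съм", "AUX", 3, "cop"),
    row(3, "зелена", "зелен", "ADJ", 0, "root"),
    row(4, ".", ".", "PUNCT", 3, "punct"),
    "",
    "# text = Добрата кола е зелена .",
    row(1, "Добрата", "добър", "ADJ", 2, "amod"),
    row(2, "кола", "кола", "NOUN", 4, "nsubj"),
    row(3, "е", "съм", "AUX", 4, "cop"),
    row(4, "зелена", "зелен", "ADJ", 0, "root"),
    row(5, ".", ".", "PUNCT", 4, "punct"),
    "",
    "# text = добра кола",
    row(1, "добра", "добър", "ADJ", 2, "amod"),
    row(2, "кола", "кола", "NOUN", 0, "root"),
    "",
])

n_lemmas, n_types, n_tokens = count_upos(SAMPLE)
print(n_lemmas, n_types, n_tokens)
```

On this sample, the two lemmas зелен and добър surface as three distinct forms across four ADJ tokens.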

The 10 most frequent ADJ lemmas: нов, друг, голям, български, народен, пръв, държавен, цял, европейски, втори

The 10 most frequent ADJ types: народното, други, българската, другите, нова, нови, последните, 2001, друг, европейската

The 10 most frequent ambiguous lemmas: нов (ADJ 271, PROPN 3), голям (ADJ 215, PROPN 1), български (ADJ 201, ADV 2), европейски (ADJ 109, ADV 1), политически (ADJ 101, ADV 3), икономически (ADJ 49, ADV 2), следвам (ADJ 49, VERB 12), мина-(се) (ADJ 45, VERB 30), стар (ADJ 42, PROPN 4), син (ADJ 37, NOUN 18)

The 10 most frequent ambiguous types: български (ADJ 29, ADV 2), 2000 (ADJ 33, NUM 11, PROPN 4), политически (ADJ 30, ADV 3), 1 (NUM 46, ADJ 27, PROPN 1), II (ADJ 16, PROPN 1), останалите (ADJ 7, VERB 1), Южна (ADJ 14, PROPN 1), европейски (ADJ 12, ADV 1), свързани (ADJ 12, VERB 3), 15 (NUM 27, ADJ 11)

Morphology

The form / lemma ratio of ADJ is 1.987830 (the average of all parts of speech is 1.709615).
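As a quick check, the ratio appears to be the number of ADJ types divided by the number of ADJ lemmas reported in the Treebank Statistics section. That reading is an assumption, but the arithmetic agrees with the figure above:

```python
# Counts taken from the Treebank Statistics section of this page.
adj_types = 5880
adj_lemmas = 2958

# Assumption: "form / lemma ratio" means distinct forms (types) per lemma.
ratio = adj_types / adj_lemmas
print(f"{ratio:.6f}")
```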

The 1st highest number of forms (24) was observed with the lemma “голям”: големи, големите, големия, големият, голям, голяма, голямата, голямо, най-големи, най-големите, най-големия, най-големият, най-голям, най-голяма, най-голямата, най-голямо, най-голямото, по-големи, по-големите, по-голям, по-голяма, по-голямата, по-голямо, по-голямото.

The 2nd highest number of forms (21) was observed with the lemma “добър”: Добрата, Добрият, Най-добра, добра, добри, добрите, добро, доброто, добър, най-добрата, най-добри, най-добрите, най-добрия, най-добрият, най-доброто, най-добър, по-добра, по-добри, по-добрият, по-добро, по-добър.

The 3rd highest number of forms (16) was observed with the lemma “висок”: висок, висока, високата, високи, високите, високия, високо, високото, най-висок, най-високата, най-високите, най-високо, по-висок, по-висока, по-високи, по-високите.

ADJ occurs with 10 features: bg-feat/Number (12133; 99% instances), bg-feat/Definite (12110; 99% instances), bg-feat/Degree (11927; 98% instances), bg-feat/Gender (8547; 70% instances), bg-feat/Aspect (1335; 11% instances), bg-feat/VerbForm (1335; 11% instances), bg-feat/Voice (1335; 11% instances), bg-feat/NumType (818; 7% instances), bg-feat/Tense (468; 4% instances), bg-feat/Case (23; 0% instances)

ADJ occurs with 19 feature-value pairs: Aspect=Imp, Aspect=Perf, Case=Voc, Definite=Def, Definite=Ind, Degree=Cmp, Degree=Pos, Degree=Sup, Gender=Fem, Gender=Masc, Gender=Neut, NumType=Ord, Number=Plur, Number=Sing, Tense=Past, Tense=Pres, VerbForm=Part, Voice=Act, Voice=Pass

ADJ occurs with 98 feature combinations. The most frequent feature combination is Definite=Ind|Degree=Pos|Number=Plur (1649 tokens). Examples: други, нови, различни, големи, български, народни, добри, политически, военни, подобни
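Feature statistics of this kind come from the FEATS column, where feature-value pairs are joined with `|`. The sketch below shows one way to tally features and feature combinations. The helper `parse_feats` and the four FEATS strings are hypothetical examples built from the feature-value pairs listed above, not actual treebank rows.

```python
from collections import Counter


def parse_feats(feats):
    """Split a CoNLL-U FEATS string into a {feature: value} dict."""
    if feats == "_":  # "_" marks an empty feature set
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


# Hand-made FEATS values for four ADJ tokens (illustrative only).
adj_feats = [
    "Definite=Ind|Degree=Pos|Number=Plur",
    "Definite=Def|Degree=Pos|Gender=Fem|Number=Sing",
    "Definite=Ind|Degree=Pos|Number=Plur",
    "Definite=Ind|Degree=Cmp|Gender=Masc|Number=Sing",
]

# How many tokens carry each feature, regardless of its value.
feature_counts = Counter(name for f in adj_feats for name in parse_feats(f))

# How many tokens carry each feature-value pair.
pair_counts = Counter(
    f"{name}={value}" for f in adj_feats for name, value in parse_feats(f).items()
)

# How many tokens carry each full combination.
combo_counts = Counter(adj_feats)

print(feature_counts["Gender"], pair_counts["Degree=Pos"])
print(combo_counts.most_common(1))
```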

Relations

ADJ nodes are attached to their parents using 22 different relations: bg-dep/amod (10635; 87% instances), bg-dep/conj (406; 3% instances), bg-dep/root (359; 3% instances), bg-dep/obj (176; 1% instances), bg-dep/nsubj (147; 1% instances), bg-dep/nmod (129; 1% instances), bg-dep/ccomp (93; 1% instances), bg-dep/iobj (54; 0% instances), bg-dep/obl (44; 0% instances), bg-dep/acl (42; 0% instances), bg-dep/advcl (42; 0% instances), bg-dep/xcomp (18; 0% instances), bg-dep/parataxis (16; 0% instances), bg-dep/csubj (13; 0% instances), bg-dep/flat (13; 0% instances), bg-dep/nsubj:pass (12; 0% instances), bg-dep/discourse (6; 0% instances), bg-dep/csubj:pass (3; 0% instances), bg-dep/vocative (3; 0% instances), bg-dep/compound (1; 0% instances), bg-dep/nummod (1; 0% instances), bg-dep/orphan (1; 0% instances)

Parents of ADJ nodes belong to 10 different parts of speech: NOUN (10534; 86% instances), VERB (632; 5% instances), ROOT (359; 3% instances), ADJ (339; 3% instances), PROPN (289; 2% instances), NUM (21; 0% instances), DET (16; 0% instances), ADV (11; 0% instances), PRON (9; 0% instances), PART (4; 0% instances)
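The relation and parent statistics can be tallied from the HEAD and DEPREL columns. The following sketch assumes each sentence is already reduced to `(id, upos, head, deprel)` tuples and, like the figures above, treats the artificial root (head 0) as a parent of type ROOT. The function `adj_attachments` and both sentences are hypothetical illustrations.

```python
from collections import Counter

# Hand-made sentences as lists of (id, upos, head, deprel); head 0 is the root.
sentences = [
    # Колата е зелена .
    [(1, "NOUN", 3, "nsubj"), (2, "AUX", 3, "cop"),
     (3, "ADJ", 0, "root"), (4, "PUNCT", 3, "punct")],
    # добра кола
    [(1, "ADJ", 2, "amod"), (2, "NOUN", 0, "root")],
]


def adj_attachments(sentences, upos="ADJ"):
    """Count incoming relations and parent POS tags for nodes tagged `upos`."""
    relations, parent_pos = Counter(), Counter()
    for sent in sentences:
        pos_by_id = {idx: tag for idx, tag, _, _ in sent}
        pos_by_id[0] = "ROOT"  # artificial root node
        for _, tag, head, deprel in sent:
            if tag == upos:
                relations[deprel] += 1
                parent_pos[pos_by_id[head]] += 1
    return relations, parent_pos


relations, parent_pos = adj_attachments(sentences)
print(dict(relations))
print(dict(parent_pos))
```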

9690 (79%) ADJ nodes are leaves.

1336 (11%) ADJ nodes have one child.

406 (3%) ADJ nodes have two children.

782 (6%) ADJ nodes have three or more children.

The highest child degree of an ADJ node is 8.
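The child degree of a node is the number of tokens whose HEAD points to it. The sketch below groups ADJ nodes into the same buckets as above (leaves, one child, two children, three or more). The names `child_degrees` and `bucket` and the sample sentences are assumptions made for illustration.

```python
from collections import Counter

# Hand-made sentences as lists of (id, upos, head); head 0 is the root.
sentences = [
    # Колата е зелена .  (the predicate adjective heads three tokens)
    [(1, "NOUN", 3), (2, "AUX", 3), (3, "ADJ", 0), (4, "PUNCT", 3)],
    # добра кола  (the attributive adjective is a leaf)
    [(1, "ADJ", 2), (2, "NOUN", 0)],
]


def child_degrees(sentences, upos="ADJ"):
    """Return the number of children of every node tagged `upos`."""
    degrees = []
    for sent in sentences:
        n_children = Counter(head for _, _, head in sent)
        degrees.extend(n_children[idx] for idx, tag, _ in sent if tag == upos)
    return degrees


def bucket(degree):
    """Map a child degree to the label '0', '1', '2' or '3+'."""
    return "3+" if degree >= 3 else str(degree)


degrees = child_degrees(sentences)
buckets = Counter(bucket(d) for d in degrees)
print(sorted(degrees), dict(buckets), max(degrees))
```

The four buckets reported above sum to the total number of ADJ tokens: 9690 + 1336 + 406 + 782 = 12214.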

Children of ADJ nodes are attached using 29 different relations: bg-dep/punct (1142; 21% instances), bg-dep/obl (777; 14% instances), bg-dep/advmod (600; 11% instances), bg-dep/cop (589; 11% instances), bg-dep/nsubj (486; 9% instances), bg-dep/conj (434; 8% instances), bg-dep/det (346; 6% instances), bg-dep/cc (333; 6% instances), bg-dep/case (255; 5% instances), bg-dep/mark (98; 2% instances), bg-dep/aux (88; 2% instances), bg-dep/advcl (55; 1% instances), bg-dep/expl (53; 1% instances), bg-dep/discourse (29; 1% instances), bg-dep/aux:pass (23; 0% instances), bg-dep/iobj (22; 0% instances), bg-dep/acl (19; 0% instances), bg-dep/nsubj:pass (19; 0% instances), bg-dep/flat (13; 0% instances), bg-dep/obj (4; 0% instances), bg-dep/vocative (3; 0% instances), bg-dep/amod (1; 0% instances), bg-dep/appos (1; 0% instances), bg-dep/ccomp (1; 0% instances), bg-dep/csubj (1; 0% instances), bg-dep/nmod (1; 0% instances), bg-dep/nummod (1; 0% instances), bg-dep/orphan (1; 0% instances), bg-dep/parataxis (1; 0% instances)

Children of ADJ nodes belong to 15 different parts of speech: PUNCT (1142; 21% instances), NOUN (1030; 19% instances), AUX (677; 13% instances), PRON (608; 11% instances), ADV (542; 10% instances), ADJ (339; 6% instances), CCONJ (333; 6% instances), ADP (255; 5% instances), VERB (167; 3% instances), PROPN (109; 2% instances), PART (90; 2% instances), SCONJ (88; 2% instances), INTJ (8; 0% instances), DET (5; 0% instances), NUM (3; 0% instances)


ADJ in other languages: [am] [ar] [bg] [bxr] [ca] [ckb] [cop] [cs] [cu] [da] [de] [el] [en] [es] [et] [eu] [fa] [fi] [fo] [fr] [ga] [gl] [got] [grc] [he] [hi] [hr] [hu] [id] [it] [ja] [kk] [kmr] [ko] [la] [lv] [mr] [nl] [no] [pl] [pt] [ro] [ru] [sa] [sk] [sla] [sl] [so] [sr] [sv] [swl] [ta] [tr] [ug] [uk] [u] [urj] [ur] [vi] [yue] [zh]