home bg/pos edit page issue tracker

This page still pertains to UD version 1.

ADV: adverb

Definition

In the group of Bulgarian adverbs there are words that typically modify verbs for such categories as time, place, direction or manner. They may also modify adjectives and other adverbs, as in very briefly or arguably wrong. Some adverbs can modify even [nouns] (Noun).

In BulTreeBank tagset the corresponding POS tag is D.

There is a closed subclass of pronominal adverbs that refer to circumstances in context, rather than naming them directly; similarly to pronouns, these can be categorized as interrogative, relative, demonstrative etc. Pronominal adverbs also get the ADV part-of-speech tag but they are differentiated by additional features.

In the BulTreeBank tagset the corresponding tags are as follows:

Examples

Note that there are words that may be traditionally called numerals in some languages (e.g. Bulgarian) but they are treated as adverbs in the universal tagging scheme. In particular, adverbial ordinal numerals ([bg] първо / parvo “for the first time”) are tagged ADV. The mapped tags present the neuter singular indefinite forms of the ordinal numerals: Monsi. In this way there will be ambiguity with the class of [adjectives] (ADJ).

Another adverbial numeral that goes under ADV is Md#:

Examples

Note that the symbol `#’, used in the Universal POS section indicates a holder for arbitrary number of features, suppressed in the respective tag as irrelevant in the BulTreeBank tagset, when mapped to the Universal one.


Treebank Statistics (UD_Bulgarian)

There are 635 ADV lemmas (4%), 726 ADV types (3%) and 5887 ADV tokens (4%). Out of 16 observed tags, the rank of ADV is: 5 in number of lemmas, 5 in number of types and 9 in number of tokens.

The 10 most frequent ADV lemmas: много, още, вчера, само, така, когато, вече, защото, обаче, там

The 10 most frequent ADV types: още, много, вчера, само, вече, когато, защото, обаче, как, сега

The 10 most frequent ambiguous lemmas: обаче (ADV 145, CCONJ 1), сега (ADV 118, PROPN 3), малко (ADV 74, PROPN 1), следобед (ADV 8, NOUN 1), независимо (ADV 7, ADP 1), случайно (ADV 6, ADJ 1), учудвам-(се) (ADV 4, VERB 1), истински (ADJ 25, ADV 3), политически (ADJ 101, ADV 3), преди (ADP 153, ADV 3)

The 10 most frequent ambiguous types: само (ADV 170, ADJ 1), ясно (ADV 43, ADJ 7), малко (ADV 41, ADJ 1), особено (ADV 24, ADJ 3), достатъчно (ADV 21, ADJ 1), бързо (ADV 23, ADJ 7), просто (ADV 14, ADJ 1), очевидно (ADV 11, ADJ 1), лично (ADV 14, ADJ 7), възможно (ADV 12, ADJ 10)

Morphology

The form / lemma ratio of ADV is 1.143307 (the average of all parts of speech is 1.709615).

The 1st highest number of forms (9) was observed with the lemma “там”: По-нататък, дотам, дотук, нататък, оттам, оттук, там, тук, тука.

The 2nd highest number of forms (6) was observed with the lemma “малко”: Най-малкото, малко, малкото, най-малко, по-малко, по-малкото.

The 3rd highest number of forms (5) was observed with the lemma “къде”: где, докъде, къде, накъде, откъде.

ADV occurs with 5 features: bg-feat/Degree (4577; 78% instances), bg-feat/PronType (1100; 19% instances), bg-feat/NumType (558; 9% instances), bg-feat/Definite (451; 8% instances), bg-feat/Number (451; 8% instances)

ADV occurs with 13 feature-value pairs: Definite=Def, Definite=Ind, Degree=Cmp, Degree=Pos, Degree=Sup, NumType=Card, Number=Plur, PronType=Dem, PronType=Ind, PronType=Int, PronType=Neg, PronType=Rel, PronType=Tot

ADV occurs with 20 feature combinations. The most frequent feature combination is Degree=Pos (3948 tokens). Examples: още, вчера, само, вече, обаче, сега, много, все, днес, също

Relations

ADV nodes are attached to their parents using 20 different relations: bg-dep/advmod (5117; 87% instances), bg-dep/root (208; 4% instances), bg-dep/obj (199; 3% instances), bg-dep/conj (89; 2% instances), bg-dep/mark (67; 1% instances), bg-dep/cc (60; 1% instances), bg-dep/ccomp (35; 1% instances), bg-dep/fixed (25; 0% instances), bg-dep/advcl (19; 0% instances), bg-dep/nsubj (19; 0% instances), bg-dep/acl (12; 0% instances), bg-dep/xcomp (7; 0% instances), bg-dep/obl (6; 0% instances), bg-dep/parataxis (6; 0% instances), bg-dep/iobj (5; 0% instances), bg-dep/csubj (4; 0% instances), bg-dep/nmod (4; 0% instances), bg-dep/nsubj:pass (3; 0% instances), bg-dep/amod (1; 0% instances), bg-dep/goeswith (1; 0% instances)

Parents of ADV nodes belong to 13 different parts of speech: VERB (3595; 61% instances), NOUN (918; 16% instances), ADJ (542; 9% instances), ADV (412; 7% instances), ROOT (208; 4% instances), ADP (51; 1% instances), DET (45; 1% instances), NUM (42; 1% instances), PRON (36; 1% instances), PROPN (29; 0% instances), CCONJ (5; 0% instances), INTJ (2; 0% instances), PART (2; 0% instances)

4815 (82%) ADV nodes are leaves.

644 (11%) ADV nodes have one child.

138 (2%) ADV nodes have two children.

290 (5%) ADV nodes have three or more children.

The highest child degree of a ADV node is 8.

Children of ADV nodes are attached using 21 different relations: bg-dep/punct (435; 21% instances), bg-dep/advmod (387; 19% instances), bg-dep/cop (272; 13% instances), bg-dep/obl (248; 12% instances), bg-dep/fixed (151; 7% instances), bg-dep/csubj (118; 6% instances), bg-dep/conj (96; 5% instances), bg-dep/nsubj (96; 5% instances), bg-dep/cc (95; 5% instances), bg-dep/case (39; 2% instances), bg-dep/discourse (35; 2% instances), bg-dep/mark (30; 1% instances), bg-dep/aux (28; 1% instances), bg-dep/advcl (27; 1% instances), bg-dep/expl (10; 0% instances), bg-dep/iobj (8; 0% instances), bg-dep/obj (3; 0% instances), bg-dep/acl (2; 0% instances), bg-dep/det (2; 0% instances), bg-dep/vocative (2; 0% instances), bg-dep/appos (1; 0% instances)

Children of ADV nodes belong to 15 different parts of speech: PUNCT (435; 21% instances), ADV (412; 20% instances), AUX (300; 14% instances), NOUN (287; 14% instances), VERB (152; 7% instances), CCONJ (135; 6% instances), SCONJ (102; 5% instances), PRON (78; 4% instances), PART (75; 4% instances), ADP (40; 2% instances), INTJ (26; 1% instances), PROPN (25; 1% instances), ADJ (11; 1% instances), DET (4; 0% instances), NUM (3; 0% instances)


ADV in other languages: [am] [ar] [bg] [bxr] [ca] [ckb] [cop] [cs] [cu] [da] [de] [el] [en] [es] [et] [eu] [fa] [fi] [fo] [fr] [ga] [gl] [got] [grc] [he] [hi] [hr] [hu] [id] [it] [ja] [kk] [kmr] [ko] [la] [lv] [mr] [nl] [no] [pl] [pt] [ro] [ru] [sa] [sk] [sla] [sl] [so] [sr] [sv] [swl] [ta] [tr] [ug] [uk] [u] [urj] [ur] [vi] [yue] [zh]