
This page still pertains to UD version 1.

VERB: verb

Definition

A verb is a member of the syntactic class of words that typically signal events and actions, can constitute a minimal predicate in a clause, and govern the number and types of other constituents which may occur in the clause. Verbs are often associated with grammatical categories like tense, mood, aspect and voice, which can be expressed either inflectionally or by means of auxiliary verbs or particles.

The BulTreeBank annotation scheme maps the following to VERB: main verbs, copulas and modal verbs. Note that modal verbs do not have special labels in our annotation scheme. Participles and gerunds are also considered VERB. The specific labels that map to VERB are given below.

Examples

Note that the present active participle V#car# is mapped only to ADJ.

Note that the symbol '#', used in the Universal POS section, is a placeholder for an arbitrary number of features. These features are suppressed in the respective tag because they are irrelevant when the BulTreeBank tagset is mapped to the Universal one.
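The placeholder convention can be illustrated with a minimal sketch that turns a pattern such as V#car# into a regular expression. The function name `tag_pattern_to_regex` and the two tag strings below are hypothetical and serve only as illustration. The sketch also assumes that '#' may stand for zero features.

```python
import re

def tag_pattern_to_regex(pattern):
    """Compile a tag pattern in which '#' stands for any run of characters."""
    # Escape the literal pieces, then let each '#' match anything (assumed: possibly nothing).
    literal_parts = [re.escape(part) for part in pattern.split("#")]
    return re.compile(".*".join(literal_parts))

participle = tag_pattern_to_regex("V#car#")

# Hypothetical tag strings, used only to exercise the pattern.
matches_participle = participle.fullmatch("Vpitcar-smi") is not None
matches_finite = participle.fullmatch("Vpitf-r3s") is not None
print(matches_participle, matches_finite)
```

Under these assumptions the first tag matches the pattern and the second does not, because it lacks the literal "car" segment.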


Treebank Statistics (UD_Bulgarian)

There are 2644 VERB lemmas (18%), 6130 VERB types (24%) and 15492 VERB tokens (11%). Out of 16 observed tags, the rank of VERB is: 4 in number of lemmas, 2 in number of types and 4 in number of tokens.

The 10 most frequent VERB lemmas: мога, имам, нямам, съм, трябва, кажа, има, искам, стана, съобщя

The 10 most frequent VERB types: има, може, няма, трябва, е, каза, могат, съобщи, заяви, стана

The 10 most frequent ambiguous lemmas: мога (VERB 370, ADJ 1), имам (VERB 323, ADJ 1), съм (AUX 3594, VERB 296), кажа (VERB 206, ADJ 4), искам (VERB 157, ADJ 3), стана (VERB 129, ADJ 3), направя-(се) (VERB 113, ADJ 6), видя-(се) (VERB 86, ADJ 1), дам-(се) (VERB 84, ADJ 4), работя (VERB 84, ADJ 6)

The 10 most frequent ambiguous types: е (AUX 1752, VERB 178), иска (VERB 40, NOUN 1), са (AUX 689, VERB 36), работи (VERB 34, NOUN 17), твърди (VERB 31, ADJ 1), прави (VERB 27, ADJ 5), води (VERB 28, NOUN 1), би (AUX 47, VERB 27), отказа (VERB 22, NOUN 1), мисли (VERB 17, NOUN 4)

Morphology

The form / lemma ratio of VERB is 2.318457 (the average of all parts of speech is 1.709615).
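The ratio above can be reproduced from the counts in the Treebank Statistics section, assuming that "forms" here means distinct word types. A minimal check:

```python
# Counts quoted in the Treebank Statistics section of this page.
verb_lemmas = 2644
verb_types = 6130

# Assumption: the form / lemma ratio is the number of types divided by the number of lemmas.
ratio = verb_types / verb_lemmas
print(f"{ratio:.6f}")  # prints 2.318457
```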

The 1st highest number of forms (21) was observed with the lemma “мога”: Можехме, мога, могат, могла, могли, могло, могъл, можа, можах, можаха, може, можел, можела, можели, можело, можем, можете, можех, можеха, можеш, можеше.

The 2nd highest number of forms (17) was observed with the lemma “взема”: взе, взел, взела, взели, взело, взема, вземат, вземе, вземем, вземете, вземеш, вземи, взета, взети, взето, взех, взеха.

The 3rd highest number of forms (16) was observed with the lemma “видя-(се)”: видели, види, видим, видите, видиш, видя, видял, видяла, видяна, видят, видях, видяха, видяхме, видяхте, виж, вижте.

VERB occurs with 10 features: bg-feat/Aspect (15492; 100% instances), bg-feat/Number (15492; 100% instances), bg-feat/VerbForm (15492; 100% instances), bg-feat/Voice (15241; 98% instances), bg-feat/Tense (14273; 92% instances), bg-feat/Mood (13083; 84% instances), bg-feat/Person (13068; 84% instances), bg-feat/Definite (2424; 16% instances), bg-feat/Gender (1663; 11% instances), bg-feat/Degree (5; 0% instances)

VERB occurs with 24 feature-value pairs: Aspect=Imp, Aspect=Perf, Definite=Def, Definite=Ind, Degree=Cmp, Degree=Pos, Gender=Fem, Gender=Masc, Gender=Neut, Mood=Cnd, Mood=Imp, Mood=Ind, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Tense=Imp, Tense=Past, Tense=Pres, VerbForm=Fin, VerbForm=Part, Voice=Act, Voice=Pass

VERB occurs with 70 feature combinations. The most frequent feature combination is Aspect=Imp|Mood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin|Voice=Act (3682 tokens). Examples: има, може, няма, трябва, е, става, иска, дава, работи, разбира
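A feature combination is a list of Feature=Value pairs separated by "|". The sketch below splits the most frequent combination into a dictionary. The helper name `parse_feats` is hypothetical, and treating "_" as "no features" follows the CoNLL-U convention.

```python
def parse_feats(feats):
    """Split a FEATS string such as 'Aspect=Imp|Mood=Ind' into a dict."""
    if feats == "_":  # CoNLL-U marks an empty feature set with an underscore
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))

combo = "Aspect=Imp|Mood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin|Voice=Act"
features = parse_feats(combo)
print(features["Tense"], len(features))
```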

Relations

VERB nodes are attached to their parents using 18 different relations: bg-dep/root (8088; 52% instances), bg-dep/ccomp (1688; 11% instances), bg-dep/conj (1672; 11% instances), bg-dep/advcl (1325; 9% instances), bg-dep/acl (1296; 8% instances), bg-dep/xcomp (643; 4% instances), bg-dep/parataxis (355; 2% instances), bg-dep/csubj (290; 2% instances), bg-dep/csubj:pass (60; 0% instances), bg-dep/fixed (33; 0% instances), bg-dep/aux:pass (24; 0% instances), bg-dep/det (4; 0% instances), bg-dep/discourse (3; 0% instances), bg-dep/nsubj (3; 0% instances), bg-dep/obj (3; 0% instances), bg-dep/nmod (2; 0% instances), bg-dep/obl (2; 0% instances), bg-dep/nsubj:pass (1; 0% instances)

Parents of VERB nodes belong to 13 different parts of speech: ROOT (8088; 52% instances), VERB (5569; 36% instances), NOUN (1298; 8% instances), ADJ (167; 1% instances), ADV (152; 1% instances), DET (100; 1% instances), PROPN (50; 0% instances), PART (40; 0% instances), PRON (13; 0% instances), NUM (9; 0% instances), AUX (3; 0% instances), ADP (2; 0% instances), CCONJ (1; 0% instances)

61 (0%) VERB nodes are leaves.

345 (2%) VERB nodes have one child.

2202 (14%) VERB nodes have two children.

12884 (83%) VERB nodes have three or more children.

The highest child degree of a VERB node is 13.
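Child counts like those above could be derived from a CoNLL-U file by counting how often each VERB node appears as the HEAD of another node. The following is only a sketch: the sample sentence is invented for illustration and is not taken from UD_Bulgarian, and the function name `verb_child_degrees` is hypothetical.

```python
from collections import Counter

# Invented four-token sentence in CoNLL-U format (tab-separated, 10 columns).
SAMPLE = """\
1\tТой\tтой\tPRON\t_\t_\t2\tnsubj\t_\t_
2\tчете\tчета\tVERB\t_\t_\t0\troot\t_\t_
3\tкнига\tкнига\tNOUN\t_\t_\t2\tobj\t_\t_
4\t.\t.\tPUNCT\t_\t_\t2\tpunct\t_\t_
"""

def verb_child_degrees(conllu_sentence):
    """Return {token_id: number_of_children} for every VERB in one sentence."""
    rows = [line.split("\t") for line in conllu_sentence.splitlines()
            if line and not line.startswith("#")]
    # Keep plain integer IDs only; this skips multiword ranges and empty nodes.
    rows = [row for row in rows if row[0].isdigit()]
    children_per_head = Counter(row[6] for row in rows)  # column 7 is HEAD
    return {row[0]: children_per_head[row[0]] for row in rows if row[3] == "VERB"}

degrees = verb_child_degrees(SAMPLE)
print(degrees)
```

In this sample the only VERB node, token 2, has three children (its subject, its object and the final punctuation), so it would fall into the "three or more children" group.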

Children of VERB nodes are attached using 25 different relations: bg-dep/punct (11842; 21% instances), bg-dep/nsubj (7395; 13% instances), bg-dep/obj (5995; 11% instances), bg-dep/obl (4814; 8% instances), bg-dep/aux (4506; 8% instances), bg-dep/advmod (4261; 7% instances), bg-dep/iobj (3168; 6% instances), bg-dep/expl (2978; 5% instances), bg-dep/ccomp (1880; 3% instances), bg-dep/conj (1590; 3% instances), bg-dep/cc (1569; 3% instances), bg-dep/mark (1400; 2% instances), bg-dep/advcl (1299; 2% instances), bg-dep/nsubj:pass (1293; 2% instances), bg-dep/aux:pass (846; 1% instances), bg-dep/xcomp (705; 1% instances), bg-dep/parataxis (579; 1% instances), bg-dep/discourse (427; 1% instances), bg-dep/csubj (183; 0% instances), bg-dep/case (111; 0% instances), bg-dep/csubj:pass (80; 0% instances), bg-dep/vocative (39; 0% instances), bg-dep/fixed (24; 0% instances), bg-dep/cop (14; 0% instances), bg-dep/amod (6; 0% instances)

Children of VERB nodes belong to 16 different parts of speech: NOUN (16220; 28% instances), PUNCT (12001; 21% instances), PRON (6886; 12% instances), VERB (5569; 10% instances), AUX (5368; 9% instances), ADV (3595; 6% instances), PROPN (1852; 3% instances), CCONJ (1563; 3% instances), PART (1418; 2% instances), SCONJ (1121; 2% instances), ADJ (632; 1% instances), ADP (364; 1% instances), DET (253; 0% instances), NUM (128; 0% instances), INTJ (33; 0% instances), X (1; 0% instances)


VERB in other languages: [am] [ar] [bg] [bxr] [ca] [ckb] [cop] [cs] [cu] [da] [de] [el] [en] [es] [et] [eu] [fa] [fi] [fo] [fr] [ga] [gl] [got] [grc] [he] [hi] [hr] [hu] [id] [it] [ja] [kk] [kmr] [ko] [la] [lv] [mr] [nl] [no] [pl] [pt] [ro] [ru] [sa] [sk] [sla] [sl] [so] [sr] [sv] [swl] [ta] [tr] [ug] [uk] [u] [urj] [ur] [vi] [yue] [zh]