VERB: verb
Definition
A verb is a member of the syntactic class of words that typically signal events and actions, can constitute a minimal predicate in a clause, and govern the number and types of other constituents which may occur in the clause. Verbs are often associated with grammatical categories like tense, mood, aspect and voice, which can either be expressed inflectionally or using auxilliary verbs or particles.
The BulTreeBank annotation scheme provides the following mappings here: main verbs, copulas and modal verbs.
Note that modal verbs do not have special labels in our annotation scheme.
Participles and gerund are considered also VERB. Below the specific labels that map to VERB are given.
Examples
- Vp# (finite verb): тичам / ticham “run”
- Vn# (impersonal verb): вали, трябва / vali, tryabva “It rains, must”
- Vx# (the copula to be): съм / sam “to be”
- Vy# (the copula to be): бъда / bada “to be”
- Vi# (the copula to be): бивам / bivam “to be”
- V#cv# (past passive participle): намерен / nameren “found”. It is also mapped to ADJ in its attributive usages.
- V#cam# (past imperfective participle): четял / chetyal “He was reading”
- V#cao# (past perfective participle): дошъл / doshal “He has come”. It is also mapped to ADJ in its attributive usages.
- V#g (gerund): Идвайки / idvayki “Coming”
Note that the present active participle V#car# is mapped only to ADJ.
Note that the symbol `#’, used in the Universal POS section indicates a holder for arbitrary number of features, suppressed in the respective tag as irrelevant in the BulTreeBank tagset, when mapped to the Universal one.
Treebank Statistics (UD_Bulgarian)
There are 2644 VERB lemmas (18%), 6130 VERB types (24%) and 15492 VERB tokens (11%).
Out of 16 observed tags, the rank of VERB is: 4 in number of lemmas, 2 in number of types and 4 in number of tokens.
The 10 most frequent VERB lemmas: мога, имам, нямам, съм, трябва, кажа, има, искам, стана, съобщя
The 10 most frequent VERB types: има, може, няма, трябва, е, каза, могат, съобщи, заяви, стана
The 10 most frequent ambiguous lemmas: мога (VERB 370, ADJ 1), имам (VERB 323, ADJ 1), съм (AUX 3594, VERB 296), кажа (VERB 206, ADJ 4), искам (VERB 157, ADJ 3), стана (VERB 129, ADJ 3), направя-(се) (VERB 113, ADJ 6), видя-(се) (VERB 86, ADJ 1), дам-(се) (VERB 84, ADJ 4), работя (VERB 84, ADJ 6)
The 10 most frequent ambiguous types: е (AUX 1752, VERB 178), иска (VERB 40, NOUN 1), са (AUX 689, VERB 36), работи (VERB 34, NOUN 17), твърди (VERB 31, ADJ 1), прави (VERB 27, ADJ 5), води (VERB 28, NOUN 1), би (AUX 47, VERB 27), отказа (VERB 22, NOUN 1), мисли (VERB 17, NOUN 4)
- е
- иска
- VERB 40: Той иска съвет , към кого да се обърне .
- NOUN 1: В петък следобед в съда в канадския град Ванкувър и американския Саут Бент са внесени два граждански иска срещу България в лицето на Министерство на финансите , бившите Главна прокуратура и Национална следствена служба и Националния център по заразни и паразитни болести .
- са
- работи
- твърди
- прави
- води
- би
- отказа
- мисли
Morphology
The form / lemma ratio of VERB is 2.318457 (the average of all parts of speech is 1.709615).
The 1st highest number of forms (21) was observed with the lemma “мога”: Можехме, мога, могат, могла, могли, могло, могъл, можа, можах, можаха, може, можел, можела, можели, можело, можем, можете, можех, можеха, можеш, можеше.
The 2nd highest number of forms (17) was observed with the lemma “взема”: взе, взел, взела, взели, взело, взема, вземат, вземе, вземем, вземете, вземеш, вземи, взета, взети, взето, взех, взеха.
The 3rd highest number of forms (16) was observed with the lemma “видя-(се)”: видели, види, видим, видите, видиш, видя, видял, видяла, видяна, видят, видях, видяха, видяхме, видяхте, виж, вижте.
VERB occurs with 10 features: bg-feat/Aspect (15492; 100% instances), bg-feat/Number (15492; 100% instances), bg-feat/VerbForm (15492; 100% instances), bg-feat/Voice (15241; 98% instances), bg-feat/Tense (14273; 92% instances), bg-feat/Mood (13083; 84% instances), bg-feat/Person (13068; 84% instances), bg-feat/Definite (2424; 16% instances), bg-feat/Gender (1663; 11% instances), bg-feat/Degree (5; 0% instances)
VERB occurs with 24 feature-value pairs: Aspect=Imp, Aspect=Perf, Definite=Def, Definite=Ind, Degree=Cmp, Degree=Pos, Gender=Fem, Gender=Masc, Gender=Neut, Mood=Cnd, Mood=Imp, Mood=Ind, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Tense=Imp, Tense=Past, Tense=Pres, VerbForm=Fin, VerbForm=Part, Voice=Act, Voice=Pass
VERB occurs with 70 feature combinations.
The most frequent feature combination is Aspect=Imp|Mood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin|Voice=Act (3682 tokens).
Examples: има, може, няма, трябва, е, става, иска, дава, работи, разбира
Relations
VERB nodes are attached to their parents using 18 different relations: bg-dep/root (8088; 52% instances), bg-dep/ccomp (1688; 11% instances), bg-dep/conj (1672; 11% instances), bg-dep/advcl (1325; 9% instances), bg-dep/acl (1296; 8% instances), bg-dep/xcomp (643; 4% instances), bg-dep/parataxis (355; 2% instances), bg-dep/csubj (290; 2% instances), bg-dep/csubj:pass (60; 0% instances), bg-dep/fixed (33; 0% instances), bg-dep/aux:pass (24; 0% instances), bg-dep/det (4; 0% instances), bg-dep/discourse (3; 0% instances), bg-dep/nsubj (3; 0% instances), bg-dep/obj (3; 0% instances), bg-dep/nmod (2; 0% instances), bg-dep/obl (2; 0% instances), bg-dep/nsubj:pass (1; 0% instances)
Parents of VERB nodes belong to 13 different parts of speech: ROOT (8088; 52% instances), VERB (5569; 36% instances), NOUN (1298; 8% instances), ADJ (167; 1% instances), ADV (152; 1% instances), DET (100; 1% instances), PROPN (50; 0% instances), PART (40; 0% instances), PRON (13; 0% instances), NUM (9; 0% instances), AUX (3; 0% instances), ADP (2; 0% instances), CCONJ (1; 0% instances)
61 (0%) VERB nodes are leaves.
345 (2%) VERB nodes have one child.
2202 (14%) VERB nodes have two children.
12884 (83%) VERB nodes have three or more children.
The highest child degree of a VERB node is 13.
Children of VERB nodes are attached using 25 different relations: bg-dep/punct (11842; 21% instances), bg-dep/nsubj (7395; 13% instances), bg-dep/obj (5995; 11% instances), bg-dep/obl (4814; 8% instances), bg-dep/aux (4506; 8% instances), bg-dep/advmod (4261; 7% instances), bg-dep/iobj (3168; 6% instances), bg-dep/expl (2978; 5% instances), bg-dep/ccomp (1880; 3% instances), bg-dep/conj (1590; 3% instances), bg-dep/cc (1569; 3% instances), bg-dep/mark (1400; 2% instances), bg-dep/advcl (1299; 2% instances), bg-dep/nsubj:pass (1293; 2% instances), bg-dep/aux:pass (846; 1% instances), bg-dep/xcomp (705; 1% instances), bg-dep/parataxis (579; 1% instances), bg-dep/discourse (427; 1% instances), bg-dep/csubj (183; 0% instances), bg-dep/case (111; 0% instances), bg-dep/csubj:pass (80; 0% instances), bg-dep/vocative (39; 0% instances), bg-dep/fixed (24; 0% instances), bg-dep/cop (14; 0% instances), bg-dep/amod (6; 0% instances)
Children of VERB nodes belong to 16 different parts of speech: NOUN (16220; 28% instances), PUNCT (12001; 21% instances), PRON (6886; 12% instances), VERB (5569; 10% instances), AUX (5368; 9% instances), ADV (3595; 6% instances), PROPN (1852; 3% instances), CCONJ (1563; 3% instances), PART (1418; 2% instances), SCONJ (1121; 2% instances), ADJ (632; 1% instances), ADP (364; 1% instances), DET (253; 0% instances), NUM (128; 0% instances), INTJ (33; 0% instances), X (1; 0% instances)
VERB in other languages: [am] [ar] [bg] [bxr] [ca] [ckb] [cop] [cs] [cu] [da] [de] [el] [en] [es] [et] [eu] [fa] [fi] [fo] [fr] [ga] [gl] [got] [grc] [he] [hi] [hr] [hu] [id] [it] [ja] [kk] [kmr] [ko] [la] [lv] [mr] [nl] [no] [pl] [pt] [ro] [ru] [sa] [sk] [sla] [sl] [so] [sr] [sv] [swl] [ta] [tr] [ug] [uk] [u] [urj] [ur] [vi] [yue] [zh]