VERB
: verb
Definition
A verb is a member of the syntactic class of words that typically signal events and actions, can constitute a minimal predicate in a clause, and govern the number and types of other constituents which may occur in the clause. Verbs are often associated with grammatical categories like tense, mood, aspect and voice, which can either be expressed inflectionally or using auxilliary verbs or particles.
The BulTreeBank annotation scheme provides the following mappings here: main verbs, copulas and modal verbs.
Note that modal verbs do not have special labels in our annotation scheme.
Participles and gerund are considered also VERB
. Below the specific labels that map to VERB
are given.
Examples
- Vp# (finite verb): тичам / ticham “run”
- Vn# (impersonal verb): вали, трябва / vali, tryabva “It rains, must”
- Vx# (the copula to be): съм / sam “to be”
- Vy# (the copula to be): бъда / bada “to be”
- Vi# (the copula to be): бивам / bivam “to be”
- V#cv# (past passive participle): намерен / nameren “found”. It is also mapped to ADJ in its attributive usages.
- V#cam# (past imperfective participle): четял / chetyal “He was reading”
- V#cao# (past perfective participle): дошъл / doshal “He has come”. It is also mapped to ADJ in its attributive usages.
- V#g (gerund): Идвайки / idvayki “Coming”
Note that the present active participle V#car# is mapped only to ADJ.
Note that the symbol `#’, used in the Universal POS section indicates a holder for arbitrary number of features, suppressed in the respective tag as irrelevant in the BulTreeBank tagset, when mapped to the Universal one.
Treebank Statistics (UD_Bulgarian)
There are 2644 VERB
lemmas (18%), 6130 VERB
types (24%) and 15492 VERB
tokens (11%).
Out of 16 observed tags, the rank of VERB
is: 4 in number of lemmas, 2 in number of types and 4 in number of tokens.
The 10 most frequent VERB
lemmas: мога, имам, нямам, съм, трябва, кажа, има, искам, стана, съобщя
The 10 most frequent VERB
types: има, може, няма, трябва, е, каза, могат, съобщи, заяви, стана
The 10 most frequent ambiguous lemmas: мога (VERB 370, ADJ 1), имам (VERB 323, ADJ 1), съм (AUX 3594, VERB 296), кажа (VERB 206, ADJ 4), искам (VERB 157, ADJ 3), стана (VERB 129, ADJ 3), направя-(се) (VERB 113, ADJ 6), видя-(се) (VERB 86, ADJ 1), дам-(се) (VERB 84, ADJ 4), работя (VERB 84, ADJ 6)
The 10 most frequent ambiguous types: е (AUX 1752, VERB 178), иска (VERB 40, NOUN 1), са (AUX 689, VERB 36), работи (VERB 34, NOUN 17), твърди (VERB 31, ADJ 1), прави (VERB 27, ADJ 5), води (VERB 28, NOUN 1), би (AUX 47, VERB 27), отказа (VERB 22, NOUN 1), мисли (VERB 17, NOUN 4)
- е
- иска
- VERB 40: Той иска съвет , към кого да се обърне .
- NOUN 1: В петък следобед в съда в канадския град Ванкувър и американския Саут Бент са внесени два граждански иска срещу България в лицето на Министерство на финансите , бившите Главна прокуратура и Национална следствена служба и Националния център по заразни и паразитни болести .
- са
- работи
- твърди
- прави
- води
- би
- отказа
- мисли
Morphology
The form / lemma ratio of VERB
is 2.318457 (the average of all parts of speech is 1.709615).
The 1st highest number of forms (21) was observed with the lemma “мога”: Можехме, мога, могат, могла, могли, могло, могъл, можа, можах, можаха, може, можел, можела, можели, можело, можем, можете, можех, можеха, можеш, можеше.
The 2nd highest number of forms (17) was observed with the lemma “взема”: взе, взел, взела, взели, взело, взема, вземат, вземе, вземем, вземете, вземеш, вземи, взета, взети, взето, взех, взеха.
The 3rd highest number of forms (16) was observed with the lemma “видя-(се)”: видели, види, видим, видите, видиш, видя, видял, видяла, видяна, видят, видях, видяха, видяхме, видяхте, виж, вижте.
VERB
occurs with 10 features: bg-feat/Aspect (15492; 100% instances), bg-feat/Number (15492; 100% instances), bg-feat/VerbForm (15492; 100% instances), bg-feat/Voice (15241; 98% instances), bg-feat/Tense (14273; 92% instances), bg-feat/Mood (13083; 84% instances), bg-feat/Person (13068; 84% instances), bg-feat/Definite (2424; 16% instances), bg-feat/Gender (1663; 11% instances), bg-feat/Degree (5; 0% instances)
VERB
occurs with 24 feature-value pairs: Aspect=Imp
, Aspect=Perf
, Definite=Def
, Definite=Ind
, Degree=Cmp
, Degree=Pos
, Gender=Fem
, Gender=Masc
, Gender=Neut
, Mood=Cnd
, Mood=Imp
, Mood=Ind
, Number=Plur
, Number=Sing
, Person=1
, Person=2
, Person=3
, Tense=Imp
, Tense=Past
, Tense=Pres
, VerbForm=Fin
, VerbForm=Part
, Voice=Act
, Voice=Pass
VERB
occurs with 70 feature combinations.
The most frequent feature combination is Aspect=Imp|Mood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin|Voice=Act
(3682 tokens).
Examples: има, може, няма, трябва, е, става, иска, дава, работи, разбира
Relations
VERB
nodes are attached to their parents using 18 different relations: bg-dep/root (8088; 52% instances), bg-dep/ccomp (1688; 11% instances), bg-dep/conj (1672; 11% instances), bg-dep/advcl (1325; 9% instances), bg-dep/acl (1296; 8% instances), bg-dep/xcomp (643; 4% instances), bg-dep/parataxis (355; 2% instances), bg-dep/csubj (290; 2% instances), bg-dep/csubj:pass (60; 0% instances), bg-dep/fixed (33; 0% instances), bg-dep/aux:pass (24; 0% instances), bg-dep/det (4; 0% instances), bg-dep/discourse (3; 0% instances), bg-dep/nsubj (3; 0% instances), bg-dep/obj (3; 0% instances), bg-dep/nmod (2; 0% instances), bg-dep/obl (2; 0% instances), bg-dep/nsubj:pass (1; 0% instances)
Parents of VERB
nodes belong to 13 different parts of speech: ROOT (8088; 52% instances), VERB (5569; 36% instances), NOUN (1298; 8% instances), ADJ (167; 1% instances), ADV (152; 1% instances), DET (100; 1% instances), PROPN (50; 0% instances), PART (40; 0% instances), PRON (13; 0% instances), NUM (9; 0% instances), AUX (3; 0% instances), ADP (2; 0% instances), CCONJ (1; 0% instances)
61 (0%) VERB
nodes are leaves.
345 (2%) VERB
nodes have one child.
2202 (14%) VERB
nodes have two children.
12884 (83%) VERB
nodes have three or more children.
The highest child degree of a VERB
node is 13.
Children of VERB
nodes are attached using 25 different relations: bg-dep/punct (11842; 21% instances), bg-dep/nsubj (7395; 13% instances), bg-dep/obj (5995; 11% instances), bg-dep/obl (4814; 8% instances), bg-dep/aux (4506; 8% instances), bg-dep/advmod (4261; 7% instances), bg-dep/iobj (3168; 6% instances), bg-dep/expl (2978; 5% instances), bg-dep/ccomp (1880; 3% instances), bg-dep/conj (1590; 3% instances), bg-dep/cc (1569; 3% instances), bg-dep/mark (1400; 2% instances), bg-dep/advcl (1299; 2% instances), bg-dep/nsubj:pass (1293; 2% instances), bg-dep/aux:pass (846; 1% instances), bg-dep/xcomp (705; 1% instances), bg-dep/parataxis (579; 1% instances), bg-dep/discourse (427; 1% instances), bg-dep/csubj (183; 0% instances), bg-dep/case (111; 0% instances), bg-dep/csubj:pass (80; 0% instances), bg-dep/vocative (39; 0% instances), bg-dep/fixed (24; 0% instances), bg-dep/cop (14; 0% instances), bg-dep/amod (6; 0% instances)
Children of VERB
nodes belong to 16 different parts of speech: NOUN (16220; 28% instances), PUNCT (12001; 21% instances), PRON (6886; 12% instances), VERB (5569; 10% instances), AUX (5368; 9% instances), ADV (3595; 6% instances), PROPN (1852; 3% instances), CCONJ (1563; 3% instances), PART (1418; 2% instances), SCONJ (1121; 2% instances), ADJ (632; 1% instances), ADP (364; 1% instances), DET (253; 0% instances), NUM (128; 0% instances), INTJ (33; 0% instances), X (1; 0% instances)
VERB in other languages: [am] [ar] [bg] [bxr] [ca] [ckb] [cop] [cs] [cu] [da] [de] [el] [en] [es] [et] [eu] [fa] [fi] [fo] [fr] [ga] [gl] [got] [grc] [he] [hi] [hr] [hu] [id] [it] [ja] [kk] [kmr] [ko] [la] [lv] [mr] [nl] [no] [pl] [pt] [ro] [ru] [sa] [sk] [sla] [sl] [so] [sr] [sv] [swl] [ta] [tr] [ug] [uk] [u] [urj] [ur] [vi] [yue] [zh]