home et/pos edit page issue tracker

This page still pertains to UD version 1.

VERB: verb

Definition

A verb typically signals events and actions; it can constitute a minimal predicate in a clause. Verbs in Estonian associate with grammatical categories like person, number, tense, mood and voice.
The verb tag in Estonian UD v 1.3 does not cover auxiliaries AUX.
Auxiliaries are:
olema “be” and in rare occasions saama “get” are auxiliaries that form periphrastic tense forms;
modal verbs are võima, tohtima “may”, saama “can”, pidama “must”, näima, paistma, tunduma “seem”;
ei and ära “not” in negative verb forms.

Participles are word forms that share properties and usage of adjectives and verbs. Depending on their syntactic function they are tagged as VERB or ADJ in Estonian UD.

Gerunds and infinitives are tagged as VERB, except for grammatized word-forms.


Treebank Statistics (UD_Estonian)

There are 1162 VERB lemmas (16%), 2406 VERB types (22%) and 5066 VERB tokens (15%). Out of 16 observed tags, the rank of VERB is: 2 in number of lemmas, 2 in number of types and 3 in number of tokens.

The 10 most frequent VERB lemmas: olema, tulema, ütlema, saama, tegema, minema, liiguta, nägema, jääma, teadma

The 10 most frequent VERB types: liigutas, ütles, on, jäi, viskus, tuli, oli, liikus, tõukas, läks

The 10 most frequent ambiguous lemmas: olema (AUX 878, VERB 118), tulema (VERB 98, AUX 3), saama (VERB 91, AUX 28), nägema (VERB 67, AUX 2), hakkama (VERB 46, AUX 3), sõit (VERB 38, NOUN 6), tõus (VERB 26, NOUN 1), pidama (AUX 70, VERB 24), paistma (VERB 13, AUX 1), tunduma (VERB 13, AUX 7)

The 10 most frequent ambiguous types: on (AUX 305, VERB 35), oli (AUX 333, VERB 24), sai (VERB 19, AUX 3), tuleb (VERB 17, AUX 1), tulnud (VERB 16, ADJ 3), nägi (VERB 15, AUX 1), hakkas (VERB 13, AUX 3), polnud (AUX 31, VERB 11), saanud (VERB 12, AUX 7, ADJ 4), vastas (VERB 12, ADV 2, ADP 1)

Morphology

The form / lemma ratio of VERB is 2.070568 (the average of all parts of speech is 1.545328).

The 1st highest number of forms (20) was observed with the lemma “minema”: Läksin, Minge, lähe, läheb, lähed, lähegi, läheks, läheme, lähen, lähete, lähevad, lähme, läinud, läks, läksid, läksime, mine, minema, minna, minnes.

The 2nd highest number of forms (20) was observed with the lemma “olema”: Oleme, Olge, Olgu, Ongi, ole, oled, oleks, olema, olemas, olen, olevat, oli, olid, oligi, olla, olnud, on, pole, polegi, polnud.

The 3rd highest number of forms (19) was observed with the lemma “tegema”: Teeme, tee, teeb, teed, teeks, teen, teevad, tegema, tegemas, tegemata, tegi, tegid, tegin, teha, tehes, tehtagi, tehti, tehtud, teinud.

VERB occurs with 9 features: et-feat/VerbForm (5066; 100% instances), et-feat/Voice (4462; 88% instances), et-feat/Tense (4163; 82% instances), et-feat/Mood (3791; 75% instances), et-feat/Number (3360; 66% instances), et-feat/Person (3356; 66% instances), et-feat/Case (299; 6% instances), et-feat/Connegative (273; 5% instances), et-feat/Polarity (27; 1% instances)

VERB occurs with 28 feature-value pairs: Case=Abe, Case=All, Case=Ela, Case=Ill, Case=Ine, Case=Tra, Connegative=Yes, Mood=Cnd, Mood=Imp, Mood=Ind, Mood=Qot, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Polarity=Neg, Tense=Imp, Tense=Past, Tense=Pres, VerbForm=Conv, VerbForm=Fin, VerbForm=Ger, VerbForm=Inf, VerbForm=Part, VerbForm=Sup, Voice=Act, Voice=Pass

VERB occurs with 58 feature combinations. The most frequent feature combination is Mood=Ind|Number=Sing|Person=3|Tense=Past|VerbForm=Fin|Voice=Act (1157 tokens). Examples: ütles, oli, jäi, tuli, võttis, vaatas, tundis, küsis, sai, tegi

Relations

VERB nodes are attached to their parents using 11 different relations: et-dep/root (2634; 52% instances), et-dep/conj (854; 17% instances), et-dep/advcl (540; 11% instances), et-dep/xcomp (298; 6% instances), et-dep/parataxis (237; 5% instances), et-dep/acl:relcl (193; 4% instances), et-dep/ccomp (148; 3% instances), et-dep/csubj (66; 1% instances), et-dep/acl (53; 1% instances), et-dep/csubj:cop (37; 1% instances), et-dep/compound (6; 0% instances)

Parents of VERB nodes belong to 9 different parts of speech: ROOT (2634; 52% instances), VERB (1850; 37% instances), NOUN (274; 5% instances), ADJ (161; 3% instances), PRON (60; 1% instances), ADV (43; 1% instances), PROPN (26; 1% instances), AUX (17; 0% instances), NUM (1; 0% instances)

169 (3%) VERB nodes are leaves.

376 (7%) VERB nodes have one child.

522 (10%) VERB nodes have two children.

3999 (79%) VERB nodes have three or more children.

The highest child degree of a VERB node is 14.

Children of VERB nodes are attached using 25 different relations: et-dep/punct (4619; 24% instances), et-dep/obl (3304; 17% instances), et-dep/nsubj (3045; 16% instances), et-dep/obj (1795; 9% instances), et-dep/advmod (1780; 9% instances), et-dep/conj (827; 4% instances), et-dep/aux (797; 4% instances), et-dep/cc (691; 4% instances), et-dep/advcl (577; 3% instances), et-dep/compound:prt (570; 3% instances), et-dep/mark (545; 3% instances), et-dep/xcomp (393; 2% instances), et-dep/ccomp (203; 1% instances), et-dep/parataxis (188; 1% instances), et-dep/csubj (66; 0% instances), et-dep/amod (60; 0% instances), et-dep/vocative (44; 0% instances), et-dep/nummod (42; 0% instances), et-dep/discourse (29; 0% instances), et-dep/compound (6; 0% instances), et-dep/cc:preconj (4; 0% instances), et-dep/cop (4; 0% instances), et-dep/nsubj:cop (4; 0% instances), et-dep/nmod (3; 0% instances), et-dep/orphan (1; 0% instances)

Children of VERB nodes belong to 14 different parts of speech: NOUN (5686; 29% instances), PUNCT (4619; 24% instances), ADV (2469; 13% instances), VERB (1850; 9% instances), PRON (1789; 9% instances), PROPN (923; 5% instances), AUX (802; 4% instances), CCONJ (694; 4% instances), SCONJ (447; 2% instances), ADJ (232; 1% instances), NUM (54; 0% instances), INTJ (29; 0% instances), ADP (2; 0% instances), X (1; 0% instances)


VERB in other languages: [am] [ar] [bg] [bxr] [ca] [ckb] [cop] [cs] [cu] [da] [de] [el] [en] [es] [et] [eu] [fa] [fi] [fo] [fr] [ga] [gl] [got] [grc] [he] [hi] [hr] [hu] [id] [it] [ja] [kk] [kmr] [ko] [la] [lv] [mr] [nl] [no] [pl] [pt] [ro] [ru] [sa] [sk] [sla] [sl] [so] [sr] [sv] [swl] [ta] [tr] [ug] [uk] [u] [urj] [ur] [vi] [yue] [zh]