VERB: verb
Definition
A verb typically signals events and actions; it can constitute a minimal predicate in a clause.
Verbs in Estonian are associated with grammatical categories such as person, number, tense, mood and voice.
The VERB tag in Estonian UD v 1.3 does not cover auxiliaries, which are tagged AUX.
Auxiliaries are:
olema “be” and, on rare occasions, saama “get”, which form periphrastic tense forms;
the modal verbs võima, tohtima “may”, saama “can”, pidama “must”, and näima, paistma, tunduma “seem”;
ei and ära “not” in negative verb forms.
Participles are word forms that share the properties and usage of adjectives and verbs. Depending on their syntactic function, they are tagged as VERB or ADJ in Estonian UD.
Gerunds and infinitives are tagged as VERB, except for grammaticalized word forms.
Treebank Statistics (UD_Estonian)
There are 1162 VERB lemmas (16%), 2406 VERB types (22%) and 5066 VERB tokens (15%).
Out of 16 observed tags, the rank of VERB is: 2 in number of lemmas, 2 in number of types and 3 in number of tokens.
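Counts of this kind can be reproduced from a treebank file. The following is a minimal sketch, assuming the data is in the standard ten-column CoNLL-U layout (ID, FORM, LEMMA, UPOS, ...). The function name `count_upos_stats` and the two toy sentences are invented for illustration and are not taken from UD_Estonian.

```python
def count_upos_stats(conllu_text, upos="VERB"):
    """Return (distinct lemmas, distinct word forms, tokens) for one UPOS tag."""
    lemmas, types, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        if not line.strip() or line.startswith("#"):
            continue  # skip sentence breaks and comment lines
        cols = line.split("\t")
        if "-" in cols[0] or "." in cols[0]:
            continue  # skip multiword-token ranges and empty nodes
        if cols[3] == upos:
            tokens += 1
            types.add(cols[1])   # FORM column
            lemmas.add(cols[2])  # LEMMA column
    return len(lemmas), len(types), tokens


# Invented toy data (simplified annotation), not real treebank sentences.
SENTENCES = [
    [
        ["1", "Ta", "tema", "PRON", "_", "_", "2", "nsubj", "_", "_"],
        ["2", "tuli", "tulema", "VERB", "_", "_", "0", "root", "_", "_"],
        ["3", "ja", "ja", "CCONJ", "_", "_", "4", "cc", "_", "_"],
        ["4", "läks", "minema", "VERB", "_", "_", "2", "conj", "_", "_"],
        ["5", ".", ".", "PUNCT", "_", "_", "2", "punct", "_", "_"],
    ],
    [
        ["1", "Ta", "tema", "PRON", "_", "_", "2", "nsubj", "_", "_"],
        ["2", "tuli", "tulema", "VERB", "_", "_", "0", "root", "_", "_"],
        ["3", ".", ".", "PUNCT", "_", "_", "2", "punct", "_", "_"],
    ],
]
SAMPLE = "\n\n".join(
    "\n".join("\t".join(row) for row in sent) for sent in SENTENCES
) + "\n\n"

print(count_upos_stats(SAMPLE))  # (2, 2, 3)
```

Word forms are compared case-sensitively here; whether the published counts normalize case is not stated on this page.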
The 10 most frequent VERB lemmas: olema, tulema, ütlema, saama, tegema, minema, liiguta, nägema, jääma, teadma
The 10 most frequent VERB types: liigutas, ütles, on, jäi, viskus, tuli, oli, liikus, tõukas, läks
The 10 most frequent ambiguous lemmas: olema (AUX 878, VERB 118), tulema (VERB 98, AUX 3), saama (VERB 91, AUX 28), nägema (VERB 67, AUX 2), hakkama (VERB 46, AUX 3), sõit (VERB 38, NOUN 6), tõus (VERB 26, NOUN 1), pidama (AUX 70, VERB 24), paistma (VERB 13, AUX 1), tunduma (VERB 13, AUX 7)
The 10 most frequent ambiguous types: on (AUX 305, VERB 35), oli (AUX 333, VERB 24), sai (VERB 19, AUX 3), tuleb (VERB 17, AUX 1), tulnud (VERB 16, ADJ 3), nägi (VERB 15, AUX 1), hakkas (VERB 13, AUX 3), polnud (AUX 31, VERB 11), saanud (VERB 12, AUX 7, ADJ 4), vastas (VERB 12, ADV 2, ADP 1)
Morphology
The form / lemma ratio of VERB is 2.070568 (the average of all parts of speech is 1.545328).
The 1st highest number of forms (20) was observed with the lemma “minema”: Läksin, Minge, lähe, läheb, lähed, lähegi, läheks, läheme, lähen, lähete, lähevad, lähme, läinud, läks, läksid, läksime, mine, minema, minna, minnes.
The 2nd highest number of forms (20) was observed with the lemma “olema”: Oleme, Olge, Olgu, Ongi, ole, oled, oleks, olema, olemas, olen, olevat, oli, olid, oligi, olla, olnud, on, pole, polegi, polnud.
The 3rd highest number of forms (19) was observed with the lemma “tegema”: Teeme, tee, teeb, teed, teeks, teen, teevad, tegema, tegemas, tegemata, tegi, tegid, tegin, teha, tehes, tehtagi, tehti, tehtud, teinud.
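The reported form / lemma ratio is consistent with dividing the number of VERB types by the number of VERB lemmas given in the treebank statistics (2406 / 1162). This reading is an inference from the figures, not a documented definition. The sketch below recomputes the ratio and shows one way to rank lemmas by their number of distinct forms. The helper `forms_per_lemma` is a hypothetical name, and the pairs are a small subset picked from the lists above.

```python
from collections import defaultdict


def forms_per_lemma(pairs):
    """Map each lemma to the set of distinct word forms seen with it."""
    forms = defaultdict(set)
    for form, lemma in pairs:
        forms[lemma].add(form)
    return dict(forms)


# A few (form, lemma) pairs from the lists above; not the full treebank.
PAIRS = [
    ("läks", "minema"), ("minna", "minema"), ("läks", "minema"),
    ("oli", "olema"),
    ("tegi", "tegema"), ("teha", "tegema"),
]

by_lemma = forms_per_lemma(PAIRS)
# Rank by number of distinct forms, breaking ties alphabetically.
ranked = sorted(by_lemma, key=lambda lemma: (-len(by_lemma[lemma]), lemma))

# Ratio recomputed from the reported type and lemma counts (assumed definition).
form_lemma_ratio = 2406 / 1162

print(ranked)
print(f"{form_lemma_ratio:.6f}")
```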
VERB occurs with 9 features: et-feat/VerbForm (5066; 100% instances), et-feat/Voice (4462; 88% instances), et-feat/Tense (4163; 82% instances), et-feat/Mood (3791; 75% instances), et-feat/Number (3360; 66% instances), et-feat/Person (3356; 66% instances), et-feat/Case (299; 6% instances), et-feat/Connegative (273; 5% instances), et-feat/Polarity (27; 1% instances)
VERB occurs with 28 feature-value pairs: Case=Abe, Case=All, Case=Ela, Case=Ill, Case=Ine, Case=Tra, Connegative=Yes, Mood=Cnd, Mood=Imp, Mood=Ind, Mood=Qot, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Polarity=Neg, Tense=Imp, Tense=Past, Tense=Pres, VerbForm=Conv, VerbForm=Fin, VerbForm=Ger, VerbForm=Inf, VerbForm=Part, VerbForm=Sup, Voice=Act, Voice=Pass
VERB occurs with 58 feature combinations.
The most frequent feature combination is Mood=Ind|Number=Sing|Person=3|Tense=Past|VerbForm=Fin|Voice=Act (1157 tokens).
Examples: ütles, oli, jäi, tuli, võttis, vaatas, tundis, küsis, sai, tegi
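Feature statistics like those above can be tallied from the FEATS column of a CoNLL-U file. This is a rough sketch under that assumption. The names `feature_combinations` and `feature_names` are hypothetical, and the toy sentences are invented, with features given only on the verbs.

```python
from collections import Counter

FIN_PAST = "Mood=Ind|Number=Sing|Person=3|Tense=Past|VerbForm=Fin|Voice=Act"


def feature_combinations(conllu_text, upos="VERB"):
    """Count full FEATS strings over all tokens with the given UPOS tag."""
    combos = Counter()
    for line in conllu_text.splitlines():
        if not line.strip() or line.startswith("#"):
            continue  # skip sentence breaks and comment lines
        cols = line.split("\t")
        if cols[0].isdigit() and cols[3] == upos:
            combos[cols[5]] += 1  # FEATS is the sixth column
    return combos


def feature_names(combos):
    """Count how many tokens carry each feature name (e.g. VerbForm)."""
    names = Counter()
    for feats, n in combos.items():
        if feats == "_":
            continue  # token without features
        for pair in feats.split("|"):
            names[pair.split("=", 1)[0]] += n
    return names


# Invented toy data (simplified annotation), not real treebank sentences.
SENTENCES = [
    [
        ["1", "Ta", "tema", "PRON", "_", "_", "2", "nsubj", "_", "_"],
        ["2", "tahtis", "tahtma", "VERB", "_", FIN_PAST, "0", "root", "_", "_"],
        ["3", "minna", "minema", "VERB", "_", "VerbForm=Inf", "2", "xcomp", "_", "_"],
        ["4", ".", ".", "PUNCT", "_", "_", "2", "punct", "_", "_"],
    ],
    [
        ["1", "Ta", "tema", "PRON", "_", "_", "2", "nsubj", "_", "_"],
        ["2", "tuli", "tulema", "VERB", "_", FIN_PAST, "0", "root", "_", "_"],
        ["3", ".", ".", "PUNCT", "_", "_", "2", "punct", "_", "_"],
    ],
]
SAMPLE = "\n\n".join(
    "\n".join("\t".join(row) for row in sent) for sent in SENTENCES
) + "\n\n"

combos = feature_combinations(SAMPLE)
names = feature_names(combos)
print(combos.most_common(1))
print(names["VerbForm"], names["Voice"])
```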
Relations
VERB nodes are attached to their parents using 11 different relations: et-dep/root (2634; 52% instances), et-dep/conj (854; 17% instances), et-dep/advcl (540; 11% instances), et-dep/xcomp (298; 6% instances), et-dep/parataxis (237; 5% instances), et-dep/acl:relcl (193; 4% instances), et-dep/ccomp (148; 3% instances), et-dep/csubj (66; 1% instances), et-dep/acl (53; 1% instances), et-dep/csubj:cop (37; 1% instances), et-dep/compound (6; 0% instances)
Parents of VERB nodes belong to 9 different parts of speech: ROOT (2634; 52% instances), VERB (1850; 37% instances), NOUN (274; 5% instances), ADJ (161; 3% instances), PRON (60; 1% instances), ADV (43; 1% instances), PROPN (26; 1% instances), AUX (17; 0% instances), NUM (1; 0% instances)
169 (3%) VERB nodes are leaves.
376 (7%) VERB nodes have one child.
522 (10%) VERB nodes have two children.
3999 (79%) VERB nodes have three or more children.
The highest child degree of a VERB node is 14.
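The four child-degree counts above sum to the 5066 VERB tokens (169 + 376 + 522 + 3999). A distribution of this kind could be derived from the HEAD column of a CoNLL-U file, roughly as sketched below. The function name `child_degrees` is hypothetical, the toy sentences are invented, and the sketch assumes sentences are separated by exactly one empty line.

```python
from collections import Counter


def child_degrees(conllu_text, upos="VERB"):
    """Count how many nodes with the given UPOS have 0, 1, 2, ... children."""
    degrees = Counter()
    for block in conllu_text.strip().split("\n\n"):
        rows = [
            line.split("\t")
            for line in block.splitlines()
            if line and not line.startswith("#")
        ]
        rows = [r for r in rows if r[0].isdigit()]  # plain word lines only
        children = Counter(r[6] for r in rows)  # HEAD id -> number of children
        for r in rows:
            if r[3] == upos:
                degrees[children[r[0]]] += 1
    return degrees


# Invented toy data (simplified annotation), not real treebank sentences.
SENTENCES = [
    [
        ["1", "Ta", "tema", "PRON", "_", "_", "2", "nsubj", "_", "_"],
        ["2", "tuli", "tulema", "VERB", "_", "_", "0", "root", "_", "_"],
        ["3", "ja", "ja", "CCONJ", "_", "_", "4", "cc", "_", "_"],
        ["4", "läks", "minema", "VERB", "_", "_", "2", "conj", "_", "_"],
        ["5", ".", ".", "PUNCT", "_", "_", "2", "punct", "_", "_"],
    ],
    [
        ["1", "Ta", "tema", "PRON", "_", "_", "2", "nsubj", "_", "_"],
        ["2", "tahtis", "tahtma", "VERB", "_", "_", "0", "root", "_", "_"],
        ["3", "minna", "minema", "VERB", "_", "_", "2", "xcomp", "_", "_"],
        ["4", ".", ".", "PUNCT", "_", "_", "2", "punct", "_", "_"],
    ],
]
SAMPLE = "\n\n".join(
    "\n".join("\t".join(row) for row in sent) for sent in SENTENCES
) + "\n\n"

degrees = child_degrees(SAMPLE)
print(sorted(degrees.items()))  # [(0, 1), (1, 1), (3, 2)]
print(max(degrees))             # 3
```

In the toy data, minna is a leaf, läks has one child (ja), and tuli and tahtis each have three children.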
Children of VERB nodes are attached using 25 different relations: et-dep/punct (4619; 24% instances), et-dep/obl (3304; 17% instances), et-dep/nsubj (3045; 16% instances), et-dep/obj (1795; 9% instances), et-dep/advmod (1780; 9% instances), et-dep/conj (827; 4% instances), et-dep/aux (797; 4% instances), et-dep/cc (691; 4% instances), et-dep/advcl (577; 3% instances), et-dep/compound:prt (570; 3% instances), et-dep/mark (545; 3% instances), et-dep/xcomp (393; 2% instances), et-dep/ccomp (203; 1% instances), et-dep/parataxis (188; 1% instances), et-dep/csubj (66; 0% instances), et-dep/amod (60; 0% instances), et-dep/vocative (44; 0% instances), et-dep/nummod (42; 0% instances), et-dep/discourse (29; 0% instances), et-dep/compound (6; 0% instances), et-dep/cc:preconj (4; 0% instances), et-dep/cop (4; 0% instances), et-dep/nsubj:cop (4; 0% instances), et-dep/nmod (3; 0% instances), et-dep/orphan (1; 0% instances)
Children of VERB nodes belong to 14 different parts of speech: NOUN (5686; 29% instances), PUNCT (4619; 24% instances), ADV (2469; 13% instances), VERB (1850; 9% instances), PRON (1789; 9% instances), PROPN (923; 5% instances), AUX (802; 4% instances), CCONJ (694; 4% instances), SCONJ (447; 2% instances), ADJ (232; 1% instances), NUM (54; 0% instances), INTJ (29; 0% instances), ADP (2; 0% instances), X (1; 0% instances)