home sl/pos edit page issue tracker

This page still pertains to UD version 1.

VERB: verb

Definition

A verb is a member of the syntactic class of words that typically signal events and actions, can constitute a minimal predicate in a clause, and govern the number and types of other constituents which may occur in the clause.

In Slovenian, the VERB tag covers all verbs (including content, modal and copula verbs), except for the auxiliary verb biti “to be”, which is tagged as AUX.

Word forms that etymologically derive from verbs, but have different syntactic properties, such as adjectival participles (ukraden “stolen”, pokrit “covered”), transgressives (upoštevaje “taking into account”, začenši “starting”) and gerunds (govorjenje “speaking”, zavrnitev “rejection”, gretje “heating”), are marked as adjectives, adverbs or nouns respectively.

Examples

Conversion from JOS

All verbs with Type=main have been converted to VERB. Additionally, those instances of verb biti with Type=auxiliary that do not bear the PPart dependency relation to a main verb have also been converted to VERB.


Treebank Statistics (UD_Slovenian)

There are 2247 VERB lemmas (14%), 5700 VERB types (19%) and 13031 VERB tokens (10%). Out of 16 observed tags, the rank of VERB is: 4 in number of lemmas, 3 in number of types and 4 in number of tokens.

The 10 most frequent VERB lemmas: biti, imeti, morati, iti, začeti, priti, moči, vedeti, dobiti, povedati

The 10 most frequent VERB types: je, ima, bilo, ni, gre, imajo, bo, so, mora, bila

The 10 most frequent ambiguous lemmas: biti (AUX 8949, VERB 875), peti (ADJ 7, VERB 4)

The 10 most frequent ambiguous types: je (AUX 3703, VERB 332, PRON 13), bilo (VERB 112, AUX 106), ni (AUX 345, VERB 93), bo (AUX 438, VERB 66), so (AUX 1362, VERB 62), mora (VERB 48, NOUN 2), bila (AUX 187, VERB 44), pomeni (VERB 39, NOUN 1), bil (AUX 196, VERB 34), pravi (VERB 29, ADJ 21)

Morphology

The form / lemma ratio of VERB is 2.536716 (the average of all parts of speech is 1.870691).

The 1st highest number of forms (28) was observed with the lemma “biti”: Sva, bi, bijejo, bil, bila, bile, bili, bilo, biti, bla, blo, bo, bodo, bom, boste, je, ni, nisem, nisi, nismo, niso, niste, sem, si, smo, so, sta, ste.

The 2nd highest number of forms (21) was observed with the lemma “imeti”: ima, imajo, imam, imamo, imata, imate, imava, imaš, imejte, imel, imela, imele, imeli, imelo, imeti, nima, nimajo, nimam, nimamo, nimate, nimaš.

The 3rd highest number of forms (17) was observed with the lemma “hoteti”: Hočeš, Nočemo, hotel, hotela, hotele, hoteli, hoče, hočejo, hočem, hočemo, hočeta, hočete, noče, nočejo, nočem, nočete, nočeš.

VERB occurs with 8 features: sl-feat/VerbForm (13031; 100% instances), sl-feat/Number (11811; 91% instances), sl-feat/Aspect (11189; 86% instances), sl-feat/Gender (6228; 48% instances), sl-feat/Mood (5584; 43% instances), sl-feat/Person (5583; 43% instances), sl-feat/Tense (5328; 41% instances), sl-feat/Polarity (978; 8% instances)

VERB occurs with 22 feature-value pairs: Aspect=Imp, Aspect=Perf, Gender=Fem, Gender=Masc, Gender=Neut, Mood=Cnd, Mood=Imp, Mood=Ind, Number=Dual, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Polarity=Neg, Polarity=Pos, Tense=Fut, Tense=Pres, VerbForm=Fin, VerbForm=Inf, VerbForm=Part, VerbForm=Sup

VERB occurs with 104 feature combinations. The most frequent feature combination is Aspect=Perf|Gender=Masc|Number=Sing|VerbForm=Part (1391 tokens). Examples: povedal, dejal, rekel, postal, začel, dobil, prišel, odločil, napisal, pogledal

Relations

VERB nodes are attached to their parents using 9 different relations: sl-dep/root (5597; 43% instances), sl-dep/acl (1788; 14% instances), sl-dep/conj (1407; 11% instances), sl-dep/parataxis (1101; 8% instances), sl-dep/advcl (980; 8% instances), sl-dep/ccomp (870; 7% instances), sl-dep/xcomp (816; 6% instances), sl-dep/csubj (468; 4% instances), sl-dep/fixed (4; 0% instances)

Parents of VERB nodes belong to 11 different parts of speech: ROOT (5597; 43% instances), VERB (4802; 37% instances), NOUN (1674; 13% instances), ADJ (666; 5% instances), DET (168; 1% instances), PROPN (91; 1% instances), PRON (20; 0% instances), NUM (9; 0% instances), X (2; 0% instances), ADV (1; 0% instances), AUX (1; 0% instances)

54 (0%) VERB nodes are leaves.

378 (3%) VERB nodes have one child.

1117 (9%) VERB nodes have two children.

11482 (88%) VERB nodes have three or more children.

The highest child degree of a VERB node is 12.

Children of VERB nodes are attached using 21 different relations: sl-dep/punct (11578; 20% instances), sl-dep/obl (7632; 13% instances), sl-dep/advmod (7602; 13% instances), sl-dep/obj (6170; 11% instances), sl-dep/aux (5787; 10% instances), sl-dep/nsubj (4968; 9% instances), sl-dep/mark (3505; 6% instances), sl-dep/expl (2063; 4% instances), sl-dep/cc (1687; 3% instances), sl-dep/conj (1344; 2% instances), sl-dep/parataxis (1178; 2% instances), sl-dep/xcomp (1059; 2% instances), sl-dep/ccomp (1019; 2% instances), sl-dep/advcl (990; 2% instances), sl-dep/iobj (555; 1% instances), sl-dep/csubj (257; 0% instances), sl-dep/discourse (46; 0% instances), sl-dep/dep (6; 0% instances), sl-dep/nmod (6; 0% instances), sl-dep/cc:preconj (1; 0% instances), sl-dep/fixed (1; 0% instances)

Children of VERB nodes belong to 16 different parts of speech: NOUN (14561; 25% instances), PUNCT (11578; 20% instances), AUX (5779; 10% instances), VERB (4802; 8% instances), PRON (4471; 8% instances), ADV (3963; 7% instances), SCONJ (3307; 6% instances), PART (2879; 5% instances), CCONJ (2457; 4% instances), PROPN (1410; 2% instances), DET (1093; 2% instances), ADJ (972; 2% instances), NUM (130; 0% instances), X (29; 0% instances), ADP (12; 0% instances), INTJ (11; 0% instances)


Treebank Statistics (UD_Slovenian-SST)

There are 570 VERB lemmas (18%), 1084 VERB types (23%) and 2581 VERB tokens (13%). Out of 16 observed tags, the rank of VERB is: 2 in number of lemmas, 2 in number of types and 1 in number of tokens.

The 10 most frequent VERB lemmas: biti, imeti, vedeti, iti, reči, misliti, dati, videti, priti, morati

The 10 most frequent VERB types: je, veš, vem, mislim, bilo, ni, recimo, so, ima, bo

The 10 most frequent ambiguous lemmas: biti (AUX 1267, VERB 449), peti (ADJ 5, VERB 2)

The 10 most frequent ambiguous types: je (AUX 461, VERB 196, INTJ 2, PRON 2), mislim (VERB 53, NOUN 1), bilo (VERB 36, AUX 14), ni (AUX 44, VERB 36), so (AUX 135, VERB 29, X 2), bo (AUX 65, VERB 22), bil (AUX 25, VERB 21), pravi (VERB 18, ADJ 1), si (AUX 34, PRON 33, VERB 16), bom (AUX 15, VERB 14)

Morphology

The form / lemma ratio of VERB is 1.901754 (the average of all parts of speech is 1.494596).

The 1st highest number of forms (28) was observed with the lemma “biti”: bi, bil, bila, bile, bili, bilo, biti, bo, bodo, bojo, bom, bomo, bosta, bova, boš, je, ni, nisem, nismo, niso, niste, sem, si, smo, so, sta, ste, sva.

The 2nd highest number of forms (18) was observed with the lemma “imeti”: ima, imajo, imam, imamo, imata, imate, imaš, imejte, imel, imela, imele, imeli, imeti, nima, nimajo, nimam, nimamo, nimaš.

The 3rd highest number of forms (16) was observed with the lemma “iti”: gre, gredo, grejo, grem, gremo, gresta, greste, greš, idem, iti, pojdi, šel, šla, šle, šli, šlo.

VERB occurs with 8 features: sl-feat/VerbForm (2581; 100% instances), sl-feat/Number (2421; 94% instances), sl-feat/Aspect (1874; 73% instances), sl-feat/Mood (1652; 64% instances), sl-feat/Person (1645; 64% instances), sl-feat/Tense (1507; 58% instances), sl-feat/Gender (776; 30% instances), sl-feat/Polarity (499; 19% instances)

VERB occurs with 22 feature-value pairs: Aspect=Imp, Aspect=Perf, Gender=Fem, Gender=Masc, Gender=Neut, Mood=Cnd, Mood=Imp, Mood=Ind, Number=Dual, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Polarity=Neg, Polarity=Pos, Tense=Fut, Tense=Pres, VerbForm=Fin, VerbForm=Inf, VerbForm=Part, VerbForm=Sup

VERB occurs with 90 feature combinations. The most frequent feature combination is Aspect=Imp|Mood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin (197 tokens). Examples: zdi, more, mora, drži, govori, piše, teče, ve, hodi, stoji

Relations

VERB nodes are attached to their parents using 14 different relations: sl-dep/root (1000; 39% instances), sl-dep/parataxis (438; 17% instances), sl-dep/conj (226; 9% instances), sl-dep/advcl (184; 7% instances), sl-dep/acl (169; 7% instances), sl-dep/parataxis:discourse (143; 6% instances), sl-dep/ccomp (138; 5% instances), sl-dep/xcomp (115; 4% instances), sl-dep/reparandum (53; 2% instances), sl-dep/parataxis:restart (50; 2% instances), sl-dep/csubj (45; 2% instances), sl-dep/fixed (12; 0% instances), sl-dep/conj:extend (4; 0% instances), sl-dep/dislocated (4; 0% instances)

Parents of VERB nodes belong to 15 different parts of speech: VERB (1088; 42% instances), ROOT (1000; 39% instances), NOUN (242; 9% instances), ADJ (81; 3% instances), DET (61; 2% instances), ADV (47; 2% instances), PRON (28; 1% instances), PROPN (12; 0% instances), NUM (8; 0% instances), PART (5; 0% instances), AUX (3; 0% instances), CCONJ (2; 0% instances), X (2; 0% instances), INTJ (1; 0% instances), SCONJ (1; 0% instances)

198 (8%) VERB nodes are leaves.

251 (10%) VERB nodes have one child.

414 (16%) VERB nodes have two children.

1718 (67%) VERB nodes have three or more children.

The highest child degree of a VERB node is 14.

Children of VERB nodes are attached using 31 different relations: sl-dep/advmod (1862; 20% instances), sl-dep/obj (847; 9% instances), sl-dep/obl (793; 9% instances), sl-dep/aux (728; 8% instances), sl-dep/nsubj (727; 8% instances), sl-dep/discourse (626; 7% instances), sl-dep/mark (577; 6% instances), sl-dep/punct (549; 6% instances), sl-dep/parataxis (452; 5% instances), sl-dep/cc (395; 4% instances), sl-dep/expl (294; 3% instances), sl-dep/conj (232; 3% instances), sl-dep/discourse:filler (226; 2% instances), sl-dep/ccomp (184; 2% instances), sl-dep/advcl (156; 2% instances), sl-dep/xcomp (156; 2% instances), sl-dep/reparandum (139; 1% instances), sl-dep/parataxis:discourse (90; 1% instances), sl-dep/iobj (60; 1% instances), sl-dep/parataxis:restart (47; 1% instances), sl-dep/vocative (34; 0% instances), sl-dep/csubj (31; 0% instances), sl-dep/dislocated (27; 0% instances), sl-dep/conj:extend (19; 0% instances), sl-dep/case (6; 0% instances), sl-dep/acl (5; 0% instances), sl-dep/cc:preconj (5; 0% instances), sl-dep/cop (2; 0% instances), sl-dep/det (2; 0% instances), sl-dep/fixed (2; 0% instances), sl-dep/nummod (2; 0% instances)

Children of VERB nodes belong to 16 different parts of speech: NOUN (1368; 15% instances), ADV (1182; 13% instances), VERB (1088; 12% instances), PART (1064; 11% instances), PRON (918; 10% instances), AUX (758; 8% instances), CCONJ (574; 6% instances), SCONJ (562; 6% instances), X (423; 5% instances), DET (411; 4% instances), INTJ (294; 3% instances), PUNCT (197; 2% instances), ADJ (185; 2% instances), PROPN (181; 2% instances), NUM (53; 1% instances), ADP (17; 0% instances)


VERB in other languages: [am] [ar] [bg] [bxr] [ca] [ckb] [cop] [cs] [cu] [da] [de] [el] [en] [es] [et] [eu] [fa] [fi] [fo] [fr] [ga] [gl] [got] [grc] [he] [hi] [hr] [hu] [id] [it] [ja] [kk] [kmr] [ko] [la] [lv] [mr] [nl] [no] [pl] [pt] [ro] [ru] [sa] [sk] [sla] [sl] [so] [sr] [sv] [swl] [ta] [tr] [ug] [uk] [u] [urj] [ur] [vi] [yue] [zh]