VERB
: verb
Definition
A verb is a member of the syntactic class of words that typically signal events and actions, can constitute a minimal predicate in a clause, and govern the number and types of other constituents which may occur in the clause.
In Slovenian, the VERB
tag covers all verbs (including content, modal and copula verbs), except for the auxiliary verb biti “to be”, which is tagged as AUX.
Word forms that etymologically derive from verbs, but have different syntactic properties, such as adjectival participles (ukraden “stolen”, pokrit “covered”), transgressives (upoštevaje “taking into account”, začenši “starting”) and gerunds (govorjenje “speaking”, zavrnitev “rejection”, gretje “heating”), are marked as adjectives, adverbs or nouns respectively.
Examples
- imeti “to have”, vedeti “to know”, dobiti “to get”
- morati “to must”, moči “to be able to”, postati “to become”
- začeti “to start, iti “to go”, priti “to come”
Conversion from JOS
All verbs with Type=main have been converted to VERB
. Additionally, those instances of verb biti with Type=auxiliary that do not bear the PPart dependency relation to a main verb have also been converted to VERB
.
Treebank Statistics (UD_Slovenian)
There are 2247 VERB
lemmas (14%), 5700 VERB
types (19%) and 13031 VERB
tokens (10%).
Out of 16 observed tags, the rank of VERB
is: 4 in number of lemmas, 3 in number of types and 4 in number of tokens.
The 10 most frequent VERB
lemmas: biti, imeti, morati, iti, začeti, priti, moči, vedeti, dobiti, povedati
The 10 most frequent VERB
types: je, ima, bilo, ni, gre, imajo, bo, so, mora, bila
The 10 most frequent ambiguous lemmas: biti (AUX 8949, VERB 875), peti (ADJ 7, VERB 4)
The 10 most frequent ambiguous types: je (AUX 3703, VERB 332, PRON 13), bilo (VERB 112, AUX 106), ni (AUX 345, VERB 93), bo (AUX 438, VERB 66), so (AUX 1362, VERB 62), mora (VERB 48, NOUN 2), bila (AUX 187, VERB 44), pomeni (VERB 39, NOUN 1), bil (AUX 196, VERB 34), pravi (VERB 29, ADJ 21)
- je
- bilo
- ni
- bo
- so
- mora
- bila
- pomeni
- bil
- pravi
Morphology
The form / lemma ratio of VERB
is 2.536716 (the average of all parts of speech is 1.870691).
The 1st highest number of forms (28) was observed with the lemma “biti”: Sva, bi, bijejo, bil, bila, bile, bili, bilo, biti, bla, blo, bo, bodo, bom, boste, je, ni, nisem, nisi, nismo, niso, niste, sem, si, smo, so, sta, ste.
The 2nd highest number of forms (21) was observed with the lemma “imeti”: ima, imajo, imam, imamo, imata, imate, imava, imaš, imejte, imel, imela, imele, imeli, imelo, imeti, nima, nimajo, nimam, nimamo, nimate, nimaš.
The 3rd highest number of forms (17) was observed with the lemma “hoteti”: Hočeš, Nočemo, hotel, hotela, hotele, hoteli, hoče, hočejo, hočem, hočemo, hočeta, hočete, noče, nočejo, nočem, nočete, nočeš.
VERB
occurs with 8 features: sl-feat/VerbForm (13031; 100% instances), sl-feat/Number (11811; 91% instances), sl-feat/Aspect (11189; 86% instances), sl-feat/Gender (6228; 48% instances), sl-feat/Mood (5584; 43% instances), sl-feat/Person (5583; 43% instances), sl-feat/Tense (5328; 41% instances), sl-feat/Polarity (978; 8% instances)
VERB
occurs with 22 feature-value pairs: Aspect=Imp
, Aspect=Perf
, Gender=Fem
, Gender=Masc
, Gender=Neut
, Mood=Cnd
, Mood=Imp
, Mood=Ind
, Number=Dual
, Number=Plur
, Number=Sing
, Person=1
, Person=2
, Person=3
, Polarity=Neg
, Polarity=Pos
, Tense=Fut
, Tense=Pres
, VerbForm=Fin
, VerbForm=Inf
, VerbForm=Part
, VerbForm=Sup
VERB
occurs with 104 feature combinations.
The most frequent feature combination is Aspect=Perf|Gender=Masc|Number=Sing|VerbForm=Part
(1391 tokens).
Examples: povedal, dejal, rekel, postal, začel, dobil, prišel, odločil, napisal, pogledal
Relations
VERB
nodes are attached to their parents using 9 different relations: sl-dep/root (5597; 43% instances), sl-dep/acl (1788; 14% instances), sl-dep/conj (1407; 11% instances), sl-dep/parataxis (1101; 8% instances), sl-dep/advcl (980; 8% instances), sl-dep/ccomp (870; 7% instances), sl-dep/xcomp (816; 6% instances), sl-dep/csubj (468; 4% instances), sl-dep/fixed (4; 0% instances)
Parents of VERB
nodes belong to 11 different parts of speech: ROOT (5597; 43% instances), VERB (4802; 37% instances), NOUN (1674; 13% instances), ADJ (666; 5% instances), DET (168; 1% instances), PROPN (91; 1% instances), PRON (20; 0% instances), NUM (9; 0% instances), X (2; 0% instances), ADV (1; 0% instances), AUX (1; 0% instances)
54 (0%) VERB
nodes are leaves.
378 (3%) VERB
nodes have one child.
1117 (9%) VERB
nodes have two children.
11482 (88%) VERB
nodes have three or more children.
The highest child degree of a VERB
node is 12.
Children of VERB
nodes are attached using 21 different relations: sl-dep/punct (11578; 20% instances), sl-dep/obl (7632; 13% instances), sl-dep/advmod (7602; 13% instances), sl-dep/obj (6170; 11% instances), sl-dep/aux (5787; 10% instances), sl-dep/nsubj (4968; 9% instances), sl-dep/mark (3505; 6% instances), sl-dep/expl (2063; 4% instances), sl-dep/cc (1687; 3% instances), sl-dep/conj (1344; 2% instances), sl-dep/parataxis (1178; 2% instances), sl-dep/xcomp (1059; 2% instances), sl-dep/ccomp (1019; 2% instances), sl-dep/advcl (990; 2% instances), sl-dep/iobj (555; 1% instances), sl-dep/csubj (257; 0% instances), sl-dep/discourse (46; 0% instances), sl-dep/dep (6; 0% instances), sl-dep/nmod (6; 0% instances), sl-dep/cc:preconj (1; 0% instances), sl-dep/fixed (1; 0% instances)
Children of VERB
nodes belong to 16 different parts of speech: NOUN (14561; 25% instances), PUNCT (11578; 20% instances), AUX (5779; 10% instances), VERB (4802; 8% instances), PRON (4471; 8% instances), ADV (3963; 7% instances), SCONJ (3307; 6% instances), PART (2879; 5% instances), CCONJ (2457; 4% instances), PROPN (1410; 2% instances), DET (1093; 2% instances), ADJ (972; 2% instances), NUM (130; 0% instances), X (29; 0% instances), ADP (12; 0% instances), INTJ (11; 0% instances)
Treebank Statistics (UD_Slovenian-SST)
There are 570 VERB
lemmas (18%), 1084 VERB
types (23%) and 2581 VERB
tokens (13%).
Out of 16 observed tags, the rank of VERB
is: 2 in number of lemmas, 2 in number of types and 1 in number of tokens.
The 10 most frequent VERB
lemmas: biti, imeti, vedeti, iti, reči, misliti, dati, videti, priti, morati
The 10 most frequent VERB
types: je, veš, vem, mislim, bilo, ni, recimo, so, ima, bo
The 10 most frequent ambiguous lemmas: biti (AUX 1267, VERB 449), peti (ADJ 5, VERB 2)
The 10 most frequent ambiguous types: je (AUX 461, VERB 196, INTJ 2, PRON 2), mislim (VERB 53, NOUN 1), bilo (VERB 36, AUX 14), ni (AUX 44, VERB 36), so (AUX 135, VERB 29, X 2), bo (AUX 65, VERB 22), bil (AUX 25, VERB 21), pravi (VERB 18, ADJ 1), si (AUX 34, PRON 33, VERB 16), bom (AUX 15, VERB 14)
- je
- AUX 461: ja lionizem je tudi morda največja socialna mreža v svetu
- VERB 196: na vrhu je tako kot si rekla en šef lahko sta tudi dva
- INTJ 2: veš kadar sem prišel v maribor ne je zdaj pa macdonald’s tu ne je zdaj pa bom jaz tu non stop ne ja ja prvi prvi dan sem že šel ne pa te naslednji teden teden sem tudi šel pa za mesec dni sem tudi hodil pa non stop sem hodil zdaj pa niti ne povoham ga dokler grem mimo ker mi je totalno out no eh hodi v pizdo [gap]
- PRON 2: sploh če je ne prebereš mislim tako da jo prebereš
- mislim
- bilo
- ni
- so
- bo
- bil
- pravi
- si
- bom
Morphology
The form / lemma ratio of VERB
is 1.901754 (the average of all parts of speech is 1.494596).
The 1st highest number of forms (28) was observed with the lemma “biti”: bi, bil, bila, bile, bili, bilo, biti, bo, bodo, bojo, bom, bomo, bosta, bova, boš, je, ni, nisem, nismo, niso, niste, sem, si, smo, so, sta, ste, sva.
The 2nd highest number of forms (18) was observed with the lemma “imeti”: ima, imajo, imam, imamo, imata, imate, imaš, imejte, imel, imela, imele, imeli, imeti, nima, nimajo, nimam, nimamo, nimaš.
The 3rd highest number of forms (16) was observed with the lemma “iti”: gre, gredo, grejo, grem, gremo, gresta, greste, greš, idem, iti, pojdi, šel, šla, šle, šli, šlo.
VERB
occurs with 8 features: sl-feat/VerbForm (2581; 100% instances), sl-feat/Number (2421; 94% instances), sl-feat/Aspect (1874; 73% instances), sl-feat/Mood (1652; 64% instances), sl-feat/Person (1645; 64% instances), sl-feat/Tense (1507; 58% instances), sl-feat/Gender (776; 30% instances), sl-feat/Polarity (499; 19% instances)
VERB
occurs with 22 feature-value pairs: Aspect=Imp
, Aspect=Perf
, Gender=Fem
, Gender=Masc
, Gender=Neut
, Mood=Cnd
, Mood=Imp
, Mood=Ind
, Number=Dual
, Number=Plur
, Number=Sing
, Person=1
, Person=2
, Person=3
, Polarity=Neg
, Polarity=Pos
, Tense=Fut
, Tense=Pres
, VerbForm=Fin
, VerbForm=Inf
, VerbForm=Part
, VerbForm=Sup
VERB
occurs with 90 feature combinations.
The most frequent feature combination is Aspect=Imp|Mood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin
(197 tokens).
Examples: zdi, more, mora, drži, govori, piše, teče, ve, hodi, stoji
Relations
VERB
nodes are attached to their parents using 14 different relations: sl-dep/root (1000; 39% instances), sl-dep/parataxis (438; 17% instances), sl-dep/conj (226; 9% instances), sl-dep/advcl (184; 7% instances), sl-dep/acl (169; 7% instances), sl-dep/parataxis:discourse (143; 6% instances), sl-dep/ccomp (138; 5% instances), sl-dep/xcomp (115; 4% instances), sl-dep/reparandum (53; 2% instances), sl-dep/parataxis:restart (50; 2% instances), sl-dep/csubj (45; 2% instances), sl-dep/fixed (12; 0% instances), sl-dep/conj:extend (4; 0% instances), sl-dep/dislocated (4; 0% instances)
Parents of VERB
nodes belong to 15 different parts of speech: VERB (1088; 42% instances), ROOT (1000; 39% instances), NOUN (242; 9% instances), ADJ (81; 3% instances), DET (61; 2% instances), ADV (47; 2% instances), PRON (28; 1% instances), PROPN (12; 0% instances), NUM (8; 0% instances), PART (5; 0% instances), AUX (3; 0% instances), CCONJ (2; 0% instances), X (2; 0% instances), INTJ (1; 0% instances), SCONJ (1; 0% instances)
198 (8%) VERB
nodes are leaves.
251 (10%) VERB
nodes have one child.
414 (16%) VERB
nodes have two children.
1718 (67%) VERB
nodes have three or more children.
The highest child degree of a VERB
node is 14.
Children of VERB
nodes are attached using 31 different relations: sl-dep/advmod (1862; 20% instances), sl-dep/obj (847; 9% instances), sl-dep/obl (793; 9% instances), sl-dep/aux (728; 8% instances), sl-dep/nsubj (727; 8% instances), sl-dep/discourse (626; 7% instances), sl-dep/mark (577; 6% instances), sl-dep/punct (549; 6% instances), sl-dep/parataxis (452; 5% instances), sl-dep/cc (395; 4% instances), sl-dep/expl (294; 3% instances), sl-dep/conj (232; 3% instances), sl-dep/discourse:filler (226; 2% instances), sl-dep/ccomp (184; 2% instances), sl-dep/advcl (156; 2% instances), sl-dep/xcomp (156; 2% instances), sl-dep/reparandum (139; 1% instances), sl-dep/parataxis:discourse (90; 1% instances), sl-dep/iobj (60; 1% instances), sl-dep/parataxis:restart (47; 1% instances), sl-dep/vocative (34; 0% instances), sl-dep/csubj (31; 0% instances), sl-dep/dislocated (27; 0% instances), sl-dep/conj:extend (19; 0% instances), sl-dep/case (6; 0% instances), sl-dep/acl (5; 0% instances), sl-dep/cc:preconj (5; 0% instances), sl-dep/cop (2; 0% instances), sl-dep/det (2; 0% instances), sl-dep/fixed (2; 0% instances), sl-dep/nummod (2; 0% instances)
Children of VERB
nodes belong to 16 different parts of speech: NOUN (1368; 15% instances), ADV (1182; 13% instances), VERB (1088; 12% instances), PART (1064; 11% instances), PRON (918; 10% instances), AUX (758; 8% instances), CCONJ (574; 6% instances), SCONJ (562; 6% instances), X (423; 5% instances), DET (411; 4% instances), INTJ (294; 3% instances), PUNCT (197; 2% instances), ADJ (185; 2% instances), PROPN (181; 2% instances), NUM (53; 1% instances), ADP (17; 0% instances)
VERB in other languages: [am] [ar] [bg] [bxr] [ca] [ckb] [cop] [cs] [cu] [da] [de] [el] [en] [es] [et] [eu] [fa] [fi] [fo] [fr] [ga] [gl] [got] [grc] [he] [hi] [hr] [hu] [id] [it] [ja] [kk] [kmr] [ko] [la] [lv] [mr] [nl] [no] [pl] [pt] [ro] [ru] [sa] [sk] [sla] [sl] [so] [sr] [sv] [swl] [ta] [tr] [ug] [uk] [u] [urj] [ur] [vi] [yue] [zh]