home no/pos edit page issue tracker

This page still pertains to UD version 1.

VERB: verb

#####Definition A verb is a member of the syntactic class of words that typically signal events and actions, can constitute a minimal predicate in a clause, and govern the number and types of other constituents which may occur in the clause. Note that the VERB tag covers main verbs (content verbs) and copulas but it does not cover auxiliary verbs, for which there is the AUX tag.

In Norwegian, modal verbs occurring alone, as in skal hjem lit. shall home “will go home” are tagged as VERB, but otherwise they will be AUX. Participles are annotated as verbs when they occur with auxiliaries ha “have”, “get” or a modal auxiliary. When the participle occurs with the copula or bli “become” or follows a head noun, the participle may be either adjective or verb (for syntactic tests to determine these cases, see Kinn et. al.).

#####Examples

#####References Kari Kinn, Per Erik Solberg and Pål Kristian Eriksen. “NDT Guidelines for Morphological Annotation”. National Library Tech Report.


Treebank Statistics (UD_Norwegian-Bokmaal)

There are 1980 VERB lemmas (9%), 4257 VERB types (13%) and 30022 VERB tokens (11%). Out of 17 observed tags, the rank of VERB is: 4 in number of lemmas, 4 in number of types and 4 in number of tokens.

The 10 most frequent VERB lemmas: ha, si, bli, få, komme, gjøre, være, ta, gå, se

The 10 most frequent VERB types: har, sier, er, blir, kommer, går, ha, få, bli, ta

The 10 most frequent ambiguous lemmas: ha (AUX 2564, VERB 1617, X 1), si (VERB 1236, ADJ 3), bli (AUX 983, VERB 928), (VERB 839, AUX 230, ADJ 87), komme (VERB 710, ADJ 8), gjøre (VERB 700, ADJ 2), være (AUX 7334, VERB 685, ADJ 1), ta (VERB 680, ADJ 2, X 1), (VERB 659, ADJ 3), se (VERB 620, ADJ 7)

The 10 most frequent ambiguous types: har (AUX 1999, VERB 994), er (AUX 5005, VERB 450, X 4, DET 2), blir (VERB 307, AUX 201, X 2), går (VERB 269, NOUN 52), ha (VERB 260, AUX 194, X 1), (VERB 254, AUX 74, ADJ 62), bli (VERB 250, AUX 144), ta (VERB 235, X 1), får (VERB 230, AUX 80), ble (AUX 514, VERB 228)

Morphology

The form / lemma ratio of VERB is 2.150000 (the average of all parts of speech is 1.383513).

The 1st highest number of forms (7) was observed with the lemma “fortelle”: Fortell, Fotelle, fortalt, fortalte, fortelle, forteller, fortelles.

The 2nd highest number of forms (7) was observed with the lemma “kalle”: Kall, kalle, kaller, kalles, kallet, kalt, kalte.

The 3rd highest number of forms (7) was observed with the lemma “lage”: laga, lagd, lagde, lage, lager, lages, laget.

VERB occurs with 7 features: no-feat/VerbForm (30022; 100% instances), no-feat/Mood (17436; 58% instances), no-feat/Tense (17225; 57% instances), no-feat/Voice (1047; 3% instances), no-feat/Abbr (19; 0% instances), no-feat/Definite (1; 0% instances), no-feat/Number (1; 0% instances)

VERB occurs with 11 feature-value pairs: Abbr=Yes, Definite=Ind, Mood=Imp, Mood=Ind, Number=Sing, Tense=Past, Tense=Pres, VerbForm=Fin, VerbForm=Inf, VerbForm=Part, Voice=Pass

VERB occurs with 10 feature combinations. The most frequent feature combination is Mood=Ind|Tense=Pres|VerbForm=Fin (11813 tokens). Examples: har, sier, er, blir, kommer, går, mener, får, ser, gjør

Relations

VERB nodes are attached to their parents using 17 different relations: no-dep/root (12363; 41% instances), no-dep/advcl (4329; 14% instances), no-dep/conj (3721; 12% instances), no-dep/acl:relcl (3442; 11% instances), no-dep/ccomp (1539; 5% instances), no-dep/acl (1234; 4% instances), no-dep/xcomp (1180; 4% instances), no-dep/parataxis (1117; 4% instances), no-dep/csubj (833; 3% instances), no-dep/appos (126; 0% instances), no-dep/nmod (62; 0% instances), no-dep/orphan (33; 0% instances), no-dep/flat:name (13; 0% instances), no-dep/goeswith (13; 0% instances), no-dep/compound (9; 0% instances), no-dep/csubj:pass (7; 0% instances), no-dep/iobj (1; 0% instances)

Parents of VERB nodes belong to 14 different parts of speech: ROOT (12363; 41% instances), VERB (9812; 33% instances), NOUN (4304; 14% instances), ADJ (1814; 6% instances), PRON (803; 3% instances), ADV (398; 1% instances), PROPN (315; 1% instances), DET (79; 0% instances), ADP (68; 0% instances), NUM (35; 0% instances), X (12; 0% instances), AUX (11; 0% instances), INTJ (7; 0% instances), CCONJ (1; 0% instances)

106 (0%) VERB nodes are leaves.

1905 (6%) VERB nodes have one child.

3740 (12%) VERB nodes have two children.

24271 (81%) VERB nodes have three or more children.

The highest child degree of a VERB node is 14.

Children of VERB nodes are attached using 32 different relations: no-dep/nsubj (20865; 18% instances), no-dep/punct (18578; 16% instances), no-dep/obl (13843; 12% instances), no-dep/obj (12392; 10% instances), no-dep/mark (11495; 10% instances), no-dep/advmod (9016; 8% instances), no-dep/aux (7296; 6% instances), no-dep/cc (3746; 3% instances), no-dep/conj (3570; 3% instances), no-dep/xcomp (3433; 3% instances), no-dep/advcl (3407; 3% instances), no-dep/compound:prt (2308; 2% instances), no-dep/ccomp (2005; 2% instances), no-dep/expl (1683; 1% instances), no-dep/nsubj:pass (1665; 1% instances), no-dep/aux:pass (983; 1% instances), no-dep/parataxis (855; 1% instances), no-dep/iobj (634; 1% instances), no-dep/cop (336; 0% instances), no-dep/csubj (252; 0% instances), no-dep/case (243; 0% instances), no-dep/dislocated (104; 0% instances), no-dep/nmod (104; 0% instances), no-dep/discourse (72; 0% instances), no-dep/appos (43; 0% instances), no-dep/goeswith (38; 0% instances), no-dep/orphan (28; 0% instances), no-dep/nummod (15; 0% instances), no-dep/flat:name (9; 0% instances), no-dep/csubj:pass (7; 0% instances), no-dep/acl (1; 0% instances), no-dep/compound (1; 0% instances)

Children of VERB nodes belong to 17 different parts of speech: NOUN (28487; 24% instances), PUNCT (18578; 16% instances), PRON (16918; 14% instances), VERB (9812; 8% instances), AUX (8615; 7% instances), ADV (6377; 5% instances), PROPN (6126; 5% instances), ADP (5620; 5% instances), ADJ (5247; 4% instances), SCONJ (4817; 4% instances), CCONJ (3749; 3% instances), PART (3736; 3% instances), NUM (479; 0% instances), DET (374; 0% instances), INTJ (72; 0% instances), X (19; 0% instances), SYM (1; 0% instances)


Treebank Statistics (UD_Norwegian-Nynorsk)

There are 1686 VERB lemmas (7%), 3729 VERB types (12%) and 26992 VERB tokens (10%). Out of 17 observed tags, the rank of VERB is: 4 in number of lemmas, 4 in number of types and 4 in number of tokens.

The 10 most frequent VERB lemmas: ha, seie, få, verte, vere, kome, gå, gjere, ta, sjå

The 10 most frequent VERB types: har, seier, vart, er, få, kjem, ha, meiner, får, går

The 10 most frequent ambiguous lemmas: ha (AUX 2500, VERB 1633, INTJ 4, ADJ 1, X 1), seie (VERB 1205, ADJ 20), (VERB 1098, AUX 230, ADJ 84, X 1), verte (VERB 936, ADJ 8), vere (AUX 7084, VERB 783, ADJ 2), kome (VERB 764, ADJ 40), (VERB 756, ADJ 4, X 2), gjere (VERB 667, ADJ 54), ta (VERB 606, ADJ 51, ADP 1), sjå (VERB 604, ADJ 30)

The 10 most frequent ambiguous types: har (AUX 2006, VERB 974, X 6, SCONJ 1), vart (VERB 519, ADJ 1, AUX 1), er (AUX 4799, VERB 507, X 12, NOUN 1), (VERB 341, AUX 77, ADJ 55, X 1), ha (VERB 310, AUX 187, INTJ 4, PRON 2, X 1), får (VERB 304, AUX 77, X 1), går (VERB 280, NOUN 22, X 2), blir (AUX 311, VERB 268), fekk (VERB 270, AUX 53), ta (VERB 243, ADP 1)

Morphology

The form / lemma ratio of VERB is 2.211744 (the average of all parts of speech is 1.343969).

The 1st highest number of forms (10) was observed with the lemma “følgje”: følg, følgd, følgde, følge, følgja, følgjast, følgje, følgjer, følgt, følgte.

The 2nd highest number of forms (10) was observed with the lemma “la”: Lat, la, lar, late, latt, let, lot, lèt, lét, lête.

The 3rd highest number of forms (10) was observed with the lemma “seie”: sa, sagt, seg, segja, sei, seia, seiast, seie, seier, sier.

VERB occurs with 7 features: no-feat/VerbForm (26992; 100% instances), no-feat/Mood (16696; 62% instances), no-feat/Tense (16435; 61% instances), no-feat/Abbr (34; 0% instances), no-feat/Definite (3; 0% instances), no-feat/Number (3; 0% instances), no-feat/Gender (2; 0% instances)

VERB occurs with 11 feature-value pairs: Abbr=Yes, Definite=Ind, Gender=Neut, Mood=Imp, Mood=Ind, Number=Sing, Tense=Past, Tense=Pres, VerbForm=Fin, VerbForm=Inf, VerbForm=Part

VERB occurs with 8 feature combinations. The most frequent feature combination is Mood=Ind|Tense=Pres|VerbForm=Fin (11401 tokens). Examples: har, seier, er, kjem, meiner, får, går, blir, vert, ser

Relations

VERB nodes are attached to their parents using 21 different relations: no-dep/root (9908; 37% instances), no-dep/advcl (3987; 15% instances), no-dep/conj (3301; 12% instances), no-dep/acl:relcl (3050; 11% instances), no-dep/parataxis (1395; 5% instances), no-dep/ccomp (1350; 5% instances), no-dep/acl (1303; 5% instances), no-dep/xcomp (1008; 4% instances), no-dep/csubj (835; 3% instances), no-dep/aux:pass (560; 2% instances), no-dep/appos (157; 1% instances), no-dep/nmod (57; 0% instances), no-dep/flat:name (34; 0% instances), no-dep/orphan (18; 0% instances), no-dep/goeswith (10; 0% instances), no-dep/advmod (5; 0% instances), no-dep/iobj (5; 0% instances), no-dep/amod (3; 0% instances), no-dep/csubj:pass (3; 0% instances), no-dep/compound (2; 0% instances), no-dep/obl (1; 0% instances)

Parents of VERB nodes belong to 13 different parts of speech: ROOT (9908; 37% instances), VERB (8546; 32% instances), NOUN (4151; 15% instances), ADJ (2727; 10% instances), PRON (760; 3% instances), ADV (454; 2% instances), PROPN (214; 1% instances), ADP (92; 0% instances), DET (78; 0% instances), NUM (34; 0% instances), X (17; 0% instances), INTJ (8; 0% instances), AUX (3; 0% instances)

666 (2%) VERB nodes are leaves.

1750 (6%) VERB nodes have one child.

3314 (12%) VERB nodes have two children.

21262 (79%) VERB nodes have three or more children.

The highest child degree of a VERB node is 14.

Children of VERB nodes are attached using 31 different relations: no-dep/nsubj (19872; 19% instances), no-dep/punct (15553; 15% instances), no-dep/obj (11831; 11% instances), no-dep/obl (11635; 11% instances), no-dep/mark (10993; 10% instances), no-dep/advmod (7799; 7% instances), no-dep/aux (6227; 6% instances), no-dep/cc (3529; 3% instances), no-dep/conj (3211; 3% instances), no-dep/xcomp (3117; 3% instances), no-dep/advcl (2927; 3% instances), no-dep/compound:prt (2585; 2% instances), no-dep/ccomp (1749; 2% instances), no-dep/expl (1667; 2% instances), no-dep/parataxis (944; 1% instances), no-dep/iobj (617; 1% instances), no-dep/cop (342; 0% instances), no-dep/csubj (255; 0% instances), no-dep/case (209; 0% instances), no-dep/nmod (151; 0% instances), no-dep/discourse (88; 0% instances), no-dep/dislocated (79; 0% instances), no-dep/appos (58; 0% instances), no-dep/nummod (26; 0% instances), no-dep/goeswith (25; 0% instances), no-dep/orphan (12; 0% instances), no-dep/acl (3; 0% instances), no-dep/aux:pass (2; 0% instances), no-dep/flat:name (2; 0% instances), no-dep/compound (1; 0% instances), no-dep/nsubj:pass (1; 0% instances)

Children of VERB nodes belong to 17 different parts of speech: NOUN (27207; 26% instances), PUNCT (15553; 15% instances), PRON (14487; 14% instances), VERB (8546; 8% instances), AUX (6571; 6% instances), ADP (5873; 6% instances), ADV (5394; 5% instances), ADJ (5155; 5% instances), SCONJ (4423; 4% instances), PROPN (4180; 4% instances), PART (3634; 3% instances), CCONJ (3531; 3% instances), NUM (435; 0% instances), DET (396; 0% instances), INTJ (93; 0% instances), X (31; 0% instances), SYM (1; 0% instances)


VERB in other languages: [am] [ar] [bg] [bxr] [ca] [ckb] [cop] [cs] [cu] [da] [de] [el] [en] [es] [et] [eu] [fa] [fi] [fo] [fr] [ga] [gl] [got] [grc] [he] [hi] [hr] [hu] [id] [it] [ja] [kk] [kmr] [ko] [la] [lv] [mr] [nl] [no] [pl] [pt] [ro] [ru] [sa] [sk] [sla] [sl] [so] [sr] [sv] [swl] [ta] [tr] [ug] [uk] [u] [urj] [ur] [vi] [yue] [zh]