home cs/pos edit page issue tracker

This page pertains to UD version 2.

VERB: verb

Definition

A verb is a member of the syntactic class of words that typically signal events and actions, can constitute a minimal predicate in a clause, and govern the number and types of other constituents which may occur in the clause.

Note that the VERB tag covers main verbs (content verbs) and modal verbs but it does not cover auxiliary verbs and copulas, for which there is the AUX tag. (Czech modal verbs are not considered auxiliary.) See the description of AUX for more information on the borderline between VERB and AUX.

Czech verbs can take the following morphological forms:

There are participial forms that are tagged as adjectives (ADJ) rather than verbs. See below for examples.

A verbal noun can be derived productively from almost every verb (e.g. dělat  “to do” → dělání  “doing”). While in other languages a corresponding form may be called gerund and tagged VERB, in Czech it is tagged NOUN. It has always the neuter cs-feat/Gender and it inflects for cs-feat/Number and cs-feat/Case.

Examples

Border cases

Passive participles lie on the border between verbs and adjectives. Since release 2.0, both short and long forms are tagged ADJ, although they may have verbal features in addition to the adjectival ones. For example:

Their meaning is almost identical but the usage slightly varies. Both groups can be used in nominal predication with copula. Only the short forms can be used to form the passive voice (but it may be sometimes difficult to distinguish from copula constructions, see AUX). On the other hand, the long forms inflect for case and thus can modify nouns. (Occasionally even the short form may inflect for case but it is extremely rare in the modern language. Example: nesenu is the short form of feminine singular accusative. The corresponding long form is nesenou.)

There is an analogy with some adjectives that preserved so called nominal (short) forms. And these adjectives are not derived from verbs. Example:

The nominal forms are used in predication, the standard forms both in predication and to modify nouns.

References


Treebank Statistics (UD_Czech)

There are 5446 VERB lemmas (10%), 19681 VERB types (16%) and 119672 VERB tokens (9%). Out of 17 observed tags, the rank of VERB is: 4 in number of lemmas, 4 in number of types and 5 in number of tokens.

The 10 most frequent VERB lemmas: mít, být, moci, muset, říci, stát, chtít, jít, lze, dát

The 10 most frequent VERB types: má, je, může, řekl, měl, mají, musí, jde, měla, lze

The 10 most frequent ambiguous lemmas: být (AUX 36186, VERB 4574), moci (VERB 3747, AUX 1), stát (VERB 1358, NOUN 1270, AUX 2), růst (NOUN 321, VERB 137), vzrůst (VERB 123, NOUN 11), jet (VERB 117, PROPN 5, NOUN 3), bývat (AUX 147, VERB 42), hledět (VERB 31, ADP 1), škodit (VERB 17, NOUN 1), rozlišit (VERB 12, NOUN 1)

The 10 most frequent ambiguous types: (VERB 1914, DET 15), je (AUX 8849, VERB 1773, PRON 793), jsou (AUX 2314, VERB 531), není (AUX 1070, VERB 286), bylo (AUX 1221, VERB 259), bude (AUX 2173, VERB 228), stal (VERB 215, AUX 1), byl (AUX 1755, VERB 188), stát (NOUN 259, VERB 200), být (AUX 1646, VERB 188)

Morphology

The form / lemma ratio of VERB is 3.613845 (the average of all parts of speech is 2.162583).

The 1st highest number of forms (44) was observed with the lemma “být”: Jsouc, bolo, bude, budeme, budete, budiž, budou, budu, buď, buďte, by, bych, bychom, byl, byla, byli, bylo, byly, bysme, být, býti, j, je, jest, jsem, jsi, jsme, jsou, jste, nebude, nebudeme, nebudeš, nebudou, nebudu, nebyl, nebyla, nebyli, nebylo, nebyly, nebýt, nejsem, nejsme, nejsou, není.

The 2nd highest number of forms (35) was observed with the lemma “stát”: Stojím, nestal, nestala, nestali, nestalo, nestaly, nestane, nestanou, nestojí, nestojíte, nestál, nestála, nestáli, nestálo, nestály, stal, stala, stali, stalo, staly, stane, stanete, stanou, stanu, stoje, stojí, stojíme, stál, stála, stáli, stálo, stály, stát, státi, stůj.

The 3rd highest number of forms (30) was observed with the lemma “jít”: Nejít, Pojď, Pojďme, jde, jdem, jdeme, jdete, jdou, jít, nejde, nejdeme, nejdou, nejdu, nepůjde, nepůjdou, nepůjdu, nešel, nešli, nešlo, nešly, půjde, půjdeme, půjdete, půjdou, půjdu, šel, šla, šli, šlo, šly.

VERB occurs with 14 features: cs-feat/VerbForm (119672; 100% instances), cs-feat/Polarity (119658; 100% instances), cs-feat/Number (98039; 82% instances), cs-feat/Tense (97291; 81% instances), cs-feat/Voice (97291; 81% instances), cs-feat/Aspect (72285; 60% instances), cs-feat/Mood (53459; 45% instances), cs-feat/Person (53450; 45% instances), cs-feat/Gender (44574; 37% instances), cs-feat/Animacy (11242; 9% instances), cs-feat/Style (158; 0% instances), cs-feat/Foreign (102; 0% instances), cs-feat/Abbr (19; 0% instances), cs-feat/NameType (13; 0% instances)

VERB occurs with 38 feature-value pairs: Abbr=Yes, Animacy=Anim, Animacy=Inan, Aspect=Imp, Aspect=Perf, Foreign=Yes, Gender=Fem, Gender=Fem,Masc, Gender=Fem,Neut, Gender=Masc, Gender=Neut, Mood=Cnd, Mood=Imp, Mood=Ind, NameType=Com, NameType=Oth, NameType=Pro, Number=Plur, Number=Plur,Sing, Number=Sing, Person=1, Person=2, Person=3, Polarity=Neg, Polarity=Pos, Style=Arch, Style=Coll, Style=Expr, Style=Rare, Style=Vrnc, Tense=Fut, Tense=Past, Tense=Pres, VerbForm=Conv, VerbForm=Fin, VerbForm=Inf, VerbForm=Part, Voice=Act

VERB occurs with 178 feature combinations. The most frequent feature combination is Aspect=Imp|Mood=Ind|Number=Sing|Person=3|Polarity=Pos|Tense=Pres|VerbForm=Fin|Voice=Act (15348 tokens). Examples: říká, patří, znamená, tvrdí, představuje, uvádí, nabízí, považuje, vyplývá, existuje

Relations

VERB nodes are attached to their parents using 19 different relations: cs-dep/root (52371; 44% instances), cs-dep/acl (17256; 14% instances), cs-dep/conj (15924; 13% instances), cs-dep/xcomp (12376; 10% instances), cs-dep/ccomp (7061; 6% instances), cs-dep/advcl (6782; 6% instances), cs-dep/csubj (4783; 4% instances), cs-dep/parataxis (1360; 1% instances), cs-dep/appos (623; 1% instances), cs-dep/dep (384; 0% instances), cs-dep/csubj:pass (348; 0% instances), cs-dep/orphan (170; 0% instances), cs-dep/cc (121; 0% instances), cs-dep/flat:foreign (61; 0% instances), cs-dep/case (23; 0% instances), cs-dep/advmod (22; 0% instances), cs-dep/fixed (4; 0% instances), cs-dep/mark (2; 0% instances), cs-dep/nmod (1; 0% instances)

Parents of VERB nodes belong to 17 different parts of speech: ROOT (52371; 44% instances), VERB (40607; 34% instances), NOUN (15801; 13% instances), ADJ (5566; 5% instances), DET (2537; 2% instances), PROPN (1646; 1% instances), ADV (626; 1% instances), PRON (211; 0% instances), NUM (172; 0% instances), PART (68; 0% instances), AUX (34; 0% instances), CCONJ (17; 0% instances), SCONJ (9; 0% instances), INTJ (4; 0% instances), ADP (1; 0% instances), PUNCT (1; 0% instances), SYM (1; 0% instances)

1385 (1%) VERB nodes are leaves.

8677 (7%) VERB nodes have one child.

13702 (11%) VERB nodes have two children.

95908 (80%) VERB nodes have three or more children.

The highest child degree of a VERB node is 19.

Children of VERB nodes are attached using 33 different relations: cs-dep/punct (102990; 23% instances), cs-dep/obj (65531; 14% instances), cs-dep/nsubj (63691; 14% instances), cs-dep/obl (59809; 13% instances), cs-dep/advmod (36482; 8% instances), cs-dep/conj (16715; 4% instances), cs-dep/mark (16059; 4% instances), cs-dep/cc (15866; 3% instances), cs-dep/expl:pv (14747; 3% instances), cs-dep/xcomp (13377; 3% instances), cs-dep/aux (11865; 3% instances), cs-dep/ccomp (8417; 2% instances), cs-dep/iobj (7477; 2% instances), cs-dep/advcl (6099; 1% instances), cs-dep/expl:pass (4338; 1% instances), cs-dep/nsubj:pass (2792; 1% instances), cs-dep/csubj (2452; 1% instances), cs-dep/dep (2141; 0% instances), cs-dep/advmod:emph (1395; 0% instances), cs-dep/parataxis (995; 0% instances), cs-dep/csubj:pass (348; 0% instances), cs-dep/nmod (308; 0% instances), cs-dep/discourse (267; 0% instances), cs-dep/appos (262; 0% instances), cs-dep/flat:foreign (57; 0% instances), cs-dep/orphan (53; 0% instances), cs-dep/vocative (52; 0% instances), cs-dep/amod (22; 0% instances), cs-dep/acl (15; 0% instances), cs-dep/det (10; 0% instances), cs-dep/nummod (8; 0% instances), cs-dep/cop (4; 0% instances), cs-dep/fixed (1; 0% instances)

Children of VERB nodes belong to 16 different parts of speech: NOUN (143925; 32% instances), PUNCT (102998; 23% instances), VERB (40607; 9% instances), ADV (38818; 9% instances), PRON (35203; 8% instances), PROPN (19956; 4% instances), DET (16782; 4% instances), CCONJ (16026; 4% instances), SCONJ (15588; 3% instances), AUX (11869; 3% instances), ADJ (8026; 2% instances), NUM (2926; 1% instances), PART (1746; 0% instances), ADP (83; 0% instances), SYM (62; 0% instances), INTJ (30; 0% instances)


Treebank Statistics (UD_Czech-CAC)

There are 3645 VERB lemmas (13%), 10526 VERB types (17%) and 39550 VERB tokens (8%). Out of 16 observed tags, the rank of VERB is: 3 in number of lemmas, 3 in number of types and 5 in number of tokens.

The 10 most frequent VERB lemmas: mít, být, moci, muset, jít, lze, stát, chtít, dát, pracovat

The 10 most frequent VERB types: je, má, mají, musí, může, jde, lze, jsou, mohou, můžeme

The 10 most frequent ambiguous lemmas: mít (VERB 2285, AUX 5), být (AUX 13843, VERB 1797), moci (VERB 1402, AUX 5), muset (VERB 671, AUX 2), stát (VERB 338, NOUN 169), znát (VERB 110, ADJ 1), vyžadovat (VERB 106, AUX 1), růst (NOUN 104, VERB 57), vzrůst (VERB 32, NOUN 13), bývat (AUX 84, VERB 29)

The 10 most frequent ambiguous types: je (AUX 4329, VERB 716, PRON 334), (VERB 717, DET 1, AUX 1), musí (VERB 363, AUX 1), může (VERB 365, AUX 1), jsou (AUX 1350, VERB 258), mohou (VERB 235, AUX 1), měl (VERB 138, AUX 2), měla (VERB 125, AUX 1), bylo (AUX 529, VERB 102), není (AUX 401, VERB 82)

Morphology

The form / lemma ratio of VERB is 2.887791 (the average of all parts of speech is 2.180683).

The 1st highest number of forms (30) was observed with the lemma “být”: Budiž, Buď, bude, budou, budu, byl, byla, byli, bylo, byly, být, býti, je, jest, jsem, jsme, jsou, jste, nebude, nebudu, nebyl, nebyla, nebyli, nebylo, nebyly, nebýt, nejsme, nejsou, nejste, není.

The 2nd highest number of forms (26) was observed with the lemma “moci”: moci, mohl, mohla, mohli, mohlo, mohly, mohou, mohu, může, můžeme, můžete, můžeš, můžu, nemohl, nemohla, nemohli, nemohlo, nemohly, nemohou, nemohu, nemůže, nemůžeme, nemůžete, nemůžeš, nemůžou, nemůžu.

The 3rd highest number of forms (26) was observed with the lemma “stát”: nestal, nestalo, nestaly, nestane, nestojí, nestál, nestálo, stal, stala, stali, stalo, staly, stane, staneme, stanou, staňte, stojí, stojíme, stál, stála, stáli, stálo, stály, stát, státi, stůj.

VERB occurs with 12 features: cs-feat/Polarity (39550; 100% instances), cs-feat/VerbForm (39550; 100% instances), cs-feat/Number (32146; 81% instances), cs-feat/Tense (31799; 80% instances), cs-feat/Voice (31799; 80% instances), cs-feat/Aspect (24299; 61% instances), cs-feat/Mood (21771; 55% instances), cs-feat/Person (21771; 55% instances), cs-feat/Gender (10369; 26% instances), cs-feat/Animacy (3195; 8% instances), cs-feat/Style (52; 0% instances), cs-feat/Foreign (7; 0% instances)

VERB occurs with 31 feature-value pairs: Animacy=Anim, Animacy=Inan, Aspect=Imp, Aspect=Perf, Foreign=Yes, Gender=Fem,Masc, Gender=Fem,Neut, Gender=Masc, Gender=Neut, Mood=Imp, Mood=Ind, Number=Plur, Number=Plur,Sing, Number=Sing, Person=1, Person=2, Person=3, Polarity=Neg, Polarity=Pos, Style=Arch, Style=Coll, Style=Rare, Style=Vrnc, Tense=Fut, Tense=Past, Tense=Pres, VerbForm=Conv, VerbForm=Fin, VerbForm=Inf, VerbForm=Part, Voice=Act

VERB occurs with 112 feature combinations. The most frequent feature combination is Aspect=Imp|Mood=Ind|Number=Sing|Person=3|Polarity=Pos|Tense=Pres|VerbForm=Fin|Voice=Act (6187 tokens). Examples: znamená, patří, odpovídá, stává, dochází, představuje, tvoří, ukazuje, říká, vyžaduje

Relations

VERB nodes are attached to their parents using 18 different relations: cs-dep/root (16599; 42% instances), cs-dep/conj (6783; 17% instances), cs-dep/acl (5814; 15% instances), cs-dep/xcomp (3613; 9% instances), cs-dep/advcl (2425; 6% instances), cs-dep/csubj (2132; 5% instances), cs-dep/ccomp (1230; 3% instances), cs-dep/parataxis (498; 1% instances), cs-dep/csubj:pass (124; 0% instances), cs-dep/dep (119; 0% instances), cs-dep/orphan (103; 0% instances), cs-dep/appos (62; 0% instances), cs-dep/cc (24; 0% instances), cs-dep/case (12; 0% instances), cs-dep/advmod (9; 0% instances), cs-dep/advmod:emph (1; 0% instances), cs-dep/flat:foreign (1; 0% instances), cs-dep/nmod (1; 0% instances)

Parents of VERB nodes belong to 16 different parts of speech: ROOT (16599; 42% instances), VERB (13270; 34% instances), NOUN (5544; 14% instances), ADJ (2367; 6% instances), DET (827; 2% instances), ADV (497; 1% instances), PROPN (158; 0% instances), PRON (92; 0% instances), SYM (65; 0% instances), NUM (47; 0% instances), SCONJ (39; 0% instances), PART (21; 0% instances), AUX (16; 0% instances), INTJ (4; 0% instances), CCONJ (3; 0% instances), PUNCT (1; 0% instances)

325 (1%) VERB nodes are leaves.

2540 (6%) VERB nodes have one child.

5075 (13%) VERB nodes have two children.

31610 (80%) VERB nodes have three or more children.

The highest child degree of a VERB node is 13.

Children of VERB nodes are attached using 32 different relations: cs-dep/punct (32514; 22% instances), cs-dep/obj (21903; 15% instances), cs-dep/obl (19570; 13% instances), cs-dep/nsubj (17953; 12% instances), cs-dep/advmod (12868; 9% instances), cs-dep/conj (7289; 5% instances), cs-dep/cc (5935; 4% instances), cs-dep/expl:pv (5661; 4% instances), cs-dep/mark (4938; 3% instances), cs-dep/xcomp (4023; 3% instances), cs-dep/aux (3013; 2% instances), cs-dep/advcl (2207; 1% instances), cs-dep/iobj (2076; 1% instances), cs-dep/expl:pass (2035; 1% instances), cs-dep/ccomp (1531; 1% instances), cs-dep/nsubj:pass (1406; 1% instances), cs-dep/csubj (894; 1% instances), cs-dep/dep (764; 1% instances), cs-dep/parataxis (337; 0% instances), cs-dep/advmod:emph (294; 0% instances), cs-dep/csubj:pass (101; 0% instances), cs-dep/nmod (89; 0% instances), cs-dep/discourse (77; 0% instances), cs-dep/appos (58; 0% instances), cs-dep/vocative (41; 0% instances), cs-dep/orphan (40; 0% instances), cs-dep/acl (22; 0% instances), cs-dep/amod (12; 0% instances), cs-dep/nummod (6; 0% instances), cs-dep/cop (4; 0% instances), cs-dep/det (4; 0% instances), cs-dep/flat:foreign (3; 0% instances)

Children of VERB nodes belong to 16 different parts of speech: NOUN (49615; 34% instances), PUNCT (32515; 22% instances), VERB (13270; 9% instances), ADV (13223; 9% instances), PRON (12909; 9% instances), CCONJ (5683; 4% instances), DET (5494; 4% instances), SCONJ (4846; 3% instances), AUX (3009; 2% instances), ADJ (2921; 2% instances), PROPN (2184; 1% instances), PART (820; 1% instances), NUM (651; 0% instances), SYM (504; 0% instances), ADP (20; 0% instances), INTJ (4; 0% instances)


Treebank Statistics (UD_Czech-CLTT)

There are 257 VERB lemmas (11%), 494 VERB types (12%) and 1428 VERB tokens (5%). Out of 15 observed tags, the rank of VERB is: 4 in number of lemmas, 4 in number of types and 6 in number of tokens.

The 10 most frequent VERB lemmas: obsahovat, uvést, účtovat, použít, moci, mít, vést, sestavovat, stanovit, rozumět

The 10 most frequent VERB types: obsahuje, uvede, rozumí, může, vést, účtuje, použijí, mohou, má, sestavují

The 10 most frequent ambiguous lemmas: být (AUX 412, VERB 11), stát (NOUN 25, VERB 7)

The 10 most frequent ambiguous types: je (AUX 133, PRON 8, VERB 7), delší (ADJ 14, VERB 4), koupí (NOUN 2, VERB 2), bude (AUX 12, VERB 1), daní (NOUN 2, VERB 1), jsou (AUX 110, VERB 1), nejsou (AUX 38, VERB 1), není (AUX 29, VERB 1), vlastní (ADJ 8, VERB 1)

Morphology

The form / lemma ratio of VERB is 1.922179 (the average of all parts of speech is 1.685169).

The 1st highest number of forms (10) was observed with the lemma “účtovat”: neúčtovala, neúčtovat, neúčtuje, neúčtují, účtovat, účtována, účtováno, účtovány, účtuje, účtují.

The 2nd highest number of forms (8) was observed with the lemma “použít”: nepoužije, nepoužijí, použije, použijí, použila, použita, použity, použít.

The 3rd highest number of forms (8) was observed with the lemma “uvést”: neuvede, uvede, uveden, uvedena, uvedeno, uvedeny, uvedou, uvést.

VERB occurs with 11 features: cs-feat/Polarity (1428; 100% instances), cs-feat/VerbForm (1428; 100% instances), cs-feat/Number (1224; 86% instances), cs-feat/Voice (1224; 86% instances), cs-feat/Tense (1046; 73% instances), cs-feat/Mood (966; 68% instances), cs-feat/Person (966; 68% instances), cs-feat/Gender (258; 18% instances), cs-feat/Animacy (109; 8% instances), cs-feat/Case (2; 0% instances), cs-feat/Style (1; 0% instances)

VERB occurs with 24 feature-value pairs: Animacy=Anim, Animacy=Inan, Case=Acc, Gender=Fem, Gender=Fem,Masc, Gender=Fem,Neut, Gender=Masc, Gender=Neut, Mood=Ind, Number=Plur, Number=Plur,Sing, Number=Sing, Person=3, Polarity=Neg, Polarity=Pos, Style=Arch, Tense=Fut, Tense=Past, Tense=Pres, VerbForm=Fin, VerbForm=Inf, VerbForm=Part, Voice=Act, Voice=Pass

VERB occurs with 22 feature combinations. The most frequent feature combination is Mood=Ind|Number=Sing|Person=3|Polarity=Pos|Tense=Pres|VerbForm=Fin|Voice=Act (586 tokens). Examples: obsahuje, uvede, rozumí, může, účtuje, má, lze, odpisuje, postupuje, sestavuje

Relations

VERB nodes are attached to their parents using 15 different relations: cs-dep/root (520; 36% instances), cs-dep/acl (365; 26% instances), cs-dep/conj (184; 13% instances), cs-dep/xcomp (127; 9% instances), cs-dep/advcl (88; 6% instances), cs-dep/parataxis (42; 3% instances), cs-dep/csubj (36; 3% instances), cs-dep/ccomp (31; 2% instances), cs-dep/dep (22; 2% instances), cs-dep/advmod (4; 0% instances), cs-dep/aux (3; 0% instances), cs-dep/cc (2; 0% instances), cs-dep/obl (2; 0% instances), cs-dep/appos (1; 0% instances), cs-dep/csubj:pass (1; 0% instances)

Parents of VERB nodes belong to 8 different parts of speech: ROOT (520; 36% instances), NOUN (428; 30% instances), VERB (379; 27% instances), ADJ (87; 6% instances), PRON (7; 0% instances), X (5; 0% instances), ADV (1; 0% instances), AUX (1; 0% instances)

9 (1%) VERB nodes are leaves.

102 (7%) VERB nodes have one child.

143 (10%) VERB nodes have two children.

1174 (82%) VERB nodes have three or more children.

The highest child degree of a VERB node is 10.

Children of VERB nodes are attached using 24 different relations: cs-dep/punct (1450; 25% instances), cs-dep/obl (868; 15% instances), cs-dep/obj (851; 15% instances), cs-dep/nsubj (667; 12% instances), cs-dep/advmod (354; 6% instances), cs-dep/nsubj:pass (261; 5% instances), cs-dep/expl:pass (243; 4% instances), cs-dep/conj (192; 3% instances), cs-dep/cc (140; 2% instances), cs-dep/mark (135; 2% instances), cs-dep/advcl (92; 2% instances), cs-dep/aux:pass (89; 2% instances), cs-dep/xcomp (87; 2% instances), cs-dep/cop (68; 1% instances), cs-dep/expl:pv (50; 1% instances), cs-dep/ccomp (34; 1% instances), cs-dep/aux (29; 1% instances), cs-dep/csubj (25; 0% instances), cs-dep/parataxis (17; 0% instances), cs-dep/dep (16; 0% instances), cs-dep/iobj (15; 0% instances), cs-dep/advmod:emph (4; 0% instances), cs-dep/appos (1; 0% instances), cs-dep/csubj:pass (1; 0% instances)

Children of VERB nodes belong to 12 different parts of speech: NOUN (2356; 41% instances), PUNCT (1450; 25% instances), PRON (648; 11% instances), VERB (379; 7% instances), ADV (231; 4% instances), AUX (181; 3% instances), CCONJ (138; 2% instances), SCONJ (132; 2% instances), X (87; 2% instances), ADJ (51; 1% instances), NUM (32; 1% instances), ADP (4; 0% instances)


VERB in other languages: [am] [ar] [bg] [bxr] [ca] [ckb] [cop] [cs] [cu] [da] [de] [el] [en] [es] [et] [eu] [fa] [fi] [fo] [fr] [ga] [gl] [got] [grc] [he] [hi] [hr] [hu] [id] [it] [ja] [kk] [kmr] [ko] [la] [lv] [mr] [nl] [no] [pl] [pt] [ro] [ru] [sa] [sk] [sla] [sl] [so] [sr] [sv] [swl] [ta] [tr] [ug] [uk] [u] [urj] [ur] [vi] [yue] [zh]