home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Czech-PUD: POS Tags: ADJ

There are 1174 ADJ lemmas (22%), 1751 ADJ types (23%) and 2284 ADJ tokens (12%). Out of 15 observed tags, the rank of ADJ is: 2 in number of lemmas, 2 in number of types and 3 in number of tokens.

The 10 most frequent ADJ lemmas: velký, nový, další, první, vysoký, druhý, poslední, americký, jiný, jižní

The 10 most frequent ADJ types: první, další, nové, jižní, druhé, severní, hlavní, velká, velké, vlastní

The 10 most frequent ambiguous lemmas: každý (ADJ 14, DET 2), známý (ADJ 9, NOUN 1), vedoucí (ADJ 5, NOUN 2), rád (ADJ 3, ADV 2), El (ADJ 2, PROPN 1), a (CCONJ 462, ADJ 2, ADP 1), Le (NOUN 1, ADJ 1), Santa (PROPN 1, ADJ 1), jeden (NUM 37, ADJ 1), sci-fi (NOUN 1, ADJ 1)

The 10 most frequent ambiguous types: vlastní (ADJ 8, VERB 2), každý (ADJ 6, DET 2), vedoucí (NOUN 1, ADJ 1), El (ADJ 2, PROPN 1), a (CCONJ 454, ADJ 2, ADP 1), Royal (NOUN 1, ADJ 1), Santa (PROPN 1, ADJ 1), sci-fi (NOUN 1, ADJ 1), známých (NOUN 1, ADJ 1)

Morphology

The form / lemma ratio of ADJ is 1.491482 (the average of all parts of speech is 1.427558).

The 1st highest number of forms (16) was observed with the lemma “velký”: největší, největším, největšími, velcí, velkou, velká, velké, velkého, velkém, velkému, velký, velkých, velkým, větší, větších, větším.

The 2nd highest number of forms (11) was observed with the lemma “vysoký”: Nejvyššího, nejvyšší, nejvyšším, vysoká, vysoké, vysokému, vysoký, vysokých, vysokým, vyšší, vyšších.

The 3rd highest number of forms (10) was observed with the lemma “dobrý”: Dobrá, Dobrými, dobrou, dobré, dobrých, lepší, lepším, nejlepší, nejlepších, nejlepším.

ADJ occurs with 20 features: Number (2245; 98% instances), Gender (2244; 98% instances), Polarity (2168; 95% instances), Case (2025; 89% instances), Degree (1883; 82% instances), Animacy (876; 38% instances), VerbForm (268; 12% instances), Voice (268; 12% instances), Variant (220; 10% instances), Aspect (184; 8% instances), NumType (76; 3% instances), Tense (63; 3% instances), NameType (56; 2% instances), Gender[psor] (39; 2% instances), Poss (39; 2% instances), Foreign (31; 1% instances), Style (6; 0% instances), Abbr (2; 0% instances), Hyph (2; 0% instances), NumValue (1; 0% instances)

ADJ occurs with 46 feature-value pairs: Abbr=Yes, Animacy=Anim, Animacy=Inan, Aspect=Imp, Aspect=Perf, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Degree=Cmp, Degree=Pos, Degree=Sup, Foreign=Yes, Gender=Fem, Gender=Fem,Masc, Gender=Fem,Neut, Gender=Masc, Gender=Neut, Gender[psor]=Fem, Gender[psor]=Masc, Hyph=Yes, NameType=Com, NameType=Geo, NameType=Giv, NameType=Pro, NameType=Sur, NumType=Ord, NumType=Sets, NumValue=1, Number=Dual, Number=Plur, Number=Plur,Sing, Number=Sing, Polarity=Neg, Polarity=Pos, Poss=Yes, Style=Coll, Style=Expr, Style=Rare, Tense=Pres, Variant=Short, VerbForm=Part, Voice=Act, Voice=Pass

ADJ occurs with 245 feature combinations. The most frequent feature combination is Case=Nom|Degree=Pos|Gender=Fem|Number=Sing|Polarity=Pos (155 tokens). Examples: velká, severní, každá, Průmyslová, makedonská, Federální, Francouzská, Jihoafrická, Jihokorejská, Nová

Relations

ADJ nodes are attached to their parents using 19 different relations: amod (1833; 80% instances), root (154; 7% instances), conj (91; 4% instances), acl (55; 2% instances), advcl (27; 1% instances), nmod (23; 1% instances), ccomp (22; 1% instances), xcomp (19; 1% instances), flat (12; 1% instances), nsubj (12; 1% instances), obj (7; 0% instances), parataxis (7; 0% instances), iobj (6; 0% instances), obl (6; 0% instances), compound (4; 0% instances), appos (3; 0% instances), discourse (1; 0% instances), fixed (1; 0% instances), nsubj:pass (1; 0% instances)

Parents of ADJ nodes belong to 11 different parts of speech: NOUN (1805; 79% instances), (154; 7% instances), VERB (117; 5% instances), PROPN (103; 5% instances), ADJ (66; 3% instances), PRON (12; 1% instances), ADV (9; 0% instances), DET (9; 0% instances), NUM (5; 0% instances), ADP (3; 0% instances), AUX (1; 0% instances)

1682 (74%) ADJ nodes are leaves.

207 (9%) ADJ nodes have one child.

58 (3%) ADJ nodes have two children.

337 (15%) ADJ nodes have three or more children.

The highest child degree of a ADJ node is 9.

Children of ADJ nodes are attached using 33 different relations: punct (374; 20% instances), advmod (182; 10% instances), obl (178; 9% instances), aux:pass (164; 9% instances), nmod (155; 8% instances), cop (149; 8% instances), nsubj:pass (127; 7% instances), nsubj (113; 6% instances), conj (98; 5% instances), cc (87; 5% instances), mark (49; 3% instances), flat (35; 2% instances), case (27; 1% instances), iobj (21; 1% instances), obl:agent (21; 1% instances), advcl (18; 1% instances), csubj (18; 1% instances), xcomp (14; 1% instances), obj (13; 1% instances), aux (9; 0% instances), det (5; 0% instances), advmod:emph (4; 0% instances), appos (4; 0% instances), compound (4; 0% instances), ccomp (3; 0% instances), csubj:pass (3; 0% instances), acl (2; 0% instances), amod (2; 0% instances), nummod (2; 0% instances), parataxis (2; 0% instances), det:numgov (1; 0% instances), discourse (1; 0% instances), orphan (1; 0% instances)

Children of ADJ nodes belong to 15 different parts of speech: NOUN (492; 26% instances), PUNCT (374; 20% instances), AUX (321; 17% instances), ADV (183; 10% instances), PROPN (96; 5% instances), CCONJ (85; 5% instances), VERB (82; 4% instances), ADJ (66; 3% instances), DET (65; 3% instances), SCONJ (52; 3% instances), PRON (32; 2% instances), ADP (27; 1% instances), PART (7; 0% instances), NUM (2; 0% instances), SYM (2; 0% instances)