home et/pos edit page issue tracker

This page still pertains to UD version 1.

ADJ: adjective

Definition

Adjectives are words that typically modify nouns and specify their properties or attributes. They may also function as predicates as in Suvi on soe ‘The summer is warm’.
Also pro-adjectives, e.g. selline ‘such’, niisugune ‘such’ , missugune ‘which’ etc and attributive ordinal numerals like esimene ‘first’, teine ‘second’ etc are labelled ADJ according to Estonian UD annotation.
Attributive or predicative participles, e.g. valvav mees ‘guarding man’, valvatav mees ‘man who is guarded’ möödunud nädal ‘last week’, lõhutud vaas ‘broken vase’ also get the ADJ label.


Treebank Statistics (UD_Estonian)

There are 1122 ADJ lemmas (16%), 1597 ADJ types (14%) and 2270 ADJ tokens (7%). Out of 16 observed tags, the rank of ADJ is: 3 in number of lemmas, 3 in number of types and 6 in number of tokens.

The 10 most frequent ADJ lemmas: suur, hea, esimene, pikk, vana, viimane, püha, valge, noor, teine

The 10 most frequent ADJ types: hea, suur, püha, vana, noor, esimest, terve, tugev, väike, muud

The 10 most frequent ambiguous lemmas: hea (ADJ 36, NOUN 5), esimene (ADJ 28, PRON 5, DET 2), pikk (ADJ 23, PROPN 1), vana (ADJ 23, NOUN 1), püha (ADJ 22, NOUN 1), valge (ADJ 21, NOUN 3), noor (ADJ 20, NOUN 2, PROPN 1), teine (PRON 37, DET 22, ADJ 20, NUM 1), parem (ADJ 18, ADV 4), muu (ADJ 16, PRON 3, DET 3)

The 10 most frequent ambiguous types: hea (ADJ 25, NOUN 4), muud (ADJ 9, DET 1, PRON 1), sama (ADJ 5, DET 2, ADV 1), kogu (DET 8, ADJ 5), parem (ADJ 6, ADV 4), sündinud (ADJ 6, VERB 1), valge (ADJ 4, NOUN 2), õige (ADV 5, ADJ 5), kolmandat (ADJ 5, NUM 2), selline (ADJ 5, DET 1)

Morphology

The form / lemma ratio of ADJ is 1.423351 (the average of all parts of speech is 1.545328).

The 1st highest number of forms (14) was observed with the lemma “suur”: Suured, suur, suurde, suure, suureks, suurel, suurele, suures, suurest, suuri, suurt, suurte, suurteks, suurtes.

The 2nd highest number of forms (9) was observed with the lemma “esimene”: esimene, esimese, esimesed, esimesel, esimesena, esimeses, esimesest, esimest, esimestes.

The 3rd highest number of forms (9) was observed with the lemma “pikk”: pika, pikaks, pikal, pikas, pikast, pikk, pikka, pikkade, pikki.

ADJ occurs with 9 features: et-feat/Degree (2055; 91% instances), et-feat/Case (2024; 89% instances), et-feat/Number (2015; 89% instances), et-feat/VerbForm (363; 16% instances), et-feat/Voice (363; 16% instances), et-feat/Tense (300; 13% instances), et-feat/NumType (126; 6% instances), et-feat/NumForm (120; 5% instances), et-feat/PronType (54; 2% instances)

ADJ occurs with 31 feature-value pairs: Case=Abe, Case=Abl, Case=Add, Case=Ade, Case=All, Case=Com, Case=Ela, Case=Ess, Case=Gen, Case=Ill, Case=Ine, Case=Nom, Case=Par, Case=Tra, Degree=Cmp, Degree=Pos, Degree=Sup, NumForm=Digit, NumForm=Letter, NumType=Ord, Number=Plur, Number=Sing, PronType=Dem, PronType=Ind, PronType=Rel, Tense=Past, Tense=Pres, VerbForm=Part, VerbForm=Sup, Voice=Act, Voice=Pass

ADJ occurs with 131 feature combinations. The most frequent feature combination is Case=Nom|Degree=Pos|Number=Sing (572 tokens). Examples: suur, hea, noor, tugev, väike, soe, vana, viimane, pikk, püha

Relations

ADJ nodes are attached to their parents using 19 different relations: et-dep/amod (1526; 67% instances), et-dep/acl (236; 10% instances), et-dep/conj (171; 8% instances), et-dep/root (169; 7% instances), et-dep/xcomp (54; 2% instances), et-dep/advcl (29; 1% instances), et-dep/ccomp (24; 1% instances), et-dep/nsubj (14; 1% instances), et-dep/acl:relcl (12; 1% instances), et-dep/parataxis (10; 0% instances), et-dep/nmod (7; 0% instances), et-dep/nsubj:cop (4; 0% instances), et-dep/obj (4; 0% instances), et-dep/obl (4; 0% instances), et-dep/advmod:quant (2; 0% instances), et-dep/cc:preconj (1; 0% instances), et-dep/csubj (1; 0% instances), et-dep/csubj:cop (1; 0% instances), et-dep/flat (1; 0% instances)

Parents of ADJ nodes belong to 10 different parts of speech: NOUN (1622; 71% instances), VERB (232; 10% instances), ROOT (169; 7% instances), ADJ (138; 6% instances), PROPN (61; 3% instances), PRON (28; 1% instances), ADV (10; 0% instances), NUM (5; 0% instances), AUX (4; 0% instances), DET (1; 0% instances)

1457 (64%) ADJ nodes are leaves.

392 (17%) ADJ nodes have one child.

107 (5%) ADJ nodes have two children.

314 (14%) ADJ nodes have three or more children.

The highest child degree of a ADJ node is 12.

Children of ADJ nodes are attached using 28 different relations: et-dep/punct (373; 18% instances), et-dep/advmod (349; 17% instances), et-dep/obl (272; 13% instances), et-dep/cop (269; 13% instances), et-dep/nsubj:cop (213; 10% instances), et-dep/conj (190; 9% instances), et-dep/cc (116; 6% instances), et-dep/advcl (55; 3% instances), et-dep/mark (52; 2% instances), et-dep/csubj:cop (33; 2% instances), et-dep/obj (29; 1% instances), et-dep/aux (23; 1% instances), et-dep/amod (19; 1% instances), et-dep/compound:prt (15; 1% instances), et-dep/parataxis (15; 1% instances), et-dep/xcomp (13; 1% instances), et-dep/nummod (11; 1% instances), et-dep/nsubj (10; 0% instances), et-dep/csubj (5; 0% instances), et-dep/flat (5; 0% instances), et-dep/acl:relcl (4; 0% instances), et-dep/discourse (4; 0% instances), et-dep/case (3; 0% instances), et-dep/ccomp (3; 0% instances), et-dep/vocative (3; 0% instances), et-dep/cc:preconj (2; 0% instances), et-dep/det (1; 0% instances), et-dep/nmod (1; 0% instances)

Children of ADJ nodes belong to 14 different parts of speech: NOUN (390; 19% instances), ADV (381; 18% instances), PUNCT (373; 18% instances), AUX (293; 14% instances), VERB (161; 8% instances), ADJ (138; 7% instances), PRON (118; 6% instances), CCONJ (116; 6% instances), PROPN (53; 3% instances), SCONJ (45; 2% instances), NUM (12; 1% instances), INTJ (4; 0% instances), ADP (3; 0% instances), DET (1; 0% instances)


ADJ in other languages: [am] [ar] [bg] [bxr] [ca] [ckb] [cop] [cs] [cu] [da] [de] [el] [en] [es] [et] [eu] [fa] [fi] [fo] [fr] [ga] [gl] [got] [grc] [he] [hi] [hr] [hu] [id] [it] [ja] [kk] [kmr] [ko] [la] [lv] [mr] [nl] [no] [pl] [pt] [ro] [ru] [sa] [sk] [sla] [sl] [so] [sr] [sv] [swl] [ta] [tr] [ug] [uk] [u] [urj] [ur] [vi] [yue] [zh]