NUM
: numeral
A numeral is a word, functioning most typically as a determiner, adjective or pronoun, that expresses a number and a relation to the number, such as quantity, sequence, frequency or fraction.
Examples
- [kk] 0, 1, 2, 3, 4, 5, 2014, 1000000, 3.14159265359
- [kk] бір, екі, үш “one, two, three”
- [kk] I, II, III, IV, V, MMXIV
- [kk] бірінші, екінші, үшінші “first, second, third”
Treebank Statistics (UD_Kazakh)
There are 14 NUM
lemmas (5%), 16 NUM
types (5%) and 19 NUM
tokens (4%).
Out of 15 observed tags, the rank of NUM
is: 5 in number of lemmas, 5 in number of types and 7 in number of tokens.
The 10 most frequent NUM
lemmas: екеу, үш, II, 17, 1952, 2015, 24, 25, 6, VI
The 10 most frequent NUM
types: II, екеуінің, үшінші, 17, 1952, 2015, 24, 25, 6, VI
The 10 most frequent ambiguous lemmas:
The 10 most frequent ambiguous types:
Morphology
The form / lemma ratio of NUM
is 1.142857 (the average of all parts of speech is 1.240602).
The 1st highest number of forms (2) was observed with the lemma “екеу”: екеуі, екеуінің.
The 2nd highest number of forms (2) was observed with the lemma “үш”: үш, үшінші.
The 3rd highest number of forms (1) was observed with the lemma “17”: 17.
NUM
occurs with 4 features: kk-feat/NumType (19; 100% instances), kk-feat/Case (4; 21% instances), kk-feat/Number[psor] (4; 21% instances), kk-feat/Person[psor] (4; 21% instances)
NUM
occurs with 7 feature-value pairs: Case=Gen
, Case=Nom
, NumType=Card
, NumType=Coll
, NumType=Ord
, Number[psor]=Plur,Sing
, Person[psor]=3
NUM
occurs with 4 feature combinations.
The most frequent feature combination is NumType=Ord
(11 tokens).
Examples: II, үшінші, 17, 1952, 2015, 24, 6, VI, VIII
Relations
NUM
nodes are attached to their parents using 5 different relations: kk-dep/amod (7; 37% instances), kk-dep/nsubj (4; 21% instances), kk-dep/nummod (4; 21% instances), kk-dep/appos (2; 11% instances), kk-dep/nmod:poss (2; 11% instances)
Parents of NUM
nodes belong to 2 different parts of speech: NOUN (18; 95% instances), VERB (1; 5% instances)
13 (68%) NUM
nodes are leaves.
5 (26%) NUM
nodes have one child.
1 (5%) NUM
nodes have two children.
The highest child degree of a NUM
node is 2.
Children of NUM
nodes are attached using 3 different relations: kk-dep/flat:name (5; 71% instances), kk-dep/advmod (1; 14% instances), kk-dep/punct (1; 14% instances)
Children of NUM
nodes belong to 4 different parts of speech: PROPN (3; 43% instances), NOUN (2; 29% instances), ADV (1; 14% instances), PUNCT (1; 14% instances)
NUM in other languages: [am] [ar] [bg] [bxr] [ca] [ckb] [cop] [cs] [cu] [da] [de] [el] [en] [es] [et] [eu] [fa] [fi] [fo] [fr] [ga] [gl] [got] [grc] [he] [hi] [hr] [hu] [id] [it] [ja] [kk] [kmr] [ko] [la] [lv] [mr] [nl] [no] [pl] [pt] [ro] [ru] [sa] [sk] [sla] [sl] [so] [sr] [sv] [swl] [ta] [tr] [ug] [uk] [u] [urj] [ur] [vi] [yue] [zh]