Treebank Statistics: UD_Czech: Features: NumForm
This feature is language-specific.
It occurs with 3 different values: Digit
, Roman
, Word
.
41165 tokens (3%) have a non-empty value of NumForm
.
3589 types (3%) occur at least once with a non-empty value of NumForm
.
3428 lemmas (6%) occur at least once with a non-empty value of NumForm
.
The feature is used with 1 part-of-speech tags: NUM (41165; 3% instances).
NUM
41165 NUM tokens (99% of all NUM
tokens) have a non-empty value of NumForm
.
The most frequent other feature values with which NUM
and NumForm
co-occurred: NumType=Card (41165; 100%), Gender=EMPTY (36748; 89%), NumValue=EMPTY (33115; 80%), Case=EMPTY (29884; 73%), Number=EMPTY (29858; 73%).
NUM
tokens may have the following values of NumForm
:
Digit
(29481; 72% of non-emptyNumForm
): 1, 2, 3, 4, 6, 5, 1992, 10, 1994, 1993Roman
(376; 1% of non-emptyNumForm
): II, I, III, IV, V, VI, XX, D, C, IXWord
(11308; 27% of non-emptyNumForm
): dva, tři, jeden, dvě, tisíc, dvou, pět, čtyři, obou, jednoho
NumForm
seems to be lexical feature of NUM
. 100% lemmas (3428) occur only with one value of NumForm
.
Relations with Agreement in NumForm
The 10 most frequent relations where parent and child node agree in NumForm
:
NUM –[conj]–> NUM (3247; 100%),
NUM –[compound]–> NUM (2671; 95%),
NUM –[orphan]–> NUM (79; 98%),
NUM –[dep]–> NUM (50; 96%).