NumForm

home cs/feat edit page issue tracker

This page pertains to UD version 2.

`NumForm`: numeral form

Values:

Digit

Roman

Word

Feature of cardinal and ordinal numbers. Is the number expressed by digits or as a word?

`Word`: number expressed as word

Examples

jeden “one”, dva “two”, tři “three”

`Digit`: number expressed using digits

Examples

1, 2, 3

`Roman`: roman numeral

Examples

I, II, III

Treebank Statistics (UD_Czech)

This feature is language-specific. It occurs with 3 different values: Digit, Roman, Word.

36547 tokens (3%) have a non-empty value of NumForm. 3403 types (3%) occur at least once with a non-empty value of NumForm. 3244 lemmas (6%) occur at least once with a non-empty value of NumForm. The feature is used with 1 part-of-speech tags: cs-pos/NUM (36547; 3% instances).

`NUM`

36547 cs-pos/NUM tokens (99% of all NUM tokens) have a non-empty value of NumForm.

The most frequent other feature values with which NUM and NumForm co-occurred: NumType=Card (36547; 100%), Gender=EMPTY (32665; 89%), NumValue=EMPTY (29468; 81%), Case=EMPTY (26598; 73%), Number=EMPTY (26574; 73%).

NUM tokens may have the following values of NumForm:

Digit (26226; 72% of non-empty NumForm): 1, 2, 3, 4, 6, 5, 10, 1992, 1994, 1993
Roman (348; 1% of non-empty NumForm): II, I, III, IV, V, VI, D, XX, C, IX
Word (9973; 27% of non-empty NumForm): dva, tři, jeden, dvě, tisíc, dvou, pět, obou, čtyři, jednoho

NumForm seems to be lexical feature of NUM. 100% lemmas (3244) occur only with one value of NumForm.

Relations with Agreement in `NumForm`

The 10 most frequent relations where parent and child node agree in NumForm: NUM –[conj]–> NUM (2900; 100%), NUM –[compound]–> NUM (2481; 96%), NUM –[orphan]–> NUM (73; 97%), NUM –[dep]–> NUM (38; 97%).

Treebank Statistics (UD_Czech-CAC)

This feature is language-specific. It occurs with 2 different values: Digit, Word.

7149 tokens (1%) have a non-empty value of NumForm. 123 types (0%) occur at least once with a non-empty value of NumForm. 50 lemmas (0%) occur at least once with a non-empty value of NumForm. The feature is used with 1 part-of-speech tags: cs-pos/NUM (7149; 1% instances).

`NUM`

7149 cs-pos/NUM tokens (99% of all NUM tokens) have a non-empty value of NumForm.

The most frequent other feature values with which NUM and NumForm co-occurred: NumType=Card (7149; 100%), Gender=EMPTY (6031; 84%), NumValue=EMPTY (5219; 73%), Number=EMPTY (4784; 67%), Case=EMPTY (4784; 67%).

NUM tokens may have the following values of NumForm:

Digit (4784; 67% of non-empty NumForm): #
Word (2365; 33% of non-empty NumForm): dvou, jeden, dvě, tři, dva, obou, jednoho, jedné, jedním, dvěma

NumForm seems to be lexical feature of NUM. 100% lemmas (50) occur only with one value of NumForm.

Relations with Agreement in `NumForm`

The 10 most frequent relations where parent and child node agree in NumForm: NUM –[conj]–> NUM (308; 100%), NUM –[compound]–> NUM (31; 74%), NUM –[orphan]–> NUM (16; 100%).

Treebank Statistics (UD_Czech-CLTT)

This feature is language-specific. It occurs with 2 different values: Roman, Word.

310 tokens (1%) have a non-empty value of NumForm. 86 types (2%) occur at least once with a non-empty value of NumForm. 76 lemmas (3%) occur at least once with a non-empty value of NumForm. The feature is used with 1 part-of-speech tags: cs-pos/NUM (310; 1% instances).

`NUM`

310 cs-pos/NUM tokens (100% of all NUM tokens) have a non-empty value of NumForm.

The most frequent other feature values with which NUM and NumForm co-occurred: NumType=Card (310; 100%), Gender=EMPTY (278; 90%), NumValue=EMPTY (272; 88%), Case=EMPTY (264; 85%), Number=EMPTY (264; 85%).

NUM tokens may have the following values of NumForm:

Roman (264; 85% of non-empty NumForm): 1, 3, 2, 4, 5, 41, 2004, 2008, 31, 2005
Word (46; 15% of non-empty NumForm): jeden, dvanáct, tří, dvě, jednoho, jedno, jednou, pět, dvanácti, dvou

NumForm seems to be lexical feature of NUM. 100% lemmas (76) occur only with one value of NumForm.

Relations with Agreement in `NumForm`

The 10 most frequent relations where parent and child node agree in NumForm: NUM –[conj]–> NUM (29; 100%).

NumForm in other languages: [ar] [ca] [cs] [es] [et] [la] [nl] [pt] [ro] [sl] [ta]

NumForm: numeral form

Word: number expressed as word

Examples

Digit: number expressed using digits

Examples

Roman: roman numeral

Examples

Treebank Statistics (UD_Czech)

NUM

Relations with Agreement in NumForm

Treebank Statistics (UD_Czech-CAC)

NUM

Relations with Agreement in NumForm

Treebank Statistics (UD_Czech-CLTT)

NUM

Relations with Agreement in NumForm

`NumForm`: numeral form

`Word`: number expressed as word

`Digit`: number expressed using digits

`Roman`: roman numeral

`NUM`

Relations with Agreement in `NumForm`

`NUM`

Relations with Agreement in `NumForm`

`NUM`

Relations with Agreement in `NumForm`