This page still pertains to UD version 1.

NUM: numeral

The English NUM corresponds exactly to the PTB CD.


Treebank Statistics (UD_English)

There are 1063 NUM lemmas (6%), 1064 NUM types (5%) and 4377 NUM tokens (2%). Out of 17 observed tags, the rank of NUM is: 5 in number of lemmas, 5 in number of types and 13 in number of tokens.

The 10 most frequent NUM lemmas: one, two, 2, 3, 5, 1, 4, 10, three, 6

The 10 most frequent NUM types: one, two, 2, 3, 5, 1, 4, 10, three, 6

The 10 most frequent ambiguous lemmas: one (NUM 415, NOUN 127, PRON 25, VERB 1), 2 (NUM 139, X 30, PART 1, PROPN 1), 3 (NUM 116, X 17, NOUN 1), 5 (NUM 103, X 4), 1 (NUM 101, X 31), 4 (NUM 93, X 13, ADP 1), 10 (NUM 89, X 2), 6 (NUM 59, X 2), 20 (NUM 58, NOUN 5), m (NUM 46, NOUN 17, PROPN 2)

The 10 most frequent ambiguous types: one (NUM 368, NOUN 88, PRON 22), 2 (NUM 139, X 30, PROPN 1, PART 1), 3 (NUM 116, X 17), 5 (NUM 103, X 4), 1 (NUM 101, X 31), 4 (NUM 93, X 13, ADP 1), 10 (NUM 89, X 2), 6 (NUM 59, X 2), 20 (NUM 58, NOUN 3), m (NUM 41, AUX 20, NOUN 11, PROPN 2, VERB 1)
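The ambiguity lists above can be reproduced in spirit with a small tally. The following is a minimal sketch, assuming that a type counts as ambiguous when it occurs with NUM and at least one other tag, and that the ranking is by NUM frequency; the function name `ambiguous_types` and the toy data are hypothetical and not part of the UD tooling.

```python
from collections import Counter, defaultdict


def ambiguous_types(tokens, upos, top=10):
    """List forms tagged `upos` that also occur with at least one other tag."""
    tags_by_form = defaultdict(Counter)
    for form, tag in tokens:
        tags_by_form[form][tag] += 1
    ambiguous = [
        (form, tags)
        for form, tags in tags_by_form.items()
        if tags[upos] > 0 and len(tags) > 1
    ]
    # Assumption: rank by how often the form carries the tag of interest.
    ambiguous.sort(key=lambda item: item[1][upos], reverse=True)
    return [(form, tags.most_common()) for form, tags in ambiguous[:top]]


# Hypothetical toy sample of (form, upos) pairs, not treebank data.
sample = (
    [("one", "NUM")] * 3
    + [("one", "PRON")]
    + [("2", "NUM")] * 2
    + [("2", "X")]
    + [("three", "NUM")]
)
result = ambiguous_types(sample, "NUM")
for form, tags in result:
    print(form, tags)
```

In this toy sample "three" is left out because it only ever appears as NUM.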

Morphology

The form / lemma ratio of NUM is 1.000941 (the average of all parts of speech is 1.181137).
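The reported ratio matches the number of distinct types divided by the number of distinct lemmas (1064 / 1063). A minimal sketch of that calculation follows, assuming types are compared exactly as written; the function name `form_lemma_ratio` and the toy triples are hypothetical.

```python
def form_lemma_ratio(tokens, upos):
    """Return (distinct forms) / (distinct lemmas) for one UPOS tag."""
    forms = {form for form, _lemma, tag in tokens if tag == upos}
    lemmas = {lemma for _form, lemma, tag in tokens if tag == upos}
    if not lemmas:
        raise ValueError(f"no tokens tagged {upos}")
    return len(forms) / len(lemmas)


# Hypothetical toy sample of (form, lemma, upos) triples, not treebank data.
# \u2018 and \u2019 are the left and right single quotation marks.
sample = [
    ("\u201872", "\u201972", "NUM"),  # two spellings of one lemma
    ("\u201972", "\u201972", "NUM"),
    ("one", "one", "NUM"),
    ("two", "two", "NUM"),
    ("one", "one", "PRON"),  # ignored: different tag
]
ratio = form_lemma_ratio(sample, "NUM")
print(f"{ratio:.6f}")  # 4 forms / 3 lemmas

# The counts reported for UD_English give the published figure.
reported = f"{1064 / 1063:.6f}"
print(reported)
```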

The 1st highest number of forms (2) was observed with the lemma “’72”: ‘72, ’72.

The 2nd highest number of forms (1) was observed with the lemma “’02”: ‘02.

The 3rd highest number of forms (1) was observed with the lemma “’05”: ‘05.

NUM occurs with 2 features: en-feat/NumType (4376; 100% instances), en-feat/Number (1; 0% instances)

NUM occurs with 2 feature-value pairs: NumType=Card, Number=Sing

NUM occurs with 2 feature combinations. The most frequent feature combination is NumType=Card (4376 tokens). Examples: one, two, 2, 3, 5, 1, 4, 10, three, 6

Relations

NUM nodes are attached to their parents using 29 different relations: en-dep/nummod (2630; 60% instances), en-dep/root (392; 9% instances), en-dep/nmod (250; 6% instances), en-dep/compound (235; 5% instances), en-dep/obl (219; 5% instances), en-dep/appos (179; 4% instances), en-dep/nsubj (95; 2% instances), en-dep/obj (88; 2% instances), en-dep/conj (82; 2% instances), en-dep/list (53; 1% instances), en-dep/nmod:tmod (49; 1% instances), en-dep/amod (18; 0% instances), en-dep/parataxis (15; 0% instances), en-dep/obl:tmod (12; 0% instances), en-dep/advmod (9; 0% instances), en-dep/xcomp (8; 0% instances), en-dep/advcl (7; 0% instances), en-dep/ccomp (7; 0% instances), en-dep/obl:npmod (7; 0% instances), en-dep/nmod:npmod (5; 0% instances), en-dep/nsubj:pass (4; 0% instances), en-dep/acl:relcl (3; 0% instances), en-dep/case (2; 0% instances), en-dep/det (2; 0% instances), en-dep/reparandum (2; 0% instances), en-dep/iobj (1; 0% instances), en-dep/nmod:poss (1; 0% instances), en-dep/orphan (1; 0% instances), en-dep/vocative (1; 0% instances)

Parents of NUM nodes belong to 12 different parts of speech: NOUN (2151; 49% instances), PROPN (684; 16% instances), ROOT (392; 9% instances), VERB (379; 9% instances), NUM (378; 9% instances), SYM (315; 7% instances), ADJ (37; 1% instances), ADV (15; 0% instances), X (13; 0% instances), PRON (7; 0% instances), DET (5; 0% instances), AUX (1; 0% instances)
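Each percentage in the two lists above is the count divided by the total number of NUM tokens (for example, 2630 / 4377 is about 60%). A minimal sketch of producing such a summary follows, assuming simple rounding to whole percent; the function name `relation_summary` and the toy relation list are hypothetical.

```python
from collections import Counter


def relation_summary(deprels):
    """Format relation counts as 'rel (count; share% instances)' strings."""
    counts = Counter(deprels)
    total = sum(counts.values())
    return [
        f"{rel} ({n}; {round(100 * n / total)}% instances)"
        for rel, n in counts.most_common()
    ]


# Hypothetical toy sample: one dependency relation label per NUM token.
sample = ["nummod"] * 6 + ["root"] * 3 + ["nmod"]
summary = relation_summary(sample)
print(", ".join(summary))

# Share of nummod among the NUM tokens reported for UD_English.
nummod_share = round(100 * 2630 / 4377)
print(nummod_share)
```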

2787 (64%) NUM nodes are leaves.

1003 (23%) NUM nodes have one child.

250 (6%) NUM nodes have two children.

337 (8%) NUM nodes have three or more children.

The highest child degree of a NUM node is 10.
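The child-degree figures above count, for every NUM node, how many tokens in the same sentence name it as their head. A minimal sketch follows, assuming each sentence is given as (id, head, upos) triples with head 0 for the root; the function name `child_degree_buckets` and the two toy sentences are hypothetical.

```python
from collections import Counter


def child_degree_buckets(sentences, upos):
    """Bucket nodes tagged `upos` by number of children: 0, 1, 2 or 3+."""
    buckets = Counter()
    for sentence in sentences:
        # Number of dependents per head id within this sentence.
        degree = Counter(head for _tok_id, head, _tag in sentence)
        for tok_id, _head, tag in sentence:
            if tag == upos:
                n = degree[tok_id]
                buckets["3+" if n >= 3 else str(n)] += 1
    return buckets


# Hypothetical toy sentences as (id, head, upos) triples, not treebank data.
sentences = [
    # "I bought two books": "two" has no dependents (a leaf).
    [(1, 2, "PRON"), (2, 0, "VERB"), (3, 4, "NUM"), (4, 2, "NOUN")],
    # "It costs about 5 dollars": "about" attaches to "5" (one child).
    [(1, 2, "PRON"), (2, 0, "VERB"), (3, 4, "ADV"), (4, 5, "NUM"), (5, 2, "NOUN")],
]
buckets = child_degree_buckets(sentences, "NUM")
print(dict(buckets))
```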

Children of NUM nodes are attached using 34 different relations: en-dep/punct (635; 23% instances), en-dep/case (485; 18% instances), en-dep/nmod (311; 11% instances), en-dep/advmod (201; 7% instances), en-dep/nmod:tmod (175; 6% instances), en-dep/appos (172; 6% instances), en-dep/compound (144; 5% instances), en-dep/conj (88; 3% instances), en-dep/cc (80; 3% instances), en-dep/cop (80; 3% instances), en-dep/nsubj (77; 3% instances), en-dep/nummod (64; 2% instances), en-dep/det (53; 2% instances), en-dep/parataxis (40; 1% instances), en-dep/amod (26; 1% instances), en-dep/acl:relcl (18; 1% instances), en-dep/mark (12; 0% instances), en-dep/aux (11; 0% instances), en-dep/obl (9; 0% instances), en-dep/advcl (8; 0% instances), en-dep/nmod:npmod (8; 0% instances), en-dep/discourse (5; 0% instances), en-dep/_ (3; 0% instances), en-dep/acl (3; 0% instances), en-dep/nmod:poss (2; 0% instances), en-dep/reparandum (2; 0% instances), en-dep/cc:preconj (1; 0% instances), en-dep/ccomp (1; 0% instances), en-dep/csubj (1; 0% instances), en-dep/det:predet (1; 0% instances), en-dep/goeswith (1; 0% instances), en-dep/obj (1; 0% instances), en-dep/vocative (1; 0% instances), en-dep/xcomp (1; 0% instances)

Children of NUM nodes belong to 17 different parts of speech: PUNCT (625; 23% instances), NOUN (521; 19% instances), ADP (416; 15% instances), NUM (375; 14% instances), ADV (180; 7% instances), SYM (100; 4% instances), AUX (91; 3% instances), ADJ (78; 3% instances), CCONJ (78; 3% instances), PRON (75; 3% instances), VERB (68; 3% instances), DET (60; 2% instances), PROPN (37; 1% instances), SCONJ (6; 0% instances), PART (4; 0% instances), INTJ (3; 0% instances), X (3; 0% instances)


Treebank Statistics (UD_English-ESL)

There is 1 NUM lemma (6%), 1 NUM type (6%) and 776 NUM tokens (1%). Out of 17 observed tags, the rank of NUM is: 9 in number of lemmas, 9 in number of types and 14 in number of tokens.

The 10 most frequent NUM lemmas: _

The 10 most frequent NUM types: _

The 10 most frequent ambiguous lemmas: _ (NOUN 14135, VERB 13583, PRON 9575, DET 9068, PUNCT 8624, ADP 7769, ADJ 5278, ADV 5121, AUX 4111, PART 3169, CONJ 2865, SCONJ 2278, PROPN 1574, NUM 776, INTJ 67, X 60, SYM 37)

The 10 most frequent ambiguous types: _ (NOUN 14135, VERB 13583, PRON 9575, DET 9068, PUNCT 8624, ADP 7769, ADJ 5278, ADV 5121, AUX 4111, PART 3169, CONJ 2865, SCONJ 2278, PROPN 1574, NUM 776, INTJ 67, X 60, SYM 37)

Morphology

The form / lemma ratio of NUM is 1.000000 (the average of all parts of speech is 1.000000).

The 1st highest number of forms (1) was observed with the lemma “_”: _.

NUM does not occur with any features.

Relations

NUM nodes are attached to their parents using 19 different relations: en-dep/nummod (488; 63% instances), en-dep/nmod (140; 18% instances), en-dep/root (30; 4% instances), en-dep/conj (24; 3% instances), en-dep/nsubj (24; 3% instances), en-dep/dobj (20; 3% instances), en-dep/compound (12; 2% instances), en-dep/appos (10; 1% instances), en-dep/nmod:tmod (6; 1% instances), en-dep/advcl (4; 1% instances), en-dep/ccomp (4; 1% instances), en-dep/acl:relcl (3; 0% instances), en-dep/parataxis (3; 0% instances), en-dep/goeswith (2; 0% instances), en-dep/nmod:npmod (2; 0% instances), en-dep/amod (1; 0% instances), en-dep/csubjpass (1; 0% instances), en-dep/det (1; 0% instances), en-dep/xcomp (1; 0% instances)

Parents of NUM nodes belong to 10 different parts of speech: NOUN (469; 60% instances), VERB (161; 21% instances), PROPN (36; 5% instances), SYM (35; 5% instances), NUM (31; 4% instances), ROOT (30; 4% instances), ADJ (9; 1% instances), ADV (2; 0% instances), PRON (2; 0% instances), PUNCT (1; 0% instances)

473 (61%) NUM nodes are leaves.

191 (25%) NUM nodes have one child.

46 (6%) NUM nodes have two children.

66 (9%) NUM nodes have three or more children.

The highest child degree of a NUM node is 9.

Children of NUM nodes are attached using 25 different relations: en-dep/case (147; 26% instances), en-dep/nmod (89; 16% instances), en-dep/punct (52; 9% instances), en-dep/cop (46; 8% instances), en-dep/nsubj (44; 8% instances), en-dep/advmod (42; 7% instances), en-dep/conj (29; 5% instances), en-dep/cc (27; 5% instances), en-dep/det (20; 3% instances), en-dep/compound (17; 3% instances), en-dep/amod (13; 2% instances), en-dep/mark (9; 2% instances), en-dep/acl:relcl (7; 1% instances), en-dep/parataxis (6; 1% instances), en-dep/appos (4; 1% instances), en-dep/advcl (3; 1% instances), en-dep/aux (3; 1% instances), en-dep/goeswith (3; 1% instances), en-dep/neg (3; 1% instances), en-dep/acl (2; 0% instances), en-dep/nummod (2; 0% instances), en-dep/csubj (1; 0% instances), en-dep/discourse (1; 0% instances), en-dep/nmod:poss (1; 0% instances), en-dep/xcomp (1; 0% instances)

Children of NUM nodes belong to 16 different parts of speech: ADP (146; 26% instances), NOUN (90; 16% instances), VERB (75; 13% instances), PUNCT (51; 9% instances), ADV (47; 8% instances), PRON (34; 6% instances), NUM (31; 5% instances), CONJ (27; 5% instances), ADJ (23; 4% instances), DET (21; 4% instances), PROPN (13; 2% instances), SCONJ (7; 1% instances), AUX (3; 1% instances), PART (2; 0% instances), SYM (1; 0% instances), X (1; 0% instances)


Treebank Statistics (UD_English-LinES)

There is 1 NUM lemma (6%), 106 NUM types (1%) and 462 NUM tokens (1%). Out of 17 observed tags, the rank of NUM is: 9 in number of lemmas, 6 in number of types and 14 in number of tokens.

The 10 most frequent NUM lemmas: _

The 10 most frequent NUM types: one, two, three, 2002, five, 2000, six, ten, 1, four

The 10 most frequent ambiguous lemmas: _ (NOUN 12161, PUNCT 8085, VERB 8020, ADP 6788, DET 6429, PRON 6303, ADJ 4270, ADV 3700, AUX 3539, PROPN 2257, CCONJ 2081, PART 1703, SCONJ 1231, NUM 462, INTJ 122, X 41, SYM 5)

The 10 most frequent ambiguous types: one (PRON 89, NUM 88, DET 7), 1 (NUM 10, ADJ 1), 12 (NUM 7, ADJ 1), 3 (NUM 2, ADJ 1), 30 (NUM 2, ADJ 1), 5 (NUM 2, ADJ 1), U (NUM 2, NOUN 1), 22 (ADJ 2, NUM 1)

Morphology

The form / lemma ratio of NUM is 106.000000 (the average of all parts of speech is 527.705882).

The 1st highest number of forms (106) was observed with the lemma “_”: 1, 1-100, 10, 100, 100c, 101-200, 11.25, 11.30, 111, 12, 12:30, 13, 14, 1857, 1875, 1910, 1947, 1952, 1953, 1955, 1973, 1976, 1996, 1996-1997, 1997, 2, 2.6, 2000, 2002, 2005, 22, 23, 25, 3, 30, 37, 38, 4, 4-5, 40, 43, 4:30, 5, 5.5, 50, 50000, 60, 6500, 7, 7.0, 7.15, 747, 84, 9, 96/23, 96/96/EC, 97, A4-0072/97, C4-0497/98-98/0126, H-0002/99, H-0045/99, H-0209/99, H-0218/97, H-0237/97, No-15, No-44, No-46, No-49, No-59, No-6, No-8, U, billion, eight, eleven, fifteen, five, forty, forty-eight, four, fourteen, hundred, million, n, nine, nineteen, nn, one, seven, six, six-thirty, sixteen, sixty, ten, thirty, thirty-eight, thirty-five, thousand, three, twelve, twenty, twenty-five, twenty-four, twenty-six, twenty-two, two.

NUM does not occur with any features.

Relations

NUM nodes are attached to their parents using 15 different relations: en-dep/nummod (316; 68% instances), en-dep/obl (38; 8% instances), en-dep/conj (27; 6% instances), en-dep/discourse (14; 3% instances), en-dep/nsubj (14; 3% instances), en-dep/flat (13; 3% instances), en-dep/appos (11; 2% instances), en-dep/root (10; 2% instances), en-dep/obj (9; 2% instances), en-dep/nsubj:pass (3; 1% instances), en-dep/nmod (2; 0% instances), en-dep/xcomp (2; 0% instances), en-dep/advmod (1; 0% instances), en-dep/ccomp (1; 0% instances), en-dep/dislocated (1; 0% instances)

Parents of NUM nodes belong to 11 different parts of speech: NOUN (278; 60% instances), VERB (72; 16% instances), NUM (42; 9% instances), PROPN (41; 9% instances), ROOT (10; 2% instances), ADJ (5; 1% instances), ADV (4; 1% instances), SYM (4; 1% instances), PRON (3; 1% instances), ADP (2; 0% instances), AUX (1; 0% instances)

262 (57%) NUM nodes are leaves.

119 (26%) NUM nodes have one child.

49 (11%) NUM nodes have two children.

32 (7%) NUM nodes have three or more children.

The highest child degree of a NUM node is 14.

Children of NUM nodes are attached using 19 different relations: en-dep/case (73; 22% instances), en-dep/nmod (46; 14% instances), en-dep/punct (38; 11% instances), en-dep/advmod (34; 10% instances), en-dep/conj (33; 10% instances), en-dep/compound (27; 8% instances), en-dep/cc (21; 6% instances), en-dep/det (14; 4% instances), en-dep/nummod (14; 4% instances), en-dep/fixed (10; 3% instances), en-dep/appos (8; 2% instances), en-dep/cop (5; 1% instances), en-dep/nsubj (4; 1% instances), en-dep/obl:agent (4; 1% instances), en-dep/amod (3; 1% instances), en-dep/acl (2; 1% instances), en-dep/acl:relcl (1; 0% instances), en-dep/aux (1; 0% instances), en-dep/mark (1; 0% instances)

Children of NUM nodes belong to 15 different parts of speech: NOUN (74; 22% instances), ADP (73; 22% instances), NUM (42; 12% instances), ADV (39; 12% instances), PUNCT (38; 11% instances), CCONJ (26; 8% instances), DET (14; 4% instances), PROPN (8; 2% instances), PRON (7; 2% instances), AUX (6; 2% instances), ADJ (5; 1% instances), VERB (4; 1% instances), PART (1; 0% instances), SCONJ (1; 0% instances), X (1; 0% instances)


Treebank Statistics (UD_English-ParTUT)

There are 222 NUM lemmas (4%), 222 NUM types (3%) and 691 NUM tokens (2%). Out of 17 observed tags, the rank of NUM is: 6 in number of lemmas, 6 in number of types and 13 in number of tokens.

The 10 most frequent NUM lemmas: two, one, 1, three, four, 18, 2, 3, 20, 2002

The 10 most frequent NUM types: two, one, 1, three, four, 18, 2, 3, 20, 2002

The 10 most frequent ambiguous lemmas: two (NUM 47, NOUN 2), one (NUM 42, PRON 21, DET 2, NOUN 1), three (NUM 20, NOUN 1), million (NUM 5, NOUN 2), ten (NUM 2, NOUN 2), - (PUNCT 289, X 6, NUM 1)

The 10 most frequent ambiguous types: two (NUM 37, NOUN 1), one (NUM 37, PRON 20, DET 2, NOUN 1), three (NUM 20, NOUN 1), ten (NUM 2, NOUN 1)

Morphology

The form / lemma ratio of NUM is 1.000000 (the average of all parts of speech is 1.187751).

The 1st highest number of forms (1) was observed with the lemma “-”: -20º.

The 2nd highest number of forms (1) was observed with the lemma “0083”: 0083.

The 3rd highest number of forms (1) was observed with the lemma “1”: 1.

NUM occurs with 1 feature: en-feat/NumType (691; 100% instances)

NUM occurs with 1 feature-value pair: NumType=Card

NUM occurs with 1 feature combination. The most frequent feature combination is NumType=Card (691 tokens). Examples: two, one, 1, three, four, 18, 2, 3, 20, 2002

Relations

NUM nodes are attached to their parents using 15 different relations: en-dep/nummod (522; 76% instances), en-dep/obl (86; 12% instances), en-dep/conj (27; 4% instances), en-dep/compound (13; 2% instances), en-dep/flat (12; 2% instances), en-dep/root (10; 1% instances), en-dep/obj (8; 1% instances), en-dep/nsubj (5; 1% instances), en-dep/appos (2; 0% instances), en-dep/advcl (1; 0% instances), en-dep/ccomp (1; 0% instances), en-dep/goeswith (1; 0% instances), en-dep/nmod (1; 0% instances), en-dep/orphan (1; 0% instances), en-dep/xcomp (1; 0% instances)

Parents of NUM nodes belong to 10 different parts of speech: NOUN (295; 43% instances), VERB (137; 20% instances), PROPN (126; 18% instances), NUM (65; 9% instances), SYM (37; 5% instances), ADJ (12; 2% instances), ROOT (10; 1% instances), PUNCT (4; 1% instances), X (4; 1% instances), ADV (1; 0% instances)

388 (56%) NUM nodes are leaves.

134 (19%) NUM nodes have one child.

111 (16%) NUM nodes have two children.

58 (8%) NUM nodes have three or more children.

The highest child degree of a NUM node is 7.

Children of NUM nodes are attached using 20 different relations: en-dep/punct (224; 39% instances), en-dep/case (132; 23% instances), en-dep/nmod (51; 9% instances), en-dep/conj (29; 5% instances), en-dep/goeswith (26; 4% instances), en-dep/nummod (25; 4% instances), en-dep/cc (18; 3% instances), en-dep/det (13; 2% instances), en-dep/compound (12; 2% instances), en-dep/cop (10; 2% instances), en-dep/nsubj (10; 2% instances), en-dep/advmod (8; 1% instances), en-dep/amod (8; 1% instances), en-dep/advcl (3; 1% instances), en-dep/appos (2; 0% instances), en-dep/aux (2; 0% instances), en-dep/csubj (2; 0% instances), en-dep/mark (2; 0% instances), en-dep/fixed (1; 0% instances), en-dep/nmod:poss (1; 0% instances)

Children of NUM nodes belong to 15 different parts of speech: PUNCT (224; 39% instances), ADP (126; 22% instances), NUM (65; 11% instances), NOUN (38; 7% instances), ADJ (28; 5% instances), PROPN (24; 4% instances), CCONJ (18; 3% instances), DET (14; 2% instances), X (13; 2% instances), AUX (12; 2% instances), ADV (7; 1% instances), VERB (5; 1% instances), PRON (2; 0% instances), SCONJ (2; 0% instances), PART (1; 0% instances)


NUM in other languages: [am] [ar] [bg] [bxr] [ca] [ckb] [cop] [cs] [cu] [da] [de] [el] [en] [es] [et] [eu] [fa] [fi] [fo] [fr] [ga] [gl] [got] [grc] [he] [hi] [hr] [hu] [id] [it] [ja] [kk] [kmr] [ko] [la] [lv] [mr] [nl] [no] [pl] [pt] [ro] [ru] [sa] [sk] [sla] [sl] [so] [sr] [sv] [swl] [ta] [tr] [ug] [uk] [u] [urj] [ur] [vi] [yue] [zh]