NUM: numeral
The English NUM tag corresponds exactly to the Penn Treebank (PTB) tag CD (cardinal number).
Treebank Statistics (UD_English)
There are 1063 NUM lemmas (6%), 1064 NUM types (5%) and 4377 NUM tokens (2%).
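Counts of this kind can be reproduced from a treebank's CoNLL-U files. The following is a minimal sketch, assuming that "types" means distinct word forms as written and "lemmas" means distinct values of the LEMMA column; the helper names `row`, `read_tokens` and `tag_counts` and the toy sample are hypothetical and are not taken from the tool that generated this page.

```python
from typing import Iterable, Iterator, List, Tuple


def row(i: int, form: str, lemma: str, upos: str) -> str:
    # Build one 10-column CoNLL-U token line; columns not needed here are "_".
    return "\t".join([str(i), form, lemma, upos, "_", "_", "_", "_", "_", "_"])


# Toy two-sentence sample (invented for illustration, not treebank data).
SAMPLE = [
    "# text = Two dogs chased 2 cats",
    row(1, "Two", "two", "NUM"),
    row(2, "dogs", "dog", "NOUN"),
    row(3, "chased", "chase", "VERB"),
    row(4, "2", "2", "NUM"),
    row(5, "cats", "cat", "NOUN"),
    "",
    "# text = two left",
    row(1, "two", "two", "NUM"),
    row(2, "left", "leave", "VERB"),
]


def read_tokens(lines: Iterable[str]) -> Iterator[List[str]]:
    """Yield the columns of every regular token line."""
    for line in lines:
        line = line.rstrip("\n")
        if not line or line.startswith("#"):
            continue  # sentence break or comment line
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword ranges (1-2) and empty nodes (1.1)
        yield cols


def tag_counts(lines: Iterable[str], upos: str = "NUM") -> Tuple[int, int, int]:
    """Return (distinct lemmas, distinct forms, tokens) for one UPOS tag."""
    lemmas, types, tokens = set(), set(), 0
    for cols in read_tokens(lines):
        form, lemma, tag = cols[1], cols[2], cols[3]
        if tag == upos:
            tokens += 1
            types.add(form)
            lemmas.add(lemma)
    return len(lemmas), len(types), tokens


print(tag_counts(SAMPLE))
```

The percentages in parentheses appear to be the tag's share of all lemmas, types and tokens in the treebank, so they would additionally require the same counts over every tag.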
Out of 17 observed tags, the rank of NUM is: 5 in number of lemmas, 5 in number of types and 13 in number of tokens.
The 10 most frequent NUM lemmas: one, two, 2, 3, 5, 1, 4, 10, three, 6
The 10 most frequent NUM types: one, two, 2, 3, 5, 1, 4, 10, three, 6
The 10 most frequent ambiguous lemmas: one (NUM 415, NOUN 127, PRON 25, VERB 1), 2 (NUM 139, X 30, PART 1, PROPN 1), 3 (NUM 116, X 17, NOUN 1), 5 (NUM 103, X 4), 1 (NUM 101, X 31), 4 (NUM 93, X 13, ADP 1), 10 (NUM 89, X 2), 6 (NUM 59, X 2), 20 (NUM 58, NOUN 5), m (NUM 46, NOUN 17, PROPN 2)
The 10 most frequent ambiguous types: one (NUM 368, NOUN 88, PRON 22), 2 (NUM 139, X 30, PROPN 1, PART 1), 3 (NUM 116, X 17), 5 (NUM 103, X 4), 1 (NUM 101, X 31), 4 (NUM 93, X 13, ADP 1), 10 (NUM 89, X 2), 6 (NUM 59, X 2), 20 (NUM 58, NOUN 3), m (NUM 41, AUX 20, NOUN 11, PROPN 2, VERB 1)
- one
- 2
  - NUM 139: Analyst Team 2 : Coach : Doug Sewell
  - X 30: * 2 . The second ingredient is words , more precisely lies . *
  - PROPN 1: and it seems this is the FIRST site of ragnarok 2 hahaha since the site is new send me your suggestions and comments
  - PART 1: hi everyone …. just hav my hands on my new OLYMPUS X940 digital camera .. wel , i always wanted 2 hav one by sony .. but anyways , ended up having olympus X940 from my dad ……. does any1 already has it ?
- 3
- 5
- 1
- 4
- 10
- 6
- 20
- m
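The ambiguity lists above can be approximated by counting, for each word form, how often it occurs under each tag, and keeping the forms that occur under NUM and at least one other tag. This is a minimal sketch on invented (form, tag) pairs; the function name `ambiguous_types` and the ranking by NUM frequency are assumptions that merely match the ordering seen in the lists, not a documented rule.

```python
from collections import Counter, defaultdict


def ambiguous_types(pairs, upos="NUM"):
    """Return forms seen with `upos` and at least one other tag.

    Forms are ranked by their frequency under `upos`; the tags of each
    form are listed from most to least frequent.
    """
    by_form = defaultdict(Counter)
    for form, tag in pairs:
        by_form[form][tag] += 1
    ambiguous = {f: c for f, c in by_form.items() if upos in c and len(c) > 1}
    ranked = sorted(ambiguous.items(), key=lambda item: -item[1][upos])
    return [(form, counts.most_common()) for form, counts in ranked]


# Invented toy data: (form, UPOS) pairs.
PAIRS = [
    ("one", "NUM"), ("one", "NUM"), ("one", "PRON"),
    ("2", "NUM"), ("2", "X"), ("2", "NUM"), ("2", "NUM"),
    ("three", "NUM"),
]

for form, tags in ambiguous_types(PAIRS):
    print(form, tags)
```

Replacing the form with the lemma in each pair would give the corresponding list of ambiguous lemmas.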
Morphology
The form / lemma ratio of NUM is 1.000941 (the average of all parts of speech is 1.181137).
The 1st highest number of forms (2) was observed with the lemma “’72”: ‘72, ’72.
The 2nd highest number of forms (1) was observed with the lemma “’02”: ‘02.
The 3rd highest number of forms (1) was observed with the lemma “’05”: ‘05.
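The reported ratio is consistent with dividing the number of NUM types by the number of NUM lemmas given earlier (1064 / 1063). The sketch below checks that arithmetic and shows one way to find the lemma with the most forms; the function name `form_lemma_stats` and the toy pairs are hypothetical, and the exact definition used by the page generator is an assumption.

```python
from collections import defaultdict


def form_lemma_stats(pairs):
    """pairs: non-empty list of (form, lemma) for the tokens of one tag.

    Returns (forms per lemma, lemma with the most forms, its forms).
    """
    forms_by_lemma = defaultdict(set)
    for form, lemma in pairs:
        forms_by_lemma[lemma].add(form)
    n_types = len({f for forms in forms_by_lemma.values() for f in forms})
    ratio = n_types / len(forms_by_lemma)
    richest = max(forms_by_lemma, key=lambda lemma: len(forms_by_lemma[lemma]))
    return ratio, richest, sorted(forms_by_lemma[richest])


# Arithmetic check against the figures quoted on this page.
reported_ratio = round(1064 / 1063, 6)
print(reported_ratio)

# Invented toy data: the lemma "two" has two distinct forms.
TOY = [("two", "two"), ("Two", "two"), ("2", "2"), ("two", "two")]
print(form_lemma_stats(TOY))
```

When every token shares a single placeholder lemma, this ratio simply equals the number of types.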
NUM occurs with 2 features: en-feat/NumType (4376; 100% instances), en-feat/Number (1; 0% instances)
NUM occurs with 2 feature-value pairs: NumType=Card, Number=Sing
NUM occurs with 2 feature combinations.
The most frequent feature combination is NumType=Card (4376 tokens).
Examples: one, two, 2, 3, 5, 1, 4, 10, three, 6
Relations
NUM nodes are attached to their parents using 29 different relations: en-dep/nummod (2630; 60% instances), en-dep/root (392; 9% instances), en-dep/nmod (250; 6% instances), en-dep/compound (235; 5% instances), en-dep/obl (219; 5% instances), en-dep/appos (179; 4% instances), en-dep/nsubj (95; 2% instances), en-dep/obj (88; 2% instances), en-dep/conj (82; 2% instances), en-dep/list (53; 1% instances), en-dep/nmod:tmod (49; 1% instances), en-dep/amod (18; 0% instances), en-dep/parataxis (15; 0% instances), en-dep/obl:tmod (12; 0% instances), en-dep/advmod (9; 0% instances), en-dep/xcomp (8; 0% instances), en-dep/advcl (7; 0% instances), en-dep/ccomp (7; 0% instances), en-dep/obl:npmod (7; 0% instances), en-dep/nmod:npmod (5; 0% instances), en-dep/nsubj:pass (4; 0% instances), en-dep/acl:relcl (3; 0% instances), en-dep/case (2; 0% instances), en-dep/det (2; 0% instances), en-dep/reparandum (2; 0% instances), en-dep/iobj (1; 0% instances), en-dep/nmod:poss (1; 0% instances), en-dep/orphan (1; 0% instances), en-dep/vocative (1; 0% instances)
Parents of NUM nodes belong to 12 different parts of speech: NOUN (2151; 49% instances), PROPN (684; 16% instances), ROOT (392; 9% instances), VERB (379; 9% instances), NUM (378; 9% instances), SYM (315; 7% instances), ADJ (37; 1% instances), ADV (15; 0% instances), X (13; 0% instances), PRON (7; 0% instances), DET (5; 0% instances), AUX (1; 0% instances)
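The two distributions above can be sketched as a single pass over the HEAD and DEPREL columns of a CoNLL-U file: each NUM token contributes its relation label and the tag of its head, with head 0 counted as ROOT. The helper names `row`, `sentences` and `attachment_stats` and the sample sentences are hypothetical; this illustrates the counting under those assumptions and is not the code that produced the page.

```python
from collections import Counter


def row(i, form, upos, head, deprel):
    # One 10-column CoNLL-U token line; columns not needed here are "_".
    return "\t".join(
        [str(i), form, "_", upos, "_", "_", str(head), deprel, "_", "_"]
    )


# Invented toy sample: "Two dogs chased 2 cats" and "Seven ."
SAMPLE = "\n".join([
    row(1, "Two", "NUM", 2, "nummod"),
    row(2, "dogs", "NOUN", 3, "nsubj"),
    row(3, "chased", "VERB", 0, "root"),
    row(4, "2", "NUM", 5, "nummod"),
    row(5, "cats", "NOUN", 3, "obj"),
    "",
    row(1, "Seven", "NUM", 0, "root"),
    row(2, ".", "PUNCT", 1, "punct"),
    "",
])


def sentences(text):
    """Yield each sentence as a list of token column lists."""
    sent = []
    for line in text.splitlines():
        if not line.strip():
            if sent:
                yield sent
            sent = []
        elif not line.startswith("#"):
            cols = line.split("\t")
            if cols[0].isdigit():  # skip multiword ranges and empty nodes
                sent.append(cols)
    if sent:
        yield sent


def attachment_stats(text, upos="NUM"):
    """Count relation labels and parent tags of all `upos` tokens."""
    relations, parent_tags = Counter(), Counter()
    for sent in sentences(text):
        tag_of = {int(c[0]): c[3] for c in sent}
        for c in sent:
            if c[3] != upos:
                continue
            relations[c[7]] += 1
            head = int(c[6])
            parent_tags["ROOT" if head == 0 else tag_of[head]] += 1
    return relations, parent_tags


relations, parent_tags = attachment_stats(SAMPLE)
total = sum(relations.values())
for label, n in relations.most_common():
    print(f"{label} ({n}; {round(100 * n / total)}% instances)")
```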
2787 (64%) NUM nodes are leaves.
1003 (23%) NUM nodes have one child.
250 (6%) NUM nodes have two children.
337 (8%) NUM nodes have three or more children.
The highest child degree of a NUM node is 10.
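The four child-degree counts add up to the 4377 NUM tokens, and each percentage is the count divided by that total, rounded to a whole number (which is why they sum to 101%). The sketch below checks this arithmetic and shows one way to bucket nodes by number of children; the function name `child_degree_buckets` and the toy sentences are hypothetical.

```python
from collections import Counter


def child_degree_buckets(sents, upos="NUM"):
    """sents: list of sentences, each a list of (upos, head) per token,
    where head is the 1-based index of the parent (0 for the root).

    Returns a Counter keyed by 0, 1, 2 or 3 (3 meaning three or more).
    """
    buckets = Counter()
    for sent in sents:
        n_children = Counter(head for _, head in sent if head != 0)
        for idx, (tag, _) in enumerate(sent, start=1):
            if tag == upos:
                buckets[min(n_children[idx], 3)] += 1
    return buckets


# Arithmetic check against the figures quoted on this page.
reported = [2787, 1003, 250, 337]  # leaves, one, two, three or more
total = sum(reported)
percentages = [round(100 * n / total) for n in reported]
print(total, percentages)

# Invented toy data: two NUM leaves, and one NUM root with one child.
TOY = [
    [("NUM", 2), ("NOUN", 3), ("VERB", 0), ("NUM", 5), ("NOUN", 3)],
    [("NUM", 0), ("PUNCT", 1)],
]
print(child_degree_buckets(TOY))
```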
Children of NUM nodes are attached using 34 different relations: en-dep/punct (635; 23% instances), en-dep/case (485; 18% instances), en-dep/nmod (311; 11% instances), en-dep/advmod (201; 7% instances), en-dep/nmod:tmod (175; 6% instances), en-dep/appos (172; 6% instances), en-dep/compound (144; 5% instances), en-dep/conj (88; 3% instances), en-dep/cc (80; 3% instances), en-dep/cop (80; 3% instances), en-dep/nsubj (77; 3% instances), en-dep/nummod (64; 2% instances), en-dep/det (53; 2% instances), en-dep/parataxis (40; 1% instances), en-dep/amod (26; 1% instances), en-dep/acl:relcl (18; 1% instances), en-dep/mark (12; 0% instances), en-dep/aux (11; 0% instances), en-dep/obl (9; 0% instances), en-dep/advcl (8; 0% instances), en-dep/nmod:npmod (8; 0% instances), en-dep/discourse (5; 0% instances), en-dep/_ (3; 0% instances), en-dep/acl (3; 0% instances), en-dep/nmod:poss (2; 0% instances), en-dep/reparandum (2; 0% instances), en-dep/cc:preconj (1; 0% instances), en-dep/ccomp (1; 0% instances), en-dep/csubj (1; 0% instances), en-dep/det:predet (1; 0% instances), en-dep/goeswith (1; 0% instances), en-dep/obj (1; 0% instances), en-dep/vocative (1; 0% instances), en-dep/xcomp (1; 0% instances)
Children of NUM nodes belong to 17 different parts of speech: PUNCT (625; 23% instances), NOUN (521; 19% instances), ADP (416; 15% instances), NUM (375; 14% instances), ADV (180; 7% instances), SYM (100; 4% instances), AUX (91; 3% instances), ADJ (78; 3% instances), CCONJ (78; 3% instances), PRON (75; 3% instances), VERB (68; 3% instances), DET (60; 2% instances), PROPN (37; 1% instances), SCONJ (6; 0% instances), PART (4; 0% instances), INTJ (3; 0% instances), X (3; 0% instances)
Treebank Statistics (UD_English-ESL)
There is 1 NUM lemma (6%), 1 NUM type (6%) and 776 NUM tokens (1%).
Out of 17 observed tags, the rank of NUM is: 9 in number of lemmas, 9 in number of types and 14 in number of tokens.
The 10 most frequent NUM lemmas: _
The 10 most frequent NUM types: _
The 10 most frequent ambiguous lemmas: _ (NOUN 14135, VERB 13583, PRON 9575, DET 9068, PUNCT 8624, ADP 7769, ADJ 5278, ADV 5121, AUX 4111, PART 3169, CONJ 2865, SCONJ 2278, PROPN 1574, NUM 776, INTJ 67, X 60, SYM 37)
The 10 most frequent ambiguous types: _ (NOUN 14135, VERB 13583, PRON 9575, DET 9068, PUNCT 8624, ADP 7769, ADJ 5278, ADV 5121, AUX 4111, PART 3169, CONJ 2865, SCONJ 2278, PROPN 1574, NUM 776, INTJ 67, X 60, SYM 37)
Morphology
The form / lemma ratio of NUM is 1.000000 (the average of all parts of speech is 1.000000).
The 1st highest number of forms (1) was observed with the lemma “_”: _.
NUM does not occur with any features.
Relations
NUM nodes are attached to their parents using 19 different relations: en-dep/nummod (488; 63% instances), en-dep/nmod (140; 18% instances), en-dep/root (30; 4% instances), en-dep/conj (24; 3% instances), en-dep/nsubj (24; 3% instances), en-dep/dobj (20; 3% instances), en-dep/compound (12; 2% instances), en-dep/appos (10; 1% instances), en-dep/nmod:tmod (6; 1% instances), en-dep/advcl (4; 1% instances), en-dep/ccomp (4; 1% instances), en-dep/acl:relcl (3; 0% instances), en-dep/parataxis (3; 0% instances), en-dep/goeswith (2; 0% instances), en-dep/nmod:npmod (2; 0% instances), en-dep/amod (1; 0% instances), en-dep/csubjpass (1; 0% instances), en-dep/det (1; 0% instances), en-dep/xcomp (1; 0% instances)
Parents of NUM nodes belong to 10 different parts of speech: NOUN (469; 60% instances), VERB (161; 21% instances), PROPN (36; 5% instances), SYM (35; 5% instances), NUM (31; 4% instances), ROOT (30; 4% instances), ADJ (9; 1% instances), ADV (2; 0% instances), PRON (2; 0% instances), PUNCT (1; 0% instances)
473 (61%) NUM nodes are leaves.
191 (25%) NUM nodes have one child.
46 (6%) NUM nodes have two children.
66 (9%) NUM nodes have three or more children.
The highest child degree of a NUM node is 9.
Children of NUM nodes are attached using 25 different relations: en-dep/case (147; 26% instances), en-dep/nmod (89; 16% instances), en-dep/punct (52; 9% instances), en-dep/cop (46; 8% instances), en-dep/nsubj (44; 8% instances), en-dep/advmod (42; 7% instances), en-dep/conj (29; 5% instances), en-dep/cc (27; 5% instances), en-dep/det (20; 3% instances), en-dep/compound (17; 3% instances), en-dep/amod (13; 2% instances), en-dep/mark (9; 2% instances), en-dep/acl:relcl (7; 1% instances), en-dep/parataxis (6; 1% instances), en-dep/appos (4; 1% instances), en-dep/advcl (3; 1% instances), en-dep/aux (3; 1% instances), en-dep/goeswith (3; 1% instances), en-dep/neg (3; 1% instances), en-dep/acl (2; 0% instances), en-dep/nummod (2; 0% instances), en-dep/csubj (1; 0% instances), en-dep/discourse (1; 0% instances), en-dep/nmod:poss (1; 0% instances), en-dep/xcomp (1; 0% instances)
Children of NUM nodes belong to 16 different parts of speech: ADP (146; 26% instances), NOUN (90; 16% instances), VERB (75; 13% instances), PUNCT (51; 9% instances), ADV (47; 8% instances), PRON (34; 6% instances), NUM (31; 5% instances), CONJ (27; 5% instances), ADJ (23; 4% instances), DET (21; 4% instances), PROPN (13; 2% instances), SCONJ (7; 1% instances), AUX (3; 1% instances), PART (2; 0% instances), SYM (1; 0% instances), X (1; 0% instances)
Treebank Statistics (UD_English-LinES)
There is 1 NUM lemma (6%), 106 NUM types (1%) and 462 NUM tokens (1%).
Out of 17 observed tags, the rank of NUM is: 9 in number of lemmas, 6 in number of types and 14 in number of tokens.
The 10 most frequent NUM lemmas: _
The 10 most frequent NUM types: one, two, three, 2002, five, 2000, six, ten, 1, four
The 10 most frequent ambiguous lemmas: _ (NOUN 12161, PUNCT 8085, VERB 8020, ADP 6788, DET 6429, PRON 6303, ADJ 4270, ADV 3700, AUX 3539, PROPN 2257, CCONJ 2081, PART 1703, SCONJ 1231, NUM 462, INTJ 122, X 41, SYM 5)
The 10 most frequent ambiguous types: one (PRON 89, NUM 88, DET 7), 1 (NUM 10, ADJ 1), 12 (NUM 7, ADJ 1), 3 (NUM 2, ADJ 1), 30 (NUM 2, ADJ 1), 5 (NUM 2, ADJ 1), U (NUM 2, NOUN 1), 22 (ADJ 2, NUM 1)
- one
- 1
- 12
  - NUM 7: The vote will take place tomorrow at 12 noon .
  - ADJ 1: On July 12 , after the raid , Israel was accused of giving comfort to the reactionaries of Rhodesia and South Africa by its demonstration of military superiority and its use of Western arms and techniques , upsetting the balance between poor and rich countries , disturbing the work of men of good will in Paris who were trying to create a new climate and to treat the countries of the Third World as equals and partners .
- 3
- 30
- 5
  - NUM 2: Note that you are not required to link either a CSS file or an XSL style sheet to an XML document in order for Internet Explorer 5 ( and later versions ) to display the document .
  - ADJ 1: We have done so : on 5 February we published an extremely detailed press release dealing with the questions you have raised .
- U
- 22
Morphology
The form / lemma ratio of NUM is 106.000000 (the average of all parts of speech is 527.705882).
The 1st highest number of forms (106) was observed with the lemma “_”: 1, 1-100, 10, 100, 100c, 101-200, 11.25, 11.30, 111, 12, 12:30, 13, 14, 1857, 1875, 1910, 1947, 1952, 1953, 1955, 1973, 1976, 1996, 1996-1997, 1997, 2, 2.6, 2000, 2002, 2005, 22, 23, 25, 3, 30, 37, 38, 4, 4-5, 40, 43, 4:30, 5, 5.5, 50, 50000, 60, 6500, 7, 7.0, 7.15, 747, 84, 9, 96/23, 96/96/EC, 97, A4-0072/97, C4-0497/98-98/0126, H-0002/99, H-0045/99, H-0209/99, H-0218/97, H-0237/97, No-15, No-44, No-46, No-49, No-59, No-6, No-8, U, billion, eight, eleven, fifteen, five, forty, forty-eight, four, fourteen, hundred, million, n, nine, nineteen, nn, one, seven, six, six-thirty, sixteen, sixty, ten, thirty, thirty-eight, thirty-five, thousand, three, twelve, twenty, twenty-five, twenty-four, twenty-six, twenty-two, two.
NUM does not occur with any features.
Relations
NUM nodes are attached to their parents using 15 different relations: en-dep/nummod (316; 68% instances), en-dep/obl (38; 8% instances), en-dep/conj (27; 6% instances), en-dep/discourse (14; 3% instances), en-dep/nsubj (14; 3% instances), en-dep/flat (13; 3% instances), en-dep/appos (11; 2% instances), en-dep/root (10; 2% instances), en-dep/obj (9; 2% instances), en-dep/nsubj:pass (3; 1% instances), en-dep/nmod (2; 0% instances), en-dep/xcomp (2; 0% instances), en-dep/advmod (1; 0% instances), en-dep/ccomp (1; 0% instances), en-dep/dislocated (1; 0% instances)
Parents of NUM nodes belong to 11 different parts of speech: NOUN (278; 60% instances), VERB (72; 16% instances), NUM (42; 9% instances), PROPN (41; 9% instances), ROOT (10; 2% instances), ADJ (5; 1% instances), ADV (4; 1% instances), SYM (4; 1% instances), PRON (3; 1% instances), ADP (2; 0% instances), AUX (1; 0% instances)
262 (57%) NUM nodes are leaves.
119 (26%) NUM nodes have one child.
49 (11%) NUM nodes have two children.
32 (7%) NUM nodes have three or more children.
The highest child degree of a NUM node is 14.
Children of NUM nodes are attached using 19 different relations: en-dep/case (73; 22% instances), en-dep/nmod (46; 14% instances), en-dep/punct (38; 11% instances), en-dep/advmod (34; 10% instances), en-dep/conj (33; 10% instances), en-dep/compound (27; 8% instances), en-dep/cc (21; 6% instances), en-dep/det (14; 4% instances), en-dep/nummod (14; 4% instances), en-dep/fixed (10; 3% instances), en-dep/appos (8; 2% instances), en-dep/cop (5; 1% instances), en-dep/nsubj (4; 1% instances), en-dep/obl:agent (4; 1% instances), en-dep/amod (3; 1% instances), en-dep/acl (2; 1% instances), en-dep/acl:relcl (1; 0% instances), en-dep/aux (1; 0% instances), en-dep/mark (1; 0% instances)
Children of NUM nodes belong to 15 different parts of speech: NOUN (74; 22% instances), ADP (73; 22% instances), NUM (42; 12% instances), ADV (39; 12% instances), PUNCT (38; 11% instances), CCONJ (26; 8% instances), DET (14; 4% instances), PROPN (8; 2% instances), PRON (7; 2% instances), AUX (6; 2% instances), ADJ (5; 1% instances), VERB (4; 1% instances), PART (1; 0% instances), SCONJ (1; 0% instances), X (1; 0% instances)
Treebank Statistics (UD_English-ParTUT)
There are 222 NUM lemmas (4%), 222 NUM types (3%) and 691 NUM tokens (2%).
Out of 17 observed tags, the rank of NUM is: 6 in number of lemmas, 6 in number of types and 13 in number of tokens.
The 10 most frequent NUM lemmas: two, one, 1, three, four, 18, 2, 3, 20, 2002
The 10 most frequent NUM types: two, one, 1, three, four, 18, 2, 3, 20, 2002
The 10 most frequent ambiguous lemmas: two (NUM 47, NOUN 2), one (NUM 42, PRON 21, DET 2, NOUN 1), three (NUM 20, NOUN 1), million (NUM 5, NOUN 2), ten (NUM 2, NOUN 2), - (PUNCT 289, X 6, NUM 1)
The 10 most frequent ambiguous types: two (NUM 37, NOUN 1), one (NUM 37, PRON 20, DET 2, NOUN 1), three (NUM 20, NOUN 1), ten (NUM 2, NOUN 1)
- two
- one
  - NUM 37: I should like to address one final point .
  - PRON 20: No one shall be subjected to arbitrary arrest , detention or exile .
  - DET 2: This was the first book Balzac released under his own name , and it gave him what one critic called “ passage into the Promised Land “ .
  - NOUN 1: The late romances , with their shifts in time and surprising turns of plot , inspired a last poetic style in which long and short sentences are set against one another , clauses are piled up , subject and object are reversed , and words are omitted , creating an effect of spontaneity .
- three
- ten
Morphology
The form / lemma ratio of NUM is 1.000000 (the average of all parts of speech is 1.187751).
The 1st highest number of forms (1) was observed with the lemma “-”: -20º.
The 2nd highest number of forms (1) was observed with the lemma “0083”: 0083.
The 3rd highest number of forms (1) was observed with the lemma “1”: 1.
NUM occurs with 1 feature: en-feat/NumType (691; 100% instances)
NUM occurs with 1 feature-value pair: NumType=Card
NUM occurs with 1 feature combination.
The most frequent feature combination is NumType=Card (691 tokens).
Examples: two, one, 1, three, four, 18, 2, 3, 20, 2002
Relations
NUM nodes are attached to their parents using 15 different relations: en-dep/nummod (522; 76% instances), en-dep/obl (86; 12% instances), en-dep/conj (27; 4% instances), en-dep/compound (13; 2% instances), en-dep/flat (12; 2% instances), en-dep/root (10; 1% instances), en-dep/obj (8; 1% instances), en-dep/nsubj (5; 1% instances), en-dep/appos (2; 0% instances), en-dep/advcl (1; 0% instances), en-dep/ccomp (1; 0% instances), en-dep/goeswith (1; 0% instances), en-dep/nmod (1; 0% instances), en-dep/orphan (1; 0% instances), en-dep/xcomp (1; 0% instances)
Parents of NUM nodes belong to 10 different parts of speech: NOUN (295; 43% instances), VERB (137; 20% instances), PROPN (126; 18% instances), NUM (65; 9% instances), SYM (37; 5% instances), ADJ (12; 2% instances), ROOT (10; 1% instances), PUNCT (4; 1% instances), X (4; 1% instances), ADV (1; 0% instances)
388 (56%) NUM nodes are leaves.
134 (19%) NUM nodes have one child.
111 (16%) NUM nodes have two children.
58 (8%) NUM nodes have three or more children.
The highest child degree of a NUM node is 7.
Children of NUM nodes are attached using 20 different relations: en-dep/punct (224; 39% instances), en-dep/case (132; 23% instances), en-dep/nmod (51; 9% instances), en-dep/conj (29; 5% instances), en-dep/goeswith (26; 4% instances), en-dep/nummod (25; 4% instances), en-dep/cc (18; 3% instances), en-dep/det (13; 2% instances), en-dep/compound (12; 2% instances), en-dep/cop (10; 2% instances), en-dep/nsubj (10; 2% instances), en-dep/advmod (8; 1% instances), en-dep/amod (8; 1% instances), en-dep/advcl (3; 1% instances), en-dep/appos (2; 0% instances), en-dep/aux (2; 0% instances), en-dep/csubj (2; 0% instances), en-dep/mark (2; 0% instances), en-dep/fixed (1; 0% instances), en-dep/nmod:poss (1; 0% instances)
Children of NUM nodes belong to 15 different parts of speech: PUNCT (224; 39% instances), ADP (126; 22% instances), NUM (65; 11% instances), NOUN (38; 7% instances), ADJ (28; 5% instances), PROPN (24; 4% instances), CCONJ (18; 3% instances), DET (14; 2% instances), X (13; 2% instances), AUX (12; 2% instances), ADV (7; 1% instances), VERB (5; 1% instances), PRON (2; 0% instances), SCONJ (2; 0% instances), PART (1; 0% instances)