NUM

home zh/pos edit page issue tracker

This page pertains to UD version 2.

`NUM`: numeral

Definition

A numeral is a word, functioning most typically as a determiner, a pronoun or an adjective, that expresses a number and a relation to the number, such as quantity, sequence, frequency or fraction.

Cardinal numerals are covered by NUM regardless of syntactic function and regardless of whether they are expressed as words (五 / wǔ “five”) or digits (5). By contrast, ordinal numerals are always tagged ADJ.

Examples

1, 2, 3, 4, 5, 100, 10,358, 5.23, 3/4
一 / yī “one”, 二 / èr “two”, 三 / sān “three”, 一百 / yībǎi “one hundred”, 五十六 / wǔshíliù “fifty-six”, 一萬三百五十八 / yīwànsānbǎiwǔshíbā “ten thousand three hundred and fifty-eight”, 四分之三 / sìfēnzhīsān “three-quarters”

Treebank Statistics (UD_Chinese)

There are 1162 NUM lemmas (5%), 1162 NUM types (5%) and 6006 NUM tokens (5%). Out of 15 observed tags, the rank of NUM is: 4 in number of lemmas, 4 in number of types and 6 in number of tokens.

The 10 most frequent NUM lemmas: 一、兩、三、 1、第一、 3、 2、 12、 5、 10

The 10 most frequent NUM types: 一、兩、三、 1、第一、 3、 2、 12、 5、 10

The 10 most frequent ambiguous lemmas: 一 (NUM 1006, NOUN 1), 第一 (NUM 105, PROPN 2), 四 (NUM 80, X 1), 多 (NUM 74, ADV 26, ADJ 14, PART 2), 雙 (NUM 34, NOUN 1), 很多 (NUM 32, ADJ 4), 單 (NUM 26, PART 2), 半 (NUM 22, PART 4), 數 (NUM 22, PART 15), 第四 (NUM 14, X 1)

The 10 most frequent ambiguous types: 一 (NUM 1006, NOUN 1), 第一 (NUM 105, PROPN 2), 四 (NUM 80, X 1), 多 (NUM 74, ADV 26, ADJ 14, PART 2), 雙 (NUM 34, NOUN 1), 很多 (NUM 32, ADJ 4), 單 (NUM 26, PART 2), 半 (NUM 22, PART 4), 數 (NUM 22, PART 15), 第四 (NUM 14, X 1)

一
- NUM 1006: 其測試包含了美術治療法 , 認知行為治療和洞察療法 , 同時給行為分析提供了一個理論性的交流平台 .
- NOUN 1: 這一修正案涉及公民權利和平等法律保護 , 最初提出是為了解決南北戰爭後昔日奴隸的相關問題 .
第一
- NUM 105: 北京站是當時中國大陸規模最大、設備最先進的鐵路車站 , 也是第一個現代化大型鐵路客運站 .
- PROPN 2: KKR 的資本募集主要局限於一小部分投資者 , 這其中就包括希爾曼 ( Hillman ) 家族和第一芝加哥銀行 .
四
- NUM 80: 古巨基於 2006 年度得到四台聯頒音樂大獎歌曲大獎成為繼陳慧琳之後連續奪得最多次歌曲獎的歌手 .
- X 1: 四、妳不該離大家太遠 , 孤芳自賞 , 也不行打官腔裝正經 .
多
- NUM 74: 而且學校的伙食和住宿條件也多年遭到在校生的詬病 .
- ADV 26: 近年來 , 肯亞女子長距離田徑項目也開始嶄露頭角 , 而這些女運動員們也多為卡倫金人 .
- ADJ 14: 後來卡通造型的桑德斯上校 ( 由演員 Randy Quaid 配音 ) , 出現在越來越多的肯德基廣告中 .
- PART 2: 研討會和講座可以享用多媒體演示、視頻會議和同時由數個不同地點的通訊的設備的支持 .
雙
- NUM 34: 實際上的雙筒望遠鏡當然多少有些誤差 .
- NOUN 1: 而復寫眼持有者因為這雙特殊的眼睛可以直接跳過這一步 , 在其後的學習中也要比一般修習者快上數倍 .
很多
- NUM 32: 由於這次失事原因涉及很多敏感的爭議性 , 因此最後仍未有一個具體及統一的事故調查報告 .
- ADJ 4: 在很多城市 , 羅素被扣上異端的帽子 , 隨之而來的批評家數量也是直線上升 .
單
- NUM 26: 一般來說 , 同一款間格的單邊單位比非單邊的呎價約貴 20% .
- PART 2: 各地的分部辦事處之前會準備好足夠的邀請單 , 以便在發放時給住戶 .
半
- NUM 22: 半腰座椅 , 亦稱半腰位、半截座椅 , 鐵路車輛座位的一種 .
- PART 4: 她的姥姥講俄語 , 並且是半俄國血統半威爾士血統 .
數
- NUM 22: 研討會和講座可以享用多媒體演示、視頻會議和同時由數個不同地點的通訊的設備的支持 .
- PART 15: 與靜態酒不同 , 較高糖份的葡萄並不是氣泡酒的上選原料 , 所以葡萄植株的掛果數也會比較多 .
第四
- NUM 14: 薩爾曼 · 魯西迪的第四部小說 , 出版於 1988 年 , 其靈感來源於穆罕默德的生活 .
- X 1: 第四 , 繼任的市領導為促進經濟復甦 , 招商引資 , 造成嚴重的環境污染 .

Morphology

The form / lemma ratio of NUM is 1.000000 (the average of all parts of speech is 1.000284).

The 1st highest number of forms (1) was observed with the lemma “,”: ,.

The 2nd highest number of forms (1) was observed with the lemma “-15”: -15.

The 3rd highest number of forms (1) was observed with the lemma “-300”: -300.

NUM occurs with 1 features: zh-feat/NumType (6006; 100% instances)

NUM occurs with 1 feature-value pairs: NumType=Card

NUM occurs with 1 feature combinations. The most frequent feature combination is NumType=Card (6006 tokens). Examples: 一、兩、三、 1、第一、 3、 2、 12、 5、 10

Relations

NUM nodes are attached to their parents using 19 different relations: zh-dep/nummod (5578; 93% instances), zh-dep/root (73; 1% instances), zh-dep/obj (54; 1% instances), zh-dep/conj (51; 1% instances), zh-dep/advmod (48; 1% instances), zh-dep/nmod (47; 1% instances), zh-dep/det (37; 1% instances), zh-dep/nsubj (30; 0% instances), zh-dep/dep (24; 0% instances), zh-dep/acl (14; 0% instances), zh-dep/nmod:tmod (9; 0% instances), zh-dep/obl (8; 0% instances), zh-dep/appos (7; 0% instances), zh-dep/case:suff (7; 0% instances), zh-dep/amod (6; 0% instances), zh-dep/ccomp (5; 0% instances), zh-dep/xcomp (4; 0% instances), zh-dep/punct (3; 0% instances), zh-dep/nsubj:pass (1; 0% instances)

Parents of NUM nodes belong to 9 different parts of speech: NOUN (5593; 93% instances), VERB (143; 2% instances), PART (82; 1% instances), ROOT (73; 1% instances), NUM (66; 1% instances), X (21; 0% instances), PROPN (14; 0% instances), ADJ (13; 0% instances), SYM (1; 0% instances)

5673 (94%) NUM nodes are leaves.

179 (3%) NUM nodes have one child.

53 (1%) NUM nodes have two children.

101 (2%) NUM nodes have three or more children.

The highest child degree of a NUM node is 16.

Children of NUM nodes are attached using 23 different relations: zh-dep/punct (217; 26% instances), zh-dep/det (107; 13% instances), zh-dep/nsubj (91; 11% instances), zh-dep/cop (89; 11% instances), zh-dep/dep (64; 8% instances), zh-dep/conj (50; 6% instances), zh-dep/cc (43; 5% instances), zh-dep/case:dec (37; 4% instances), zh-dep/nmod (37; 4% instances), zh-dep/advmod (32; 4% instances), zh-dep/acl (24; 3% instances), zh-dep/appos (10; 1% instances), zh-dep/case (10; 1% instances), zh-dep/nummod (8; 1% instances), zh-dep/nmod:tmod (5; 1% instances), zh-dep/csubj (4; 0% instances), zh-dep/flat:foreign (4; 0% instances), zh-dep/mark (2; 0% instances), zh-dep/acl:relcl (1; 0% instances), zh-dep/amod (1; 0% instances), zh-dep/case:pref (1; 0% instances), zh-dep/obj (1; 0% instances), zh-dep/xcomp (1; 0% instances)

Children of NUM nodes belong to 15 different parts of speech: NOUN (213; 25% instances), PUNCT (213; 25% instances), AUX (89; 11% instances), PART (71; 8% instances), NUM (66; 8% instances), CCONJ (43; 5% instances), VERB (41; 5% instances), ADV (29; 3% instances), ADP (16; 2% instances), DET (14; 2% instances), PROPN (13; 2% instances), X (12; 1% instances), PRON (11; 1% instances), ADJ (4; 0% instances), SYM (4; 0% instances)