NOUN
: noun
Definition
Nouns are a part of speech typically denoting a person, place, thing, animal, or idea.
The NOUN tag is intended for common nouns only. See PROPN
for proper nouns and PRON
for pronouns.
As a special case, classifiers (量詞 / liàngcí) are also tagged NOUN
per UD guidelines. Their classifier status may be preserved in the feature column (FEATS) as NounType=CLf
.
Examples
- Nouns
- 杯子/ bēizǐ “cup”, 草 / cǎo “grass”, 氧氣 / yǎngqì “oxygen”, 地方 / dìfāng “place”, 能力 / nénglì “ability”, 歷史 / lìshǐ “history”
- Classifiers
- 個 / gè (generic classifier), 條 / tiáo (classifier for long, slim objects), 本 / běn (classifier for book-like objects), 雙 / shuāng “pair”, 杯 / bēi “cup (of something)”, 磅 / bàng “pound”, 年 / nián “year”
Treebank Statistics (UD_Chinese)
There are 7644 NOUN
lemmas (36%), 7645 NOUN
types (36%) and 30733 NOUN
tokens (28%).
Out of 15 observed tags, the rank of NOUN
is: 1 in number of lemmas, 1 in number of types and 1 in number of tokens.
The 10 most frequent NOUN
lemmas: 年、 個、 月、 人、 日、 等、 種、 次、 人口、 國家
The 10 most frequent NOUN
types: 年、 個、 月、 日、 人、 等、 種、 次、 人口、 國家
The 10 most frequent ambiguous lemmas: 年 (NOUN 1380, PART 4), 人 (NOUN 349, PART 222, VERB 1), 日 (NOUN 348, PROPN 50, PART 6, NUM 2), 等 (NOUN 208, VERB 3), 種 (NOUN 164, PART 5, VERB 1), 次 (NOUN 145, VERB 3, PART 3, NUM 1), 名 (NOUN 110, PART 5, VERB 3), 大學 (NOUN 110, PROPN 1), 世界 (NOUN 97, PROPN 1), 米 (NOUN 79, PART 1)
The 10 most frequent ambiguous types: 年 (NOUN 1380, PART 4), 日 (NOUN 348, PROPN 50, PART 6, NUM 2), 人 (NOUN 332, PART 222, VERB 1), 等 (NOUN 208, VERB 3), 種 (NOUN 164, PART 5, VERB 1), 次 (NOUN 145, PART 3, VERB 3, NUM 1), 名 (NOUN 110, PART 5, VERB 3), 大學 (NOUN 110, PROPN 1), 世界 (NOUN 97, PROPN 1), 米 (NOUN 79, PART 1)
- 年
- 日
- 人
- 等
- 種
- 次
- 名
- 大學
- 世界
- 米
Morphology
The form / lemma ratio of NOUN
is 1.000131 (the average of all parts of speech is 1.000284).
The 1st highest number of forms (2) was observed with the lemma “人”: 人, 人們.
The 2nd highest number of forms (1) was observed with the lemma “8.17”: 8.17.
The 3rd highest number of forms (1) was observed with the lemma “m”: m.
NOUN
occurs with 1 features: zh-feat/Number (17; 0% instances)
NOUN
occurs with 1 feature-value pairs: Number=Plur
NOUN
occurs with 2 feature combinations.
The most frequent feature combination is _
(30716 tokens).
Examples: 年、 個、 月、 日、 人、 等、 種、 次、 人口、 國家
Relations
NOUN
nodes are attached to their parents using 28 different relations: zh-dep/nmod (7256; 24% instances), zh-dep/obj (5209; 17% instances), zh-dep/nsubj (4821; 16% instances), zh-dep/clf (2015; 7% instances), zh-dep/case:suff (1746; 6% instances), zh-dep/obl (1740; 6% instances), zh-dep/det (1621; 5% instances), zh-dep/conj (1462; 5% instances), zh-dep/nmod:tmod (1397; 5% instances), zh-dep/appos (825; 3% instances), zh-dep/acl (670; 2% instances), zh-dep/advmod (580; 2% instances), zh-dep/root (476; 2% instances), zh-dep/dep (409; 1% instances), zh-dep/ccomp (187; 1% instances), zh-dep/nsubj:pass (135; 0% instances), zh-dep/xcomp (43; 0% instances), zh-dep/iobj (42; 0% instances), zh-dep/csubj (35; 0% instances), zh-dep/advcl (17; 0% instances), zh-dep/acl:relcl (13; 0% instances), zh-dep/amod (12; 0% instances), zh-dep/nummod (7; 0% instances), zh-dep/dislocated (6; 0% instances), zh-dep/mark (4; 0% instances), zh-dep/case:pref (2; 0% instances), zh-dep/orphan (2; 0% instances), zh-dep/mark:relcl (1; 0% instances)
Parents of NOUN
nodes belong to 16 different parts of speech: VERB (13001; 42% instances), NOUN (11979; 39% instances), PART (3357; 11% instances), ADJ (731; 2% instances), PROPN (716; 2% instances), ROOT (476; 2% instances), NUM (213; 1% instances), ADP (127; 0% instances), X (95; 0% instances), PRON (17; 0% instances), ADV (11; 0% instances), SYM (4; 0% instances), DET (2; 0% instances), PUNCT (2; 0% instances), AUX (1; 0% instances), CCONJ (1; 0% instances)
11668 (38%) NOUN
nodes are leaves.
9771 (32%) NOUN
nodes have one child.
5002 (16%) NOUN
nodes have two children.
4292 (14%) NOUN
nodes have three or more children.
The highest child degree of a NOUN
node is 19.
Children of NOUN
nodes are attached using 34 different relations: zh-dep/nmod (9471; 25% instances), zh-dep/nummod (5491; 15% instances), zh-dep/det (3461; 9% instances), zh-dep/punct (2812; 8% instances), zh-dep/clf (2002; 5% instances), zh-dep/case (1897; 5% instances), zh-dep/case:dec (1624; 4% instances), zh-dep/amod (1602; 4% instances), zh-dep/conj (1466; 4% instances), zh-dep/acl:relcl (1354; 4% instances), zh-dep/acl (1294; 3% instances), zh-dep/cop (1047; 3% instances), zh-dep/nsubj (981; 3% instances), zh-dep/cc (875; 2% instances), zh-dep/case:pref (564; 2% instances), zh-dep/appos (433; 1% instances), zh-dep/dep (346; 1% instances), zh-dep/advmod (242; 1% instances), zh-dep/mark (72; 0% instances), zh-dep/csubj (57; 0% instances), zh-dep/nmod:tmod (44; 0% instances), zh-dep/advcl (29; 0% instances), zh-dep/case:suff (28; 0% instances), zh-dep/dislocated (27; 0% instances), zh-dep/ccomp (17; 0% instances), zh-dep/mark:relcl (14; 0% instances), zh-dep/aux (10; 0% instances), zh-dep/obj (9; 0% instances), zh-dep/xcomp (9; 0% instances), zh-dep/discourse (6; 0% instances), zh-dep/orphan (2; 0% instances), zh-dep/aux:caus (1; 0% instances), zh-dep/case:aspect (1; 0% instances), zh-dep/mark:advb (1; 0% instances)
Children of NOUN
nodes belong to 15 different parts of speech: NOUN (11979; 32% instances), NUM (5593; 15% instances), PART (3750; 10% instances), ADP (2912; 8% instances), PROPN (2903; 8% instances), PUNCT (2795; 7% instances), VERB (1833; 5% instances), ADJ (1586; 4% instances), AUX (1058; 3% instances), DET (998; 3% instances), CCONJ (872; 2% instances), PRON (536; 1% instances), ADV (243; 1% instances), X (216; 1% instances), SYM (15; 0% instances)
NOUN in other languages: [am] [ar] [bg] [bxr] [ca] [ckb] [cop] [cs] [cu] [da] [de] [el] [en] [es] [et] [eu] [fa] [fi] [fo] [fr] [ga] [gl] [got] [grc] [he] [hi] [hr] [hu] [id] [it] [ja] [kk] [kmr] [ko] [la] [lv] [mr] [nl] [no] [pl] [pt] [ro] [ru] [sa] [sk] [sla] [sl] [so] [sr] [sv] [swl] [ta] [tr] [ug] [uk] [u] [urj] [ur] [vi] [yue] [zh]