
This page pertains to UD version 2.

NOUN: noun

Definition

Nouns are a part of speech typically denoting a person, place, thing, animal, or idea.

The NOUN tag is intended for common nouns only. See PROPN for proper nouns and PRON for pronouns.

As a special case, classifiers (量詞 / liàngcí) are also tagged NOUN per UD guidelines. Their classifier status may be preserved in the feature column (FEATS) as NounType=Clf.
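A minimal sketch of how a classifier could be told apart from an ordinary common noun when reading CoNLL-U data, assuming the standard UD spelling `NounType=Clf` in the FEATS column. The helper names `parse_feats` and `is_classifier` and the example token line are hypothetical and are not quoted from the treebank.

```python
def parse_feats(feats):
    """Turn a CoNLL-U FEATS string such as 'A=1|B=2' into a dict ('_' means no features)."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def is_classifier(line):
    """Return True if a CoNLL-U token line is tagged NOUN and carries NounType=Clf."""
    # CoNLL-U columns: ID FORM LEMMA UPOS XPOS FEATS HEAD DEPREL DEPS MISC
    cols = line.rstrip("\n").split("\t")
    upos, feats = cols[3], parse_feats(cols[5])
    return upos == "NOUN" and feats.get("NounType") == "Clf"


# Illustrative token lines, made up for this sketch.
CLASSIFIER_LINE = "2\t個\t個\tNOUN\t_\tNounType=Clf\t1\tclf\t_\t_"
PLAIN_NOUN_LINE = "3\t人\t人\tNOUN\t_\t_\t4\tnsubj\t_\t_"

print(is_classifier(CLASSIFIER_LINE), is_classifier(PLAIN_NOUN_LINE))
```

Because the feature is optional, a treebank that does not annotate it would make every NOUN look like a plain noun under this check.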

Examples


Treebank Statistics (UD_Chinese)

There are 7644 NOUN lemmas (36%), 7645 NOUN types (36%) and 30733 NOUN tokens (28%). Among the 15 observed tags, NOUN ranks first in number of lemmas, number of types and number of tokens.
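The three counts differ in what they count: distinct lemmas, distinct word forms (types), and running tokens. The sketch below shows one way such counts could be derived, assuming the tokens are available as `(form, lemma, upos)` triples. The function `tag_stats` and the toy sample are hypothetical and are not the script that produced the figures on this page.

```python
def tag_stats(tokens, tag):
    """Count distinct lemmas, distinct word forms (types) and tokens for one UPOS tag.

    `tokens` is an iterable of (form, lemma, upos) triples.
    """
    forms, lemmas, n_tokens = set(), set(), 0
    for form, lemma, upos in tokens:
        if upos == tag:
            forms.add(form)
            lemmas.add(lemma)
            n_tokens += 1
    return {"lemmas": len(lemmas), "types": len(forms), "tokens": n_tokens}


# Toy sample, invented for illustration only.
SAMPLE = [
    ("人", "人", "NOUN"),
    ("人們", "人", "NOUN"),
    ("年", "年", "NOUN"),
    ("年", "年", "NOUN"),
    ("是", "是", "AUX"),
]

print(tag_stats(SAMPLE, "NOUN"))  # {'lemmas': 2, 'types': 3, 'tokens': 4}
```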

The 10 most frequent NOUN lemmas: 年、 個、 月、 人、 日、 等、 種、 次、 人口、 國家

The 10 most frequent NOUN types: 年、 個、 月、 日、 人、 等、 種、 次、 人口、 國家

The 10 most frequent ambiguous lemmas: 年 (NOUN 1380, PART 4), 人 (NOUN 349, PART 222, VERB 1), 日 (NOUN 348, PROPN 50, PART 6, NUM 2), 等 (NOUN 208, VERB 3), 種 (NOUN 164, PART 5, VERB 1), 次 (NOUN 145, VERB 3, PART 3, NUM 1), 名 (NOUN 110, PART 5, VERB 3), 大學 (NOUN 110, PROPN 1), 世界 (NOUN 97, PROPN 1), 米 (NOUN 79, PART 1)

The 10 most frequent ambiguous types: 年 (NOUN 1380, PART 4), 日 (NOUN 348, PROPN 50, PART 6, NUM 2), 人 (NOUN 332, PART 222, VERB 1), 等 (NOUN 208, VERB 3), 種 (NOUN 164, PART 5, VERB 1), 次 (NOUN 145, PART 3, VERB 3, NUM 1), 名 (NOUN 110, PART 5, VERB 3), 大學 (NOUN 110, PROPN 1), 世界 (NOUN 97, PROPN 1), 米 (NOUN 79, PART 1)
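The ambiguity lists above pair each lemma or type with the tags it was observed under. A minimal sketch of that bookkeeping follows, assuming "ambiguous" simply means "seen with more than one UPOS tag". The function `ambiguous_lemmas` and the sample data are hypothetical.

```python
from collections import Counter, defaultdict


def ambiguous_lemmas(tokens):
    """Map each lemma seen with more than one UPOS tag to its per-tag counts.

    `tokens` is an iterable of (lemma, upos) pairs.
    """
    by_lemma = defaultdict(Counter)
    for lemma, upos in tokens:
        by_lemma[lemma][upos] += 1
    return {lemma: tags for lemma, tags in by_lemma.items() if len(tags) > 1}


# Toy sample, invented for illustration only.
SAMPLE = [("年", "NOUN"), ("年", "NOUN"), ("年", "PART"), ("國家", "NOUN")]

for lemma, tags in ambiguous_lemmas(SAMPLE).items():
    # most_common() orders the tags by descending count, as in the lists above.
    print(lemma, tags.most_common())
```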

Morphology

The form / lemma ratio of NOUN is 1.000131 (the average of all parts of speech is 1.000284).
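The reported ratio can be reproduced from the counts in the treebank statistics, assuming it is simply the number of distinct NOUN forms (types) divided by the number of distinct NOUN lemmas. The variable names are illustrative.

```python
n_types = 7645   # NOUN types reported in the treebank statistics
n_lemmas = 7644  # NOUN lemmas reported in the treebank statistics

ratio = n_types / n_lemmas
print(f"{ratio:.6f}")  # 1.000131
```

A ratio this close to 1 reflects how little inflection the treebank records for Chinese nouns: a single extra form across all lemmas accounts for the whole difference.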

The highest number of forms (2) was observed with the lemma “人”: 人, 人們.

All other lemmas were observed with a single form, for example “8.17” (8.17) and “m” (m).

NOUN occurs with 1 feature: zh-feat/Number (17; 0% instances)

NOUN occurs with 1 feature-value pair: Number=Plur

NOUN occurs with 2 feature combinations. The most frequent feature combination is _ (30716 tokens). Examples: 年、 個、 月、 日、 人、 等、 種、 次、 人口、 國家

Relations

NOUN nodes are attached to their parents using 28 different relations: zh-dep/nmod (7256; 24% instances), zh-dep/obj (5209; 17% instances), zh-dep/nsubj (4821; 16% instances), zh-dep/clf (2015; 7% instances), zh-dep/case:suff (1746; 6% instances), zh-dep/obl (1740; 6% instances), zh-dep/det (1621; 5% instances), zh-dep/conj (1462; 5% instances), zh-dep/nmod:tmod (1397; 5% instances), zh-dep/appos (825; 3% instances), zh-dep/acl (670; 2% instances), zh-dep/advmod (580; 2% instances), zh-dep/root (476; 2% instances), zh-dep/dep (409; 1% instances), zh-dep/ccomp (187; 1% instances), zh-dep/nsubj:pass (135; 0% instances), zh-dep/xcomp (43; 0% instances), zh-dep/iobj (42; 0% instances), zh-dep/csubj (35; 0% instances), zh-dep/advcl (17; 0% instances), zh-dep/acl:relcl (13; 0% instances), zh-dep/amod (12; 0% instances), zh-dep/nummod (7; 0% instances), zh-dep/dislocated (6; 0% instances), zh-dep/mark (4; 0% instances), zh-dep/case:pref (2; 0% instances), zh-dep/orphan (2; 0% instances), zh-dep/mark:relcl (1; 0% instances)
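The percentages in this list appear to be each count's share of all 30733 NOUN tokens, rounded to the nearest whole number, which is why rare relations show as 0%. The sketch below checks that reading against a few of the listed values. The helper `share` is hypothetical, and the choice of denominator is an assumption.

```python
TOTAL_NOUN_TOKENS = 30733  # NOUN tokens reported in the treebank statistics


def share(count, total=TOTAL_NOUN_TOKENS):
    """Percentage of `total`, rounded to the nearest whole number."""
    return round(100 * count / total)


# (relation, count) pairs copied from the list above.
for relation, count in [("nmod", 7256), ("obj", 5209), ("nsubj", 4821), ("nsubj:pass", 135)]:
    print(f"{relation}: {count} ({share(count)}%)")
```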

Parents of NOUN nodes belong to 16 different parts of speech: VERB (13001; 42% instances), NOUN (11979; 39% instances), PART (3357; 11% instances), ADJ (731; 2% instances), PROPN (716; 2% instances), ROOT (476; 2% instances), NUM (213; 1% instances), ADP (127; 0% instances), X (95; 0% instances), PRON (17; 0% instances), ADV (11; 0% instances), SYM (4; 0% instances), DET (2; 0% instances), PUNCT (2; 0% instances), AUX (1; 0% instances), CCONJ (1; 0% instances)

11668 (38%) NOUN nodes are leaves.

9771 (32%) NOUN nodes have one child.

5002 (16%) NOUN nodes have two children.

4292 (14%) NOUN nodes have three or more children.

The highest child degree of a NOUN node is 19.
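The four child-count figures above sum to 30733, the total number of NOUN tokens, so they form a complete histogram over NOUN nodes. A minimal sketch of how such a histogram could be computed for one sentence follows, assuming the HEAD and UPOS columns are available as parallel lists. The function `child_degree_histogram` and the four-token sentence are hypothetical.

```python
from collections import Counter


def child_degree_histogram(heads, upos, tag="NOUN"):
    """Histogram of the number of children for nodes carrying the given tag.

    `heads[i]` is the 1-based HEAD of token i+1 (0 marks the root) and
    `upos[i]` is its UPOS tag, as in a single CoNLL-U sentence.
    """
    # Count how many tokens point at each head; the artificial root (0) is skipped.
    n_children = Counter(h for h in heads if h != 0)
    return Counter(n_children[i + 1] for i, t in enumerate(upos) if t == tag)


# Invented sentence 三 個 人 來: 個 attaches to 三, 三 to 人, 人 to 來, 來 is the root.
HEADS = [3, 1, 4, 0]
UPOS = ["NUM", "NOUN", "NOUN", "VERB"]

hist = child_degree_histogram(HEADS, UPOS)
print(sorted(hist.items()))  # [(0, 1), (1, 1)]
```

Here 個 is a leaf (zero children) and 人 has one child, so each bucket holds one NOUN node.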

Children of NOUN nodes are attached using 34 different relations: zh-dep/nmod (9471; 25% instances), zh-dep/nummod (5491; 15% instances), zh-dep/det (3461; 9% instances), zh-dep/punct (2812; 8% instances), zh-dep/clf (2002; 5% instances), zh-dep/case (1897; 5% instances), zh-dep/case:dec (1624; 4% instances), zh-dep/amod (1602; 4% instances), zh-dep/conj (1466; 4% instances), zh-dep/acl:relcl (1354; 4% instances), zh-dep/acl (1294; 3% instances), zh-dep/cop (1047; 3% instances), zh-dep/nsubj (981; 3% instances), zh-dep/cc (875; 2% instances), zh-dep/case:pref (564; 2% instances), zh-dep/appos (433; 1% instances), zh-dep/dep (346; 1% instances), zh-dep/advmod (242; 1% instances), zh-dep/mark (72; 0% instances), zh-dep/csubj (57; 0% instances), zh-dep/nmod:tmod (44; 0% instances), zh-dep/advcl (29; 0% instances), zh-dep/case:suff (28; 0% instances), zh-dep/dislocated (27; 0% instances), zh-dep/ccomp (17; 0% instances), zh-dep/mark:relcl (14; 0% instances), zh-dep/aux (10; 0% instances), zh-dep/obj (9; 0% instances), zh-dep/xcomp (9; 0% instances), zh-dep/discourse (6; 0% instances), zh-dep/orphan (2; 0% instances), zh-dep/aux:caus (1; 0% instances), zh-dep/case:aspect (1; 0% instances), zh-dep/mark:advb (1; 0% instances)

Children of NOUN nodes belong to 15 different parts of speech: NOUN (11979; 32% instances), NUM (5593; 15% instances), PART (3750; 10% instances), ADP (2912; 8% instances), PROPN (2903; 8% instances), PUNCT (2795; 7% instances), VERB (1833; 5% instances), ADJ (1586; 4% instances), AUX (1058; 3% instances), DET (998; 3% instances), CCONJ (872; 2% instances), PRON (536; 1% instances), ADV (243; 1% instances), X (216; 1% instances), SYM (15; 0% instances)

