home zh/pos edit page issue tracker

This page pertains to UD version 2.

SYM: symbol

Definition

A symbol is a word-like entity that differs from ordinary words by form, function, or both.

Many symbols are or contain special non-alphanumeric, non-standard logographic characters, similarly to punctuation. What makes them different from punctuation is that they can be substituted by normal words. This involves all currency symbols, e.g. $ 75 is identical to 七十五圓 / qīshíwǔ yuán “seventy-five dollars”.

Mathematical operators form another group of symbols.

Another group of symbols is emoticons and emoji.

Examples


Treebank Statistics (UD_Chinese)

There are 11 SYM lemmas (0%), 11 SYM types (0%) and 34 SYM tokens (0%). Out of 15 observed tags, the rank of SYM is: 15 in number of lemmas, 15 in number of types and 15 in number of tokens.

The 10 most frequent SYM lemmas: /、 $、 +、 Kink.com、 km、 t.163.com、 t.qq.com、 t.sina.com.cn、 t.sohu.com、 t.xxxx.com

The 10 most frequent SYM types: /、 $、 +、 Kink.com、 km、 t.163.com、 t.qq.com、 t.sina.com.cn、 t.sohu.com、 t.xxxx.com

The 10 most frequent ambiguous lemmas:

The 10 most frequent ambiguous types:

Morphology

The form / lemma ratio of SYM is 1.000000 (the average of all parts of speech is 1.000284).

The 1st highest number of forms (1) was observed with the lemma “$”: $.

The 2nd highest number of forms (1) was observed with the lemma “+”: +.

The 3rd highest number of forms (1) was observed with the lemma “/”: /.

SYM does not occur with any features.

Relations

SYM nodes are attached to their parents using 3 different relations: zh-dep/punct (27; 79% instances), zh-dep/obj (6; 18% instances), zh-dep/nsubj (1; 3% instances)

Parents of SYM nodes belong to 7 different parts of speech: NOUN (15; 44% instances), VERB (7; 21% instances), NUM (4; 12% instances), X (4; 12% instances), PROPN (2; 6% instances), ADJ (1; 3% instances), PART (1; 3% instances)

31 (91%) SYM nodes are leaves.

0 (0%) SYM nodes have one child.

1 (3%) SYM nodes have two children.

2 (6%) SYM nodes have three or more children.

The highest child degree of a SYM node is 5.

Children of SYM nodes are attached using 7 different relations: zh-dep/acl (2; 20% instances), zh-dep/nmod (2; 20% instances), zh-dep/punct (2; 20% instances), zh-dep/advmod (1; 10% instances), zh-dep/appos (1; 10% instances), zh-dep/conj (1; 10% instances), zh-dep/nummod (1; 10% instances)

Children of SYM nodes belong to 6 different parts of speech: NOUN (4; 40% instances), PUNCT (2; 20% instances), ADP (1; 10% instances), ADV (1; 10% instances), NUM (1; 10% instances), PROPN (1; 10% instances)


SYM in other languages: [am] [ar] [bg] [bxr] [ca] [ckb] [cop] [cs] [cu] [da] [de] [el] [en] [es] [et] [eu] [fa] [fi] [fo] [fr] [ga] [gl] [got] [grc] [he] [hi] [hr] [hu] [id] [it] [ja] [kk] [kmr] [ko] [la] [lv] [mr] [nl] [no] [pl] [pt] [ro] [ru] [sa] [sk] [sla] [sl] [so] [sr] [sv] [swl] [ta] [tr] [ug] [uk] [u] [urj] [ur] [vi] [yue] [zh]