SYM
: symbol
Definition
A symbol is a word-like entity that differs from ordinary words by form, function, or both.
We recognize as symbols:
- currency symbols: $
- mathematical operators: µg / m3
- ’/’ used a separator: 2001 / 923 / CE
- emoticons and emojis: :-)
- URLs and emails
- hashtags: #
- at-mentions: @
The following are not symbols:
- Proper nouns with numbers and special characters: 130XE, DC10, DC-10 are tagged PROPN.
- Acronyms for proper nouns: UN, NATO are tagged as PROPN.
- Abbreviated words: Sig. (signore), kg (chilogrammo), km (chilometro), dott (dottore) are tagged NOUN.
- Characters used as bullets in itemized lists (*, •, ‣) are PUNCT.
Corresponding language-specific part-of-speech tags
SYM: Symbol
X: Other
Examples
- $, %, §, ©
- +, −, ×, ÷, =, <, >
- :), ♥‿♥, 😝
- john.doe@universal.org, http://universaldependencies.org/
- #universaldependencies
- @johndoe
Treebank Statistics (UD_Italian)
There are 10 SYM
lemmas (0%), 10 SYM
types (0%) and 95 SYM
tokens (0%).
Out of 17 observed tags, the rank of SYM
is: 16 in number of lemmas, 16 in number of types and 15 in number of tokens.
The 10 most frequent SYM
lemmas: %, &, +, -, http://europa.eu.int/comm/secretariat@general/sgb/state@aids, http://www.linkiesta.it/Locatelli-lombardia-Nicoli-regione, www.amnesty.it, www.centrodonmilani.org, www.legadelcane.org, x
The 10 most frequent SYM
types: %, &, +, -, http://europa.eu.int/comm/secretariat@general/sgb/state@aids, http://www.linkiesta.it/Locatelli-lombardia-Nicoli-regione, www.amnesty.it, www.centrodonmilani.org, www.legadelcane.org, x
The 10 most frequent ambiguous lemmas: & (SYM 6, CCONJ 1), - (PUNCT 749, SYM 2)
The 10 most frequent ambiguous types: & (SYM 6, CCONJ 1), - (PUNCT 750, SYM 2)
- &
- -
Morphology
The form / lemma ratio of SYM
is 1.000000 (the average of all parts of speech is 1.485733).
The 1st highest number of forms (1) was observed with the lemma “%”: %.
The 2nd highest number of forms (1) was observed with the lemma “&”: &.
The 3rd highest number of forms (1) was observed with the lemma “+”: +.
SYM
does not occur with any features.
Relations
SYM
nodes are attached to their parents using 11 different relations: it-dep/obl (39; 41% instances), it-dep/nmod (21; 22% instances), it-dep/obj (11; 12% instances), it-dep/flat:name (8; 8% instances), it-dep/nsubj (5; 5% instances), it-dep/conj (3; 3% instances), it-dep/xcomp (3; 3% instances), it-dep/nsubj:pass (2; 2% instances), it-dep/fixed (1; 1% instances), it-dep/nummod (1; 1% instances), it-dep/root (1; 1% instances)
Parents of SYM
nodes belong to 9 different parts of speech: VERB (49; 52% instances), NOUN (21; 22% instances), ADJ (10; 11% instances), PROPN (8; 8% instances), ADP (2; 2% instances), NUM (2; 2% instances), ADV (1; 1% instances), ROOT (1; 1% instances), SYM (1; 1% instances)
14 (15%) SYM
nodes are leaves.
0 (0%) SYM
nodes have one child.
6 (6%) SYM
nodes have two children.
75 (79%) SYM
nodes have three or more children.
The highest child degree of a SYM
node is 5.
Children of SYM
nodes are attached using 14 different relations: it-dep/det (79; 29% instances), it-dep/nummod (79; 29% instances), it-dep/case (54; 20% instances), it-dep/nmod (34; 13% instances), it-dep/advmod (8; 3% instances), it-dep/punct (4; 1% instances), it-dep/amod (3; 1% instances), it-dep/cc (3; 1% instances), it-dep/acl:relcl (1; 0% instances), it-dep/advcl (1; 0% instances), it-dep/conj (1; 0% instances), it-dep/cop (1; 0% instances), it-dep/nsubj (1; 0% instances), it-dep/orphan (1; 0% instances)
Children of SYM
nodes belong to 12 different parts of speech: DET (79; 29% instances), NUM (79; 29% instances), ADP (54; 20% instances), NOUN (33; 12% instances), ADV (8; 3% instances), PUNCT (4; 1% instances), ADJ (3; 1% instances), CCONJ (3; 1% instances), PROPN (3; 1% instances), VERB (2; 1% instances), AUX (1; 0% instances), SYM (1; 0% instances)
Treebank Statistics (UD_Italian-ParTUT)
There are 1 SYM
lemmas (0%), 1 SYM
types (0%) and 32 SYM
tokens (0%).
Out of 15 observed tags, the rank of SYM
is: 15 in number of lemmas, 15 in number of types and 15 in number of tokens.
The 10 most frequent SYM
lemmas: %
The 10 most frequent SYM
types: %
The 10 most frequent ambiguous lemmas:
The 10 most frequent ambiguous types:
Morphology
The form / lemma ratio of SYM
is 1.000000 (the average of all parts of speech is 1.446672).
The 1st highest number of forms (1) was observed with the lemma “%”: %.
SYM
does not occur with any features.
Relations
SYM
nodes are attached to their parents using 7 different relations: it-dep/obl (17; 53% instances), it-dep/nmod (5; 16% instances), it-dep/conj (3; 9% instances), it-dep/nsubj (2; 6% instances), it-dep/obj (2; 6% instances), it-dep/root (2; 6% instances), it-dep/nsubj:pass (1; 3% instances)
Parents of SYM
nodes belong to 6 different parts of speech: VERB (19; 59% instances), NOUN (5; 16% instances), ADJ (4; 13% instances), ROOT (2; 6% instances), NUM (1; 3% instances), SYM (1; 3% instances)
0 (0%) SYM
nodes are leaves.
0 (0%) SYM
nodes have one child.
0 (0%) SYM
nodes have two children.
32 (100%) SYM
nodes have three or more children.
The highest child degree of a SYM
node is 8.
Children of SYM
nodes are attached using 14 different relations: it-dep/det (32; 28% instances), it-dep/nummod (32; 28% instances), it-dep/case (24; 21% instances), it-dep/nmod (10; 9% instances), it-dep/cc (3; 3% instances), it-dep/punct (3; 3% instances), it-dep/advmod (2; 2% instances), it-dep/amod (2; 2% instances), it-dep/cop (2; 2% instances), it-dep/nsubj (2; 2% instances), it-dep/acl:relcl (1; 1% instances), it-dep/aux (1; 1% instances), it-dep/conj (1; 1% instances), it-dep/orphan (1; 1% instances)
Children of SYM
nodes belong to 11 different parts of speech: DET (32; 28% instances), NUM (32; 28% instances), ADP (24; 21% instances), NOUN (13; 11% instances), AUX (3; 3% instances), CCONJ (3; 3% instances), PUNCT (3; 3% instances), ADJ (2; 2% instances), ADV (2; 2% instances), SYM (1; 1% instances), VERB (1; 1% instances)
SYM in other languages: [am] [ar] [bg] [bxr] [ca] [ckb] [cop] [cs] [cu] [da] [de] [el] [en] [es] [et] [eu] [fa] [fi] [fo] [fr] [ga] [gl] [got] [grc] [he] [hi] [hr] [hu] [id] [it] [ja] [kk] [kmr] [ko] [la] [lv] [mr] [nl] [no] [pl] [pt] [ro] [ru] [sa] [sk] [sla] [sl] [so] [sr] [sv] [swl] [ta] [tr] [ug] [uk] [u] [urj] [ur] [vi] [yue] [zh]