home no/pos edit page issue tracker

This page still pertains to UD version 1.

SYM: symbol

#####Definition A symbol is a word-like entity that differs from ordinary words by form, function, or both.

Many symbols are or contain special non-alphanumeric characters, similarly to punctuation. What makes them different from punctuation is that they can be substituted by normal words.

#####Examples


Treebank Statistics (UD_Norwegian-Bokmaal)

There are 11 SYM lemmas (0%), 10 SYM types (0%) and 70 SYM tokens (0%). Out of 17 observed tags, the rank of SYM is: 15 in number of lemmas, 16 in number of types and 17 in number of tokens.

The 10 most frequent SYM lemmas: $/, /, ©, +, +5°C, -6°C, 70°, 72°N, 8°, 9°V

The 10 most frequent SYM types: /, ©, +, +5°C, -6°C, 70°, 72°N, 8°, 9°V, =

The 10 most frequent ambiguous lemmas:

The 10 most frequent ambiguous types:

Morphology

The form / lemma ratio of SYM is 0.909091 (the average of all parts of speech is 1.383513).

The 1st highest number of forms (1) was observed with the lemma “$/”: /.

The 2nd highest number of forms (1) was observed with the lemma “+”: +.

The 3rd highest number of forms (1) was observed with the lemma “+5°C”: +5°C.

SYM does not occur with any features.

Relations

SYM nodes are attached to their parents using 6 different relations: no-dep/compound (59; 84% instances), no-dep/conj (4; 6% instances), no-dep/root (4; 6% instances), no-dep/flat:name (1; 1% instances), no-dep/obl (1; 1% instances), no-dep/orphan (1; 1% instances)

Parents of SYM nodes belong to 8 different parts of speech: NOUN (32; 46% instances), PROPN (19; 27% instances), CCONJ (5; 7% instances), ROOT (4; 6% instances), SYM (4; 6% instances), ADJ (3; 4% instances), NUM (2; 3% instances), VERB (1; 1% instances)

60 (86%) SYM nodes are leaves.

7 (10%) SYM nodes have one child.

1 (1%) SYM nodes have two children.

2 (3%) SYM nodes have three or more children.

The highest child degree of a SYM node is 3.

Children of SYM nodes are attached using 4 different relations: no-dep/cc (4; 27% instances), no-dep/conj (4; 27% instances), no-dep/parataxis (4; 27% instances), no-dep/case (3; 20% instances)

Children of SYM nodes belong to 4 different parts of speech: CCONJ (4; 27% instances), PROPN (4; 27% instances), SYM (4; 27% instances), ADP (3; 20% instances)


Treebank Statistics (UD_Norwegian-Nynorsk)

There are 2 SYM lemmas (0%), 2 SYM types (0%) and 82 SYM tokens (0%). Out of 17 observed tags, the rank of SYM is: 16 in number of lemmas, 17 in number of types and 17 in number of tokens.

The 10 most frequent SYM lemmas: $/, ©

The 10 most frequent SYM types: /, ©

The 10 most frequent ambiguous lemmas:

The 10 most frequent ambiguous types:

Morphology

The form / lemma ratio of SYM is 1.000000 (the average of all parts of speech is 1.343969).

The 1st highest number of forms (1) was observed with the lemma “$/”: /.

The 2nd highest number of forms (1) was observed with the lemma “©”: ©.

SYM does not occur with any features.

Relations

SYM nodes are attached to their parents using 3 different relations: no-dep/compound (70; 85% instances), no-dep/flat:name (11; 13% instances), no-dep/flat:foreign (1; 1% instances)

Parents of SYM nodes belong to 7 different parts of speech: NUM (27; 33% instances), PROPN (26; 32% instances), NOUN (22; 27% instances), ADJ (4; 5% instances), CCONJ (1; 1% instances), VERB (1; 1% instances), X (1; 1% instances)

82 (100%) SYM nodes are leaves.

The highest child degree of a SYM node is 0.


SYM in other languages: [am] [ar] [bg] [bxr] [ca] [ckb] [cop] [cs] [cu] [da] [de] [el] [en] [es] [et] [eu] [fa] [fi] [fo] [fr] [ga] [gl] [got] [grc] [he] [hi] [hr] [hu] [id] [it] [ja] [kk] [kmr] [ko] [la] [lv] [mr] [nl] [no] [pl] [pt] [ro] [ru] [sa] [sk] [sla] [sl] [so] [sr] [sv] [swl] [ta] [tr] [ug] [uk] [u] [urj] [ur] [vi] [yue] [zh]