home fi/pos edit page issue tracker

This page still pertains to UD version 1.

SYM: symbol

A symbol is a word-like entity that differs from ordinary words by form, function, or both.

Examples


Treebank Statistics (UD_Finnish)

There are 198 SYM lemmas (1%), 200 SYM types (0%) and 458 SYM tokens (0%). Out of 15 observed tags, the rank of SYM is: 8 in number of lemmas, 10 in number of types and 13 in number of tokens.

The 10 most frequent SYM lemmas: :), %, &, ;), :D, +, 3.Rf3, >, 2.f4, E21

The 10 most frequent SYM types: :), %, &, ;), :D, +, 3.Rf3, >, 2.f4, E21

The 10 most frequent ambiguous lemmas: :) (SYM 63, PUNCT 1), % (SYM 37, NOUN 9), & (SYM 21, PROPN 1), + (SYM 16, PROPN 2), °C (SYM 3, NOUN 1), A (NOUN 21, PROPN 7, SYM 1), B (NOUN 3, PROPN 1, SYM 1), K (PROPN 1, SYM 1), V (ADJ 10, SYM 1, NOUN 1), × (PROPN 4, SYM 1)

The 10 most frequent ambiguous types: :) (SYM 63, PUNCT 1), & (SYM 21, PROPN 1), + (SYM 16, PROPN 2), A (NOUN 9, PROPN 7, SYM 1), B (NOUN 3, SYM 1, PROPN 1), V (ADJ 7, SYM 1, NOUN 1), × (PROPN 4, SYM 1)

Morphology

The form / lemma ratio of SYM is 1.010101 (the average of all parts of speech is 2.037154).

The 1st highest number of forms (2) was observed with the lemma “SRT#8”: SRT-8, SRT-8:ssa.

The 2nd highest number of forms (2) was observed with the lemma “°C”: °C, °C:ta.

The 3rd highest number of forms (1) was observed with the lemma “#”: #.

SYM occurs with 1 features: fi-feat/Case (2; 0% instances)

SYM occurs with 2 feature-value pairs: Case=Ine, Case=Par

SYM occurs with 3 feature combinations. The most frequent feature combination is _ (456 tokens). Examples: :), %, &, ;), :D, +, 3.Rf3, >, 2.f4, E21

Relations

SYM nodes are attached to their parents using 24 different relations: fi-dep/discourse (117; 26% instances), fi-dep/flat:name (95; 21% instances), fi-dep/nmod (50; 11% instances), fi-dep/obj (27; 6% instances), fi-dep/appos (26; 6% instances), fi-dep/punct (26; 6% instances), fi-dep/nsubj (18; 4% instances), fi-dep/obl (17; 4% instances), fi-dep/conj (16; 3% instances), fi-dep/root (11; 2% instances), fi-dep/compound:nn (10; 2% instances), fi-dep/cc (9; 2% instances), fi-dep/nsubj:cop (8; 2% instances), fi-dep/advcl (6; 1% instances), fi-dep/compound (6; 1% instances), fi-dep/dep (3; 1% instances), fi-dep/nummod (3; 1% instances), fi-dep/parataxis (3; 1% instances), fi-dep/acl:relcl (2; 0% instances), fi-dep/advmod (1; 0% instances), fi-dep/amod (1; 0% instances), fi-dep/case (1; 0% instances), fi-dep/orphan (1; 0% instances), fi-dep/vocative (1; 0% instances)

Parents of SYM nodes belong to 11 different parts of speech: VERB (142; 31% instances), NOUN (141; 31% instances), SYM (83; 18% instances), ADJ (33; 7% instances), PROPN (26; 6% instances), ROOT (11; 2% instances), NUM (9; 2% instances), ADV (5; 1% instances), PRON (4; 1% instances), X (3; 1% instances), PUNCT (1; 0% instances)

298 (65%) SYM nodes are leaves.

41 (9%) SYM nodes have one child.

63 (14%) SYM nodes have two children.

56 (12%) SYM nodes have three or more children.

The highest child degree of a SYM node is 14.

Children of SYM nodes are attached using 21 different relations: fi-dep/punct (151; 36% instances), fi-dep/flat:name (90; 21% instances), fi-dep/nummod (49; 12% instances), fi-dep/nsubj:cop (18; 4% instances), fi-dep/conj (17; 4% instances), fi-dep/nmod (17; 4% instances), fi-dep/cop (15; 4% instances), fi-dep/advmod (12; 3% instances), fi-dep/cc (12; 3% instances), fi-dep/compound:nn (12; 3% instances), fi-dep/acl:relcl (5; 1% instances), fi-dep/appos (5; 1% instances), fi-dep/compound (4; 1% instances), fi-dep/obl (4; 1% instances), fi-dep/mark (3; 1% instances), fi-dep/acl (2; 0% instances), fi-dep/amod (2; 0% instances), fi-dep/advcl (1; 0% instances), fi-dep/case (1; 0% instances), fi-dep/nmod:poss (1; 0% instances), fi-dep/nsubj (1; 0% instances)

Children of SYM nodes belong to 13 different parts of speech: PUNCT (151; 36% instances), SYM (82; 19% instances), NUM (68; 16% instances), NOUN (58; 14% instances), AUX (15; 4% instances), ADV (13; 3% instances), CCONJ (12; 3% instances), VERB (9; 2% instances), ADJ (5; 1% instances), PRON (3; 1% instances), PROPN (3; 1% instances), SCONJ (2; 0% instances), ADP (1; 0% instances)


Treebank Statistics (UD_Finnish-FTB)

There are 6 SYM lemmas (0%), 6 SYM types (0%) and 21 SYM tokens (0%). Out of 17 observed tags, the rank of SYM is: 17 in number of lemmas, 17 in number of types and 17 in number of tokens.

The 10 most frequent SYM lemmas: %, &, /, +, *, @

The 10 most frequent SYM types: %, &, /, +, *, @

The 10 most frequent ambiguous lemmas:

The 10 most frequent ambiguous types:

Morphology

The form / lemma ratio of SYM is 1.000000 (the average of all parts of speech is 2.026917).

The 1st highest number of forms (1) was observed with the lemma “%”: %.

The 2nd highest number of forms (1) was observed with the lemma “&”: &.

The 3rd highest number of forms (1) was observed with the lemma “*”: *.

SYM does not occur with any features.

Relations

SYM nodes are attached to their parents using 1 different relations: fi-dep/dep (21; 100% instances)

Parents of SYM nodes belong to 3 different parts of speech: NOUN (10; 48% instances), PROPN (7; 33% instances), VERB (4; 19% instances)

12 (57%) SYM nodes are leaves.

7 (33%) SYM nodes have one child.

2 (10%) SYM nodes have two children.

The highest child degree of a SYM node is 2.

Children of SYM nodes are attached using 2 different relations: fi-dep/nummod (8; 73% instances), fi-dep/punct (3; 27% instances)

Children of SYM nodes belong to 2 different parts of speech: NUM (8; 73% instances), PUNCT (3; 27% instances)


SYM in other languages: [am] [ar] [bg] [bxr] [ca] [ckb] [cop] [cs] [cu] [da] [de] [el] [en] [es] [et] [eu] [fa] [fi] [fo] [fr] [ga] [gl] [got] [grc] [he] [hi] [hr] [hu] [id] [it] [ja] [kk] [kmr] [ko] [la] [lv] [mr] [nl] [no] [pl] [pt] [ro] [ru] [sa] [sk] [sla] [sl] [so] [sr] [sv] [swl] [ta] [tr] [ug] [uk] [u] [urj] [ur] [vi] [yue] [zh]