SYM
: symbol
The English SYM
covers PTB tags NFP (except for lines of separators, which become PUNCT), #, $, SYM, and for the percent sign (%).
Treebank Statistics (UD_English)
There are 77 SYM
lemmas (0%), 77 SYM
types (0%) and 666 SYM
tokens (0%).
Out of 17 observed tags, the rank of SYM
is: 12 in number of lemmas, 12 in number of types and 17 in number of tokens.
The 10 most frequent SYM
lemmas: $, -, :), %, /, +, |, :(, :-), x
The 10 most frequent SYM
types: $, -, :), %, /, +, |, :(, :-), x
The 10 most frequent ambiguous lemmas: $ (SYM 264, NOUN 3), - (PUNCT 1478, SYM 105, X 11), :) (SYM 46, PUNCT 2), % (SYM 36, X 1), / (PUNCT 210, SYM 28, X 1), + (SYM 21, CCONJ 1), | (SYM 20, PUNCT 1), x (NOUN 10, SYM 6, X 2, ADP 1), … (PUNCT 284, SYM 4), = (PUNCT 5, SYM 4)
The 10 most frequent ambiguous types: $ (SYM 264, NOUN 3), - (PUNCT 1460, SYM 105, X 11), :) (SYM 46, PUNCT 2), % (SYM 36, X 1), / (PUNCT 210, SYM 28, X 1), + (SYM 21, CCONJ 1), | (SYM 20, PUNCT 1), x (SYM 5, NOUN 5, X 1), … (PUNCT 284, SYM 4), = (PUNCT 5, SYM 4)
- $
- -
- :)
- %
- /
- +
- |
- x
- …
- =
Morphology
The form / lemma ratio of SYM
is 1.000000 (the average of all parts of speech is 1.181137).
The 1st highest number of forms (1) was observed with the lemma “###”: ###.
The 2nd highest number of forms (1) was observed with the lemma “$”: $.
The 3rd highest number of forms (1) was observed with the lemma “%”: %.
SYM
occurs with 1 features: en-feat/Number (38; 6% instances)
SYM
occurs with 1 feature-value pairs: Number=Sing
SYM
occurs with 2 feature combinations.
The most frequent feature combination is _
(628 tokens).
Examples: $, -, :), /, +, |, :(, :-), x, ====================================================
Relations
SYM
nodes are attached to their parents using 25 different relations: en-dep/case (107; 16% instances), en-dep/root (101; 15% instances), en-dep/discourse (91; 14% instances), en-dep/punct (66; 10% instances), en-dep/obj (59; 9% instances), en-dep/compound (54; 8% instances), en-dep/obl (35; 5% instances), en-dep/nmod (34; 5% instances), en-dep/cc (21; 3% instances), en-dep/appos (20; 3% instances), en-dep/obl:npmod (18; 3% instances), en-dep/conj (17; 3% instances), en-dep/advmod (11; 2% instances), en-dep/parataxis (7; 1% instances), en-dep/list (5; 1% instances), en-dep/nmod:npmod (5; 1% instances), en-dep/nsubj:pass (5; 1% instances), en-dep/advcl (2; 0% instances), en-dep/nsubj (2; 0% instances), en-dep/acl:relcl (1; 0% instances), en-dep/amod (1; 0% instances), en-dep/ccomp (1; 0% instances), en-dep/goeswith (1; 0% instances), en-dep/reparandum (1; 0% instances), en-dep/xcomp (1; 0% instances)
Parents of SYM
nodes belong to 13 different parts of speech: VERB (174; 26% instances), NOUN (165; 25% instances), ROOT (101; 15% instances), NUM (100; 15% instances), ADJ (37; 6% instances), PROPN (29; 4% instances), SYM (27; 4% instances), X (17; 3% instances), ADV (10; 2% instances), CCONJ (2; 0% instances), DET (2; 0% instances), ADP (1; 0% instances), PRON (1; 0% instances)
345 (52%) SYM
nodes are leaves.
96 (14%) SYM
nodes have one child.
86 (13%) SYM
nodes have two children.
139 (21%) SYM
nodes have three or more children.
The highest child degree of a SYM
node is 10.
Children of SYM
nodes are attached using 27 different relations: en-dep/nummod (267; 32% instances), en-dep/punct (164; 20% instances), en-dep/case (69; 8% instances), en-dep/appos (65; 8% instances), en-dep/compound (49; 6% instances), en-dep/nmod (48; 6% instances), en-dep/advmod (36; 4% instances), en-dep/cop (21; 3% instances), en-dep/nsubj (20; 2% instances), en-dep/det (19; 2% instances), en-dep/conj (16; 2% instances), en-dep/cc (14; 2% instances), en-dep/advcl (9; 1% instances), en-dep/amod (6; 1% instances), en-dep/nmod:npmod (4; 0% instances), en-dep/parataxis (4; 0% instances), en-dep/acl:relcl (3; 0% instances), en-dep/mark (3; 0% instances), en-dep/nmod:poss (2; 0% instances), en-dep/obl (2; 0% instances), en-dep/_ (1; 0% instances), en-dep/acl (1; 0% instances), en-dep/aux (1; 0% instances), en-dep/discourse (1; 0% instances), en-dep/goeswith (1; 0% instances), en-dep/obj (1; 0% instances), en-dep/xcomp (1; 0% instances)
Children of SYM
nodes belong to 16 different parts of speech: NUM (315; 38% instances), PUNCT (161; 19% instances), NOUN (122; 15% instances), ADP (68; 8% instances), ADV (28; 3% instances), SYM (26; 3% instances), AUX (22; 3% instances), DET (22; 3% instances), VERB (17; 2% instances), CCONJ (14; 2% instances), ADJ (13; 2% instances), PRON (12; 1% instances), PROPN (4; 0% instances), SCONJ (2; 0% instances), PART (1; 0% instances), X (1; 0% instances)
Treebank Statistics (UD_English-ESL)
There are 1 SYM
lemmas (6%), 1 SYM
types (6%) and 37 SYM
tokens (0%).
Out of 17 observed tags, the rank of SYM
is: 15 in number of lemmas, 15 in number of types and 17 in number of tokens.
The 10 most frequent SYM
lemmas: _
The 10 most frequent SYM
types: _
The 10 most frequent ambiguous lemmas: _ (NOUN 14135, VERB 13583, PRON 9575, DET 9068, PUNCT 8624, ADP 7769, ADJ 5278, ADV 5121, AUX 4111, PART 3169, CONJ 2865, SCONJ 2278, PROPN 1574, NUM 776, INTJ 67, X 60, SYM 37)
The 10 most frequent ambiguous types: _ (NOUN 14135, VERB 13583, PRON 9575, DET 9068, PUNCT 8624, ADP 7769, ADJ 5278, ADV 5121, AUX 4111, PART 3169, CONJ 2865, SCONJ 2278, PROPN 1574, NUM 776, INTJ 67, X 60, SYM 37)
- _
- NOUN 14135: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- VERB 13583: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- PRON 9575: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- DET 9068: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- PUNCT 8624: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- ADP 7769: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- ADJ 5278: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- ADV 5121: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- AUX 4111: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- PART 3169: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- CONJ 2865: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- SCONJ 2278: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- PROPN 1574: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- NUM 776: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- INTJ 67: _ _ _ _ _ _ _ _ _ _ _ _
- X 60: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- SYM 37: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
Morphology
The form / lemma ratio of SYM
is 1.000000 (the average of all parts of speech is 1.000000).
The 1st highest number of forms (1) was observed with the lemma “_”: _.
SYM
does not occur with any features.
Relations
SYM
nodes are attached to their parents using 9 different relations: en-dep/dobj (7; 19% instances), en-dep/nmod (7; 19% instances), en-dep/compound (6; 16% instances), en-dep/conj (5; 14% instances), en-dep/nsubj (5; 14% instances), en-dep/appos (2; 5% instances), en-dep/punct (2; 5% instances), en-dep/root (2; 5% instances), en-dep/acl:relcl (1; 3% instances)
Parents of SYM
nodes belong to 7 different parts of speech: NOUN (13; 35% instances), VERB (12; 32% instances), SYM (7; 19% instances), ROOT (2; 5% instances), ADJ (1; 3% instances), NUM (1; 3% instances), PROPN (1; 3% instances)
2 (5%) SYM
nodes are leaves.
14 (38%) SYM
nodes have one child.
9 (24%) SYM
nodes have two children.
12 (32%) SYM
nodes have three or more children.
The highest child degree of a SYM
node is 13.
Children of SYM
nodes are attached using 13 different relations: en-dep/nummod (35; 41% instances), en-dep/punct (10; 12% instances), en-dep/case (7; 8% instances), en-dep/nmod (7; 8% instances), en-dep/conj (6; 7% instances), en-dep/advmod (4; 5% instances), en-dep/acl:relcl (3; 3% instances), en-dep/cc (3; 3% instances), en-dep/cop (3; 3% instances), en-dep/det (3; 3% instances), en-dep/nsubj (3; 3% instances), en-dep/amod (1; 1% instances), en-dep/appos (1; 1% instances)
Children of SYM
nodes belong to 10 different parts of speech: NUM (35; 41% instances), PUNCT (10; 12% instances), NOUN (8; 9% instances), ADP (7; 8% instances), SYM (7; 8% instances), VERB (7; 8% instances), ADV (4; 5% instances), DET (4; 5% instances), CONJ (3; 3% instances), ADJ (1; 1% instances)
Treebank Statistics (UD_English-LinES)
There are 1 SYM
lemmas (6%), 2 SYM
types (0%) and 5 SYM
tokens (0%).
Out of 17 observed tags, the rank of SYM
is: 15 in number of lemmas, 17 in number of types and 17 in number of tokens.
The 10 most frequent SYM
lemmas: _
The 10 most frequent SYM
types: %, -%
The 10 most frequent ambiguous lemmas: _ (NOUN 12161, PUNCT 8085, VERB 8020, ADP 6788, DET 6429, PRON 6303, ADJ 4270, ADV 3700, AUX 3539, PROPN 2257, CCONJ 2081, PART 1703, SCONJ 1231, NUM 462, INTJ 122, X 41, SYM 5)
The 10 most frequent ambiguous types: % (NOUN 3, SYM 3), -% (SYM 2, NOUN 1)
- %
- -%
Morphology
The form / lemma ratio of SYM
is 2.000000 (the average of all parts of speech is 527.705882).
The 1st highest number of forms (2) was observed with the lemma “_”: %, -%.
SYM
does not occur with any features.
Relations
SYM
nodes are attached to their parents using 4 different relations: en-dep/advmod (2; 40% instances), en-dep/amod (1; 20% instances), en-dep/appos (1; 20% instances), en-dep/obj (1; 20% instances)
Parents of SYM
nodes belong to 2 different parts of speech: VERB (3; 60% instances), NOUN (2; 40% instances)
0 (0%) SYM
nodes are leaves.
0 (0%) SYM
nodes have one child.
3 (60%) SYM
nodes have two children.
2 (40%) SYM
nodes have three or more children.
The highest child degree of a SYM
node is 3.
Children of SYM
nodes are attached using 5 different relations: en-dep/nummod (4; 33% instances), en-dep/punct (4; 33% instances), en-dep/case (2; 17% instances), en-dep/det (1; 8% instances), en-dep/mark (1; 8% instances)
Children of SYM
nodes belong to 5 different parts of speech: NUM (4; 33% instances), PUNCT (4; 33% instances), ADP (2; 17% instances), ADV (1; 8% instances), DET (1; 8% instances)
Treebank Statistics (UD_English-ParTUT)
There are 2 SYM
lemmas (0%), 2 SYM
types (0%) and 39 SYM
tokens (0%).
Out of 17 observed tags, the rank of SYM
is: 17 in number of lemmas, 17 in number of types and 16 in number of tokens.
The 10 most frequent SYM
lemmas: %, $
The 10 most frequent SYM
types: %, $
The 10 most frequent ambiguous lemmas:
The 10 most frequent ambiguous types:
Morphology
The form / lemma ratio of SYM
is 1.000000 (the average of all parts of speech is 1.187751).
The 1st highest number of forms (1) was observed with the lemma “$”: $.
The 2nd highest number of forms (1) was observed with the lemma “%”: %.
SYM
does not occur with any features.
Relations
SYM
nodes are attached to their parents using 7 different relations: en-dep/obl (18; 46% instances), en-dep/nmod (12; 31% instances), en-dep/nsubj (3; 8% instances), en-dep/conj (2; 5% instances), en-dep/obj (2; 5% instances), en-dep/nsubj:pass (1; 3% instances), en-dep/root (1; 3% instances)
Parents of SYM
nodes belong to 6 different parts of speech: VERB (22; 56% instances), NOUN (12; 31% instances), PROPN (2; 5% instances), ADJ (1; 3% instances), ADV (1; 3% instances), ROOT (1; 3% instances)
2 (5%) SYM
nodes are leaves.
8 (21%) SYM
nodes have one child.
20 (51%) SYM
nodes have two children.
9 (23%) SYM
nodes have three or more children.
The highest child degree of a SYM
node is 7.
Children of SYM
nodes are attached using 12 different relations: en-dep/nummod (37; 44% instances), en-dep/case (22; 26% instances), en-dep/nmod (7; 8% instances), en-dep/advmod (6; 7% instances), en-dep/cc (2; 2% instances), en-dep/cop (2; 2% instances), en-dep/det (2; 2% instances), en-dep/nsubj (2; 2% instances), en-dep/punct (2; 2% instances), en-dep/advcl (1; 1% instances), en-dep/amod (1; 1% instances), en-dep/orphan (1; 1% instances)
Children of SYM
nodes belong to 10 different parts of speech: NUM (37; 44% instances), ADP (22; 26% instances), NOUN (10; 12% instances), ADV (5; 6% instances), ADJ (2; 2% instances), AUX (2; 2% instances), CCONJ (2; 2% instances), DET (2; 2% instances), PUNCT (2; 2% instances), VERB (1; 1% instances)
SYM in other languages: [am] [ar] [bg] [bxr] [ca] [ckb] [cop] [cs] [cu] [da] [de] [el] [en] [es] [et] [eu] [fa] [fi] [fo] [fr] [ga] [gl] [got] [grc] [he] [hi] [hr] [hu] [id] [it] [ja] [kk] [kmr] [ko] [la] [lv] [mr] [nl] [no] [pl] [pt] [ro] [ru] [sa] [sk] [sla] [sl] [so] [sr] [sv] [swl] [ta] [tr] [ug] [uk] [u] [urj] [ur] [vi] [yue] [zh]