home en/pos edit page issue tracker

This page still pertains to UD version 1.

PART: particle

The following English words (only) are currently being treated as PART in English:

(This is a slightly motley list and we may still want to rethink this category for English….)

This covers PTB tags POS and some (old PTB style) or all uses of TO, and the subset of RB that is negation.


Treebank Statistics (UD_English)

There are 13 PART lemmas (0%), 21 PART types (0%) and 6197 PART tokens (3%). Out of 17 observed tags, the rank of PART is: 17 in number of lemmas, 17 in number of types and 12 in number of tokens.

The 10 most frequent PART lemmas: to, not, ‘s, s, ‘, na, ta, too, -s, 2

The 10 most frequent PART types: to, not, n’t, ‘s, s, nt, ‘, ’s, na, n’t

The 10 most frequent ambiguous lemmas: to (PART 3616, ADP 1999, SCONJ 73, ADV 8, NOUN 2, VERB 1), not (PART 1791, ADV 173, CCONJ 15), ’s (PART 644, PRON 14), s (PART 86, X 9, PRON 7, NOUN 3, PROPN 1), (PUNCT 222, PART 36, NOUN 7, AUX 1), ta (PART 5, ADP 4), too (ADV 135, PART 2, ADP 1), 2 (NUM 139, X 30, PART 1, PROPN 1), `s (AUX 9, PART 1), a (DET 4781, NOUN 19, X 13, ADV 5, ADP 4, PART 1, CCONJ 1, AUX 1)

The 10 most frequent ambiguous types: to (PART 3572, ADP 1976, SCONJ 71, ADV 8, NOUN 2, VERB 1), not (PART 891, ADV 148, CCONJ 13), ’s (PART 614, AUX 343, VERB 53, PRON 14), s (AUX 87, PART 85, X 9, VERB 8, PRON 7, NOUN 2, PROPN 1), (PUNCT 218, PART 33, NOUN 7), ’s (PART 29, AUX 11, VERB 3, PRON 1), ta (PART 5, ADP 3), (PART 3, PUNCT 2), n (PART 2, NOUN 1, CCONJ 1), too (ADV 126, PART 2, ADP 1)

Morphology

The form / lemma ratio of PART is 1.615385 (the average of all parts of speech is 1.181137).

The 1st highest number of forms (6) was observed with the lemma “not”: n, n’t, not, nt, n’t, t.

The 2nd highest number of forms (2) was observed with the lemma “’”: ’, ’.

The 3rd highest number of forms (2) was observed with the lemma “’s”: ’s, ’s.

PART does not occur with any features.

Relations

PART nodes are attached to their parents using 12 different relations: en-dep/mark (3616; 58% instances), en-dep/advmod (1756; 28% instances), en-dep/case (768; 12% instances), en-dep/conj (18; 0% instances), en-dep/xcomp (12; 0% instances), en-dep/advcl (9; 0% instances), en-dep/fixed (7; 0% instances), en-dep/parataxis (3; 0% instances), en-dep/root (3; 0% instances), en-dep/ccomp (2; 0% instances), en-dep/compound (2; 0% instances), en-dep/reparandum (1; 0% instances)

Parents of PART nodes belong to 15 different parts of speech: VERB (4794; 77% instances), PROPN (519; 8% instances), NOUN (464; 7% instances), ADJ (293; 5% instances), ADV (64; 1% instances), PRON (29; 0% instances), AUX (13; 0% instances), DET (6; 0% instances), NUM (4; 0% instances), PART (3; 0% instances), ROOT (3; 0% instances), SCONJ (2; 0% instances), ADP (1; 0% instances), CCONJ (1; 0% instances), SYM (1; 0% instances)

6133 (99%) PART nodes are leaves.

47 (1%) PART nodes have one child.

10 (0%) PART nodes have two children.

7 (0%) PART nodes have three or more children.

The highest child degree of a PART node is 4.

Children of PART nodes are attached using 15 different relations: en-dep/punct (35; 39% instances), en-dep/cc (20; 22% instances), en-dep/mark (11; 12% instances), en-dep/advmod (10; 11% instances), en-dep/advcl (3; 3% instances), en-dep/conj (2; 2% instances), en-dep/_ (1; 1% instances), en-dep/amod (1; 1% instances), en-dep/ccomp (1; 1% instances), en-dep/csubj (1; 1% instances), en-dep/discourse (1; 1% instances), en-dep/flat (1; 1% instances), en-dep/nsubj (1; 1% instances), en-dep/orphan (1; 1% instances), en-dep/parataxis (1; 1% instances)

Children of PART nodes belong to 9 different parts of speech: PUNCT (35; 39% instances), CCONJ (20; 22% instances), ADV (12; 13% instances), SCONJ (11; 12% instances), VERB (7; 8% instances), ADJ (2; 2% instances), INTJ (1; 1% instances), NOUN (1; 1% instances), PROPN (1; 1% instances)


Treebank Statistics (UD_English-ESL)

There are 1 PART lemmas (6%), 1 PART types (6%) and 3169 PART tokens (4%). Out of 17 observed tags, the rank of PART is: 10 in number of lemmas, 10 in number of types and 10 in number of tokens.

The 10 most frequent PART lemmas: _

The 10 most frequent PART types: _

The 10 most frequent ambiguous lemmas: _ (NOUN 14135, VERB 13583, PRON 9575, DET 9068, PUNCT 8624, ADP 7769, ADJ 5278, ADV 5121, AUX 4111, PART 3169, CONJ 2865, SCONJ 2278, PROPN 1574, NUM 776, INTJ 67, X 60, SYM 37)

The 10 most frequent ambiguous types: _ (NOUN 14135, VERB 13583, PRON 9575, DET 9068, PUNCT 8624, ADP 7769, ADJ 5278, ADV 5121, AUX 4111, PART 3169, CONJ 2865, SCONJ 2278, PROPN 1574, NUM 776, INTJ 67, X 60, SYM 37)

Morphology

The form / lemma ratio of PART is 1.000000 (the average of all parts of speech is 1.000000).

The 1st highest number of forms (1) was observed with the lemma “_”: _.

PART does not occur with any features.

Relations

PART nodes are attached to their parents using 11 different relations: en-dep/mark (2159; 68% instances), en-dep/neg (842; 27% instances), en-dep/case (140; 4% instances), en-dep/conj (7; 0% instances), en-dep/nmod (5; 0% instances), en-dep/mwe (4; 0% instances), en-dep/compound:prt (3; 0% instances), en-dep/nsubj (3; 0% instances), en-dep/xcomp (3; 0% instances), en-dep/advmod (2; 0% instances), en-dep/cc (1; 0% instances)

Parents of PART nodes belong to 12 different parts of speech: VERB (2667; 84% instances), NOUN (214; 7% instances), ADJ (186; 6% instances), PROPN (43; 1% instances), ADV (29; 1% instances), PRON (16; 1% instances), AUX (5; 0% instances), SCONJ (3; 0% instances), DET (2; 0% instances), NUM (2; 0% instances), ADP (1; 0% instances), PART (1; 0% instances)

3164 (100%) PART nodes are leaves.

3 (0%) PART nodes have one child.

2 (0%) PART nodes have two children.

The highest child degree of a PART node is 2.

Children of PART nodes are attached using 6 different relations: en-dep/mwe (2; 29% instances), en-dep/aux (1; 14% instances), en-dep/cop (1; 14% instances), en-dep/dobj (1; 14% instances), en-dep/nsubj (1; 14% instances), en-dep/punct (1; 14% instances)

Children of PART nodes belong to 6 different parts of speech: VERB (2; 29% instances), AUX (1; 14% instances), DET (1; 14% instances), PART (1; 14% instances), PRON (1; 14% instances), PUNCT (1; 14% instances)


Treebank Statistics (UD_English-LinES)

There are 1 PART lemmas (6%), 7 PART types (0%) and 1703 PART tokens (3%). Out of 17 observed tags, the rank of PART is: 10 in number of lemmas, 16 in number of types and 12 in number of tokens.

The 10 most frequent PART lemmas: _

The 10 most frequent PART types: to, not, ‘s, n’t, ‘, in, t’

The 10 most frequent ambiguous lemmas: _ (NOUN 12161, PUNCT 8085, VERB 8020, ADP 6788, DET 6429, PRON 6303, ADJ 4270, ADV 3700, AUX 3539, PROPN 2257, CCONJ 2081, PART 1703, SCONJ 1231, NUM 462, INTJ 122, X 41, SYM 5)

The 10 most frequent ambiguous types: to (PART 901, ADP 685), ’s (PART 234, AUX 107, VERB 35, PRON 1), n’t (PART 183, ADV 22), (PUNCT 48, PART 21), in (ADP 910, ADV 34, PART 3, ADJ 1)

Morphology

The form / lemma ratio of PART is 7.000000 (the average of all parts of speech is 527.705882).

The 1st highest number of forms (7) was observed with the lemma “_”: ’, ‘s, in, n’t, not, t’, to.

PART does not occur with any features.

Relations

PART nodes are attached to their parents using 7 different relations: en-dep/mark (919; 54% instances), en-dep/advmod (519; 30% instances), en-dep/case (255; 15% instances), en-dep/conj (5; 0% instances), en-dep/amod (2; 0% instances), en-dep/obj (2; 0% instances), en-dep/root (1; 0% instances)

Parents of PART nodes belong to 14 different parts of speech: VERB (1238; 73% instances), NOUN (176; 10% instances), PROPN (152; 9% instances), ADJ (79; 5% instances), ADV (23; 1% instances), AUX (14; 1% instances), PRON (12; 1% instances), DET (2; 0% instances), SCONJ (2; 0% instances), ADP (1; 0% instances), CCONJ (1; 0% instances), NUM (1; 0% instances), PUNCT (1; 0% instances), ROOT (1; 0% instances)

1686 (99%) PART nodes are leaves.

9 (1%) PART nodes have one child.

8 (0%) PART nodes have two children.

The highest child degree of a PART node is 2.

Children of PART nodes are attached using 10 different relations: en-dep/cc (7; 28% instances), en-dep/fixed (6; 24% instances), en-dep/mark (4; 16% instances), en-dep/punct (2; 8% instances), en-dep/advcl (1; 4% instances), en-dep/advmod (1; 4% instances), en-dep/case (1; 4% instances), en-dep/conj (1; 4% instances), en-dep/nmod (1; 4% instances), en-dep/nsubj (1; 4% instances)

Children of PART nodes belong to 8 different parts of speech: CCONJ (7; 28% instances), ADP (5; 20% instances), NOUN (4; 16% instances), SCONJ (3; 12% instances), PUNCT (2; 8% instances), VERB (2; 8% instances), ADV (1; 4% instances), PRON (1; 4% instances)


Treebank Statistics (UD_English-ParTUT)

There are 4 PART lemmas (0%), 4 PART types (0%) and 1012 PART tokens (3%). Out of 17 observed tags, the rank of PART is: 15 in number of lemmas, 15 in number of types and 12 in number of tokens.

The 10 most frequent PART lemmas: to, ‘s, not, ‘

The 10 most frequent PART types: to, ‘s, not, ‘

The 10 most frequent ambiguous lemmas: to (PART 509, ADP 453, SCONJ 3), (PART 33, PUNCT 12, X 10, ADP 1)

The 10 most frequent ambiguous types: to (PART 509, ADP 441, SCONJ 3), ’s (PART 291, AUX 11, VERB 3), (PART 33, PUNCT 12, X 10)

Morphology

The form / lemma ratio of PART is 1.000000 (the average of all parts of speech is 1.187751).

The 1st highest number of forms (1) was observed with the lemma “’”: .

The 2nd highest number of forms (1) was observed with the lemma “’s”: ’s.

The 3rd highest number of forms (1) was observed with the lemma “not”: not.

PART occurs with 1 features: en-feat/Polarity (179; 18% instances)

PART occurs with 1 feature-value pairs: Polarity=Neg

PART occurs with 2 feature combinations. The most frequent feature combination is _ (833 tokens). Examples: to, ‘s, ‘

Relations

PART nodes are attached to their parents using 3 different relations: en-dep/mark (509; 50% instances), en-dep/case (324; 32% instances), en-dep/advmod (179; 18% instances)

Parents of PART nodes belong to 8 different parts of speech: VERB (589; 58% instances), PROPN (181; 18% instances), NOUN (176; 17% instances), ADJ (27; 3% instances), ADV (27; 3% instances), AUX (10; 1% instances), NUM (1; 0% instances), PRON (1; 0% instances)

1011 (100%) PART nodes are leaves.

1 (0%) PART nodes have one child.

The highest child degree of a PART node is 1.

Children of PART nodes are attached using 1 different relations: en-dep/punct (1; 100% instances)

Children of PART nodes belong to 1 different parts of speech: PUNCT (1; 100% instances)


PART in other languages: [am] [ar] [bg] [bxr] [ca] [ckb] [cop] [cs] [cu] [da] [de] [el] [en] [es] [et] [eu] [fa] [fi] [fo] [fr] [ga] [gl] [got] [grc] [he] [hi] [hr] [hu] [id] [it] [ja] [kk] [kmr] [ko] [la] [lv] [mr] [nl] [no] [pl] [pt] [ro] [ru] [sa] [sk] [sla] [sl] [so] [sr] [sv] [swl] [ta] [tr] [ug] [uk] [u] [urj] [ur] [vi] [yue] [zh]