
This page still pertains to UD version 1.

PART: particle

Definition

PART for Japanese covers function words that are not classified as ADP, CCONJ, or SCONJ. Specifically, PART corresponds to final postpositional particles (particle(phrase_final) / 助詞-終助詞 in UniDic) and to suffixes that change the category of a phrase.

Examples


Treebank Statistics (UD_Japanese)

There are 63 PART lemmas (0%), 70 PART types (0%) and 2273 PART tokens (1%). Out of 14 observed tags, the rank of PART is: 9 in number of lemmas, 9 in number of types and 10 in number of tokens.

The 10 most frequent PART lemmas: の, か, 第, 約, とともに, において, さ, -, よ, ん

The 10 most frequent PART types: の, か, 第, 約, において, さ, -, よ, ん, 年

The 10 most frequent ambiguous lemmas: の (ADP 7759, PART 1040), 約 (PART 91, ADV 2), - (PART 48, NOUN 46, SYM 28), ん (PART 42, PROPN 1), 年 (NOUN 914, PART 40), に (ADP 5055, PART 32), ね (PART 31, NOUN 7, AUX 1), 翌 (PART 14, NOUN 2), ~ (SYM 14, PART 13), 同 (ADJ 80, NOUN 22, PART 9)

The 10 most frequent ambiguous types: の (ADP 7759, PART 1039, AUX 67), 約 (PART 91, ADV 2), さ (AUX 1061, VERB 94, PART 49), - (PART 48, NOUN 46, SYM 28), ん (AUX 132, PART 42, PROPN 1), 年 (NOUN 914, PART 40), に (ADP 5055, AUX 562, PART 32, SCONJ 16), ね (PART 31, NOUN 7, AUX 1), な (AUX 826, PART 21, VERB 3), 翌 (PART 14, NOUN 2)
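The ambiguity lists above pair each lemma or type with the tags it was observed under. As a rough sketch of how such counts could be gathered from a CoNLL-U file, the snippet below tallies UPOS tags per lemma. The sample sentences are hypothetical toy data, not taken from UD_Japanese, and the function names `tag_counts_by_lemma` and `ambiguous_lemmas` are illustrative assumptions rather than part of any UD tool.

```python
from collections import Counter, defaultdict

# Toy CoNLL-U fragment (hypothetical data, not taken from UD_Japanese).
# Columns: ID FORM LEMMA UPOS XPOS FEATS HEAD DEPREL DEPS MISC
SAMPLE = """\
1\t本\t本\tNOUN\t_\t_\t3\tnmod\t_\t_
2\tの\tの\tADP\t_\t_\t1\tcase\t_\t_
3\t表紙\t表紙\tNOUN\t_\t_\t0\troot\t_\t_

1\t行く\t行く\tVERB\t_\t_\t0\troot\t_\t_
2\tの\tの\tPART\t_\t_\t1\tmark\t_\t_
3\tか\tか\tPART\t_\t_\t1\tmark\t_\t_
"""


def tag_counts_by_lemma(conllu_text):
    """Map each lemma to a Counter of the UPOS tags it occurs with."""
    counts = defaultdict(Counter)
    for line in conllu_text.splitlines():
        # Skip blank sentence separators and comment lines.
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        # Keep only regular word lines (plain integer IDs, 10 columns).
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        counts[cols[2]][cols[3]] += 1
    return counts


def ambiguous_lemmas(counts, tag):
    """Lemmas seen with `tag` and with at least one other tag."""
    return {
        lemma: dict(tags)
        for lemma, tags in counts.items()
        if tag in tags and len(tags) > 1
    }


counts = tag_counts_by_lemma(SAMPLE)
print(ambiguous_lemmas(counts, "PART"))
```

Swapping column index 2 (LEMMA) for column index 1 (FORM) would give the per-type variant of the same count.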

Morphology

The form / lemma ratio of PART is 1.111111 (the average of all parts of speech is 1.059217).
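The reported ratio is consistent with dividing the number of distinct PART types by the number of distinct PART lemmas given earlier (70 types, 63 lemmas). A minimal check, assuming that is how the figure is derived; the helper name `form_lemma_ratio` is hypothetical:

```python
def form_lemma_ratio(n_types, n_lemmas):
    """Distinct word forms divided by distinct lemmas."""
    return n_types / n_lemmas


# Counts reported for PART in UD_Japanese: 70 types, 63 lemmas.
ratio = form_lemma_ratio(70, 63)
print(f"{ratio:.6f}")  # 1.111111
```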

The 1st highest number of forms (4) was observed with the lemma “をもとに”: をもとに, を元に, を基に, を基にして.

The 2nd highest number of forms (3) was observed with the lemma “か”: か, かどうか, か否か.

The 3rd highest number of forms (2) was observed with the lemma “とともに”: とともに, と共に.

PART does not occur with any features.

Relations

PART nodes are attached to their parents using 5 different relations: ja-dep/mark (913; 40% instances), ja-dep/case (845; 37% instances), ja-dep/amod (469; 21% instances), ja-dep/aux (45; 2% instances), ja-dep/dep (1; 0% instances)
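The percentages in the relation list can be reproduced from the raw counts, which sum to the 2273 PART tokens of the treebank. The sketch below assumes the shares are rounded to the nearest whole percent; the variable names are illustrative.

```python
# Relation counts for PART nodes in UD_Japanese, as listed above.
relations = {"mark": 913, "case": 845, "amod": 469, "aux": 45, "dep": 1}

total = sum(relations.values())

# Share of each relation, rounded to a whole percent.
shares = {rel: round(100 * n / total) for rel, n in relations.items()}

print(total)   # 2273
print(shares)  # {'mark': 40, 'case': 37, 'amod': 21, 'aux': 2, 'dep': 0}
```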

Parents of PART nodes belong to 8 different parts of speech: NOUN (793; 35% instances), VERB (647; 28% instances), NUM (358; 16% instances), ADJ (189; 8% instances), PROPN (166; 7% instances), ADV (86; 4% instances), PRON (33; 1% instances), CCONJ (1; 0% instances)

2272 (100%) PART nodes are leaves.

1 (0%) PART nodes have one child.

The highest child degree of a PART node is 1.

Children of PART nodes are attached using 1 relation: ja-dep/dep (1; 100% instances)

Children of PART nodes belong to 1 part of speech: SCONJ (1; 100% instances)
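The leaf counts and child degrees above depend only on the HEAD column of each tree. A minimal sketch of that computation follows, using a hypothetical toy tree rather than treebank data; `child_degrees` is an illustrative name, not a UD tool function.

```python
from collections import Counter


def child_degrees(sentence, tag="PART"):
    """Return the number of children of every node tagged `tag`.

    `sentence` is a list of (id, upos, head) tuples for one tree,
    where head 0 marks the root.
    """
    n_children = Counter(head for _, _, head in sentence)
    return [n_children[tok_id] for tok_id, upos, _ in sentence if upos == tag]


# Hypothetical toy tree: token 3 (PART) has one SCONJ child, token 2 has none.
toy = [(1, "VERB", 0), (2, "PART", 1), (3, "PART", 1), (4, "SCONJ", 3)]

degrees = child_degrees(toy)
leaves = sum(d == 0 for d in degrees)

print(degrees)       # [0, 1]
print(leaves)        # 1
print(max(degrees))  # 1
```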


Treebank Statistics (UD_Japanese-KTC)

There are 12 PART lemmas (0%), 1 PART type (6%) and 1921 PART tokens (1%). Out of 16 observed tags, the rank of PART is: 10 in number of lemmas, 10 in number of types and 12 in number of tokens.

The 10 most frequent PART lemmas: _, くらい, な, ばかり, ほど, タイラ, きり, 入る, 八, 太い

The 10 most frequent PART types: _

The 10 most frequent ambiguous lemmas: _ (NOUN 52356, ADP 40131, PUNCT 20670, AUX 7362, SCONJ 6334, NUM 6286, VERB 6156, ADJ 2302, PART 1887, CONJ 1517, PROPN 1293, ADV 1200, SYM 865, PRON 102, DET 68, INTJ 8), 入る (VERB 58, PART 1), 八 (NUM 65, PART 1), 我が (ADJ 3, PART 1)

The 10 most frequent ambiguous types: _ (NOUN 59392, ADP 40132, PUNCT 20670, AUX 20538, VERB 17383, NUM 7782, SCONJ 6539, PROPN 5774, ADJ 3509, CONJ 1977, ADV 1949, PART 1921, SYM 865, DET 751, PRON 744, INTJ 17)

Morphology

The form / lemma ratio of PART is 0.083333 (the average of all parts of speech is 0.003541).

The 1st highest number of forms (1) was observed with the lemma “_”: _.

The 2nd highest number of forms (1) was observed with the lemma “きり”: _.

The 3rd highest number of forms (1) was observed with the lemma “くらい”: _.

PART does not occur with any features.

Relations

PART nodes are attached to their parents using 5 different relations: ja-dep/case (1239; 64% instances), ja-dep/mark (599; 31% instances), ja-dep/dep (81; 4% instances), ja-dep/mwe (1; 0% instances), ja-dep/root (1; 0% instances)

Parents of PART nodes belong to 10 different parts of speech: NOUN (978; 51% instances), VERB (593; 31% instances), PRON (156; 8% instances), ADJ (70; 4% instances), ADV (61; 3% instances), PROPN (53; 3% instances), CONJ (5; 0% instances), NUM (3; 0% instances), ADP (1; 0% instances), ROOT (1; 0% instances)

1920 (100%) PART nodes are leaves.

0 (0%) PART nodes have one child.

0 (0%) PART nodes have two children.

1 (0%) PART nodes have three or more children.

The highest child degree of a PART node is 5.

Children of PART nodes are attached using 5 different relations: ja-dep/advmod (1; 20% instances), ja-dep/aux (1; 20% instances), ja-dep/dep (1; 20% instances), ja-dep/nsubj (1; 20% instances), ja-dep/punct (1; 20% instances)

Children of PART nodes belong to 4 different parts of speech: AUX (2; 40% instances), ADV (1; 20% instances), NOUN (1; 20% instances), PUNCT (1; 20% instances)

