home ja/pos edit page issue tracker

This page still pertains to UD version 1.

PRON: pronoun

Definition

Pronouns are words that substitute for nouns or noun phrases, whose meaning is recoverable from the linguistic or extralinguistic context.

Since Japanese does not have a specific class of posessive personal pronoun, 我が  “my” is classified into ADJ as well as other words in the same class in UniDic, instead of labeling DET or PRON.

Examples


Treebank Statistics (UD_Japanese)

There are 49 PRON lemmas (0%), 49 PRON types (0%) and 931 PRON tokens (1%). Out of 14 observed tags, the rank of PRON is: 11 in number of lemmas, 11 in number of types and 13 in number of tokens.

The 10 most frequent PRON lemmas: これ, それ, ここ, 彼, 私, これら, そこ, こちら, 彼女, 彼ら

The 10 most frequent PRON types: これ, それ, ここ, 彼, 私, これら, そこ, こちら, 彼女, 彼ら

The 10 most frequent ambiguous lemmas: 私 (PRON 72, NOUN 4), そこ (PRON 37, NOUN 1), いずれ (PRON 22, ADV 1), それぞれ (ADV 20, PRON 15), その他 (PRON 8, NOUN 1), みんな (ADV 9, PRON 6), 僕 (PRON 6, NOUN 1), 君 (NOUN 8, PRON 5), 皆 (PRON 4, NOUN 2)

The 10 most frequent ambiguous types: 私 (PRON 72, NOUN 4), そこ (PRON 37, NOUN 1), いずれ (PRON 22, ADV 1), それぞれ (ADV 20, PRON 15), その他 (PRON 8, NOUN 1), みんな (ADV 9, PRON 6), 僕 (PRON 6, NOUN 1), わたし (PRON 5, VERB 1), 君 (NOUN 8, PRON 5), 皆 (PRON 4, NOUN 2)

Morphology

The form / lemma ratio of PRON is 1.000000 (the average of all parts of speech is 1.059217).

The 1st highest number of forms (1) was observed with the lemma “あちこち”: あちこち.

The 2nd highest number of forms (1) was observed with the lemma “あちら”: あちら.

The 3rd highest number of forms (1) was observed with the lemma “あなた”: あなた.

PRON does not occur with any features.

Relations

PRON nodes are attached to their parents using 7 different relations: ja-dep/nsubj (261; 28% instances), ja-dep/obl (251; 27% instances), ja-dep/nmod (224; 24% instances), ja-dep/iobj (100; 11% instances), ja-dep/obj (83; 9% instances), ja-dep/root (11; 1% instances), ja-dep/acl (1; 0% instances)

Parents of PRON nodes belong to 8 different parts of speech: VERB (557; 60% instances), NOUN (282; 30% instances), ADJ (56; 6% instances), NUM (11; 1% instances), ROOT (11; 1% instances), PROPN (7; 1% instances), ADV (5; 1% instances), PRON (2; 0% instances)

19 (2%) PRON nodes are leaves.

672 (72%) PRON nodes have one child.

181 (19%) PRON nodes have two children.

59 (6%) PRON nodes have three or more children.

The highest child degree of a PRON node is 10.

Children of PRON nodes are attached using 17 different relations: ja-dep/case (985; 79% instances), ja-dep/punct (96; 8% instances), ja-dep/nmod (44; 4% instances), ja-dep/acl (25; 2% instances), ja-dep/mark (23; 2% instances), ja-dep/cop (17; 1% instances), ja-dep/fixed (15; 1% instances), ja-dep/aux (13; 1% instances), ja-dep/compound (6; 0% instances), ja-dep/nsubj (4; 0% instances), ja-dep/advmod (3; 0% instances), ja-dep/amod (2; 0% instances), ja-dep/cc (2; 0% instances), ja-dep/csubj (2; 0% instances), ja-dep/det (1; 0% instances), ja-dep/nummod (1; 0% instances), ja-dep/obl (1; 0% instances)

Children of PRON nodes belong to 14 different parts of speech: ADP (974; 79% instances), PUNCT (101; 8% instances), NOUN (60; 5% instances), PART (33; 3% instances), AUX (28; 2% instances), VERB (19; 2% instances), ADJ (8; 1% instances), PROPN (6; 0% instances), SCONJ (3; 0% instances), ADV (2; 0% instances), PRON (2; 0% instances), SYM (2; 0% instances), CCONJ (1; 0% instances), NUM (1; 0% instances)


Treebank Statistics (UD_Japanese-KTC)

There are 22 PRON lemmas (0%), 1 PRON types (6%) and 744 PRON tokens (0%). Out of 16 observed tags, the rank of PRON is: 8 in number of lemmas, 11 in number of types and 15 in number of tokens.

The 10 most frequent PRON lemmas: 此れ, 其れ, _, 私-代名詞, 此処, 誰, 何れ, 何処, 其処, 何時

The 10 most frequent PRON types: _

The 10 most frequent ambiguous lemmas: _ (NOUN 52356, ADP 40131, PUNCT 20670, AUX 7362, SCONJ 6334, NUM 6286, VERB 6156, ADJ 2302, PART 1887, CONJ 1517, PROPN 1293, ADV 1200, SYM 865, PRON 102, DET 68, INTJ 8), 何れ (PRON 25, NOUN 2), 何時 (PRON 23, NOUN 1)

The 10 most frequent ambiguous types: _ (NOUN 59392, ADP 40132, PUNCT 20670, AUX 20538, VERB 17383, NUM 7782, SCONJ 6539, PROPN 5774, ADJ 3509, CONJ 1977, ADV 1949, PART 1921, SYM 865, DET 751, PRON 744, INTJ 17)

Morphology

The form / lemma ratio of PRON is 0.045455 (the average of all parts of speech is 0.003541).

The 1st highest number of forms (1) was observed with the lemma “_”: _.

The 2nd highest number of forms (1) was observed with the lemma “何”: _.

The 3rd highest number of forms (1) was observed with the lemma “何れ”: _.

PRON does not occur with any features.

Relations

PRON nodes are attached to their parents using 14 different relations: ja-dep/nmod (242; 33% instances), ja-dep/nsubj (175; 24% instances), ja-dep/dep (144; 19% instances), ja-dep/dobj (77; 10% instances), ja-dep/iobj (75; 10% instances), ja-dep/case (8; 1% instances), ja-dep/root (6; 1% instances), ja-dep/advcl (5; 1% instances), ja-dep/ccomp (3; 0% instances), ja-dep/nsubjpass (3; 0% instances), ja-dep/compound (2; 0% instances), ja-dep/conj (2; 0% instances), ja-dep/acl (1; 0% instances), ja-dep/advmod (1; 0% instances)

Parents of PRON nodes belong to 9 different parts of speech: VERB (448; 60% instances), NOUN (234; 31% instances), ADJ (45; 6% instances), ROOT (6; 1% instances), ADV (3; 0% instances), CONJ (3; 0% instances), INTJ (2; 0% instances), NUM (2; 0% instances), PRON (1; 0% instances)

85 (11%) PRON nodes are leaves.

484 (65%) PRON nodes have one child.

142 (19%) PRON nodes have two children.

33 (4%) PRON nodes have three or more children.

The highest child degree of a PRON node is 7.

Children of PRON nodes are attached using 18 different relations: ja-dep/case (741; 83% instances), ja-dep/punct (69; 8% instances), ja-dep/dep (17; 2% instances), ja-dep/mark (13; 1% instances), ja-dep/nmod (13; 1% instances), ja-dep/cop (12; 1% instances), ja-dep/nsubj (10; 1% instances), ja-dep/acl (9; 1% instances), ja-dep/compound (3; 0% instances), ja-dep/aux (2; 0% instances), ja-dep/advcl (1; 0% instances), ja-dep/advmod (1; 0% instances), ja-dep/appos (1; 0% instances), ja-dep/cc (1; 0% instances), ja-dep/conj (1; 0% instances), ja-dep/csubj (1; 0% instances), ja-dep/dobj (1; 0% instances), ja-dep/nummod (1; 0% instances)

Children of PRON nodes belong to 13 different parts of speech: ADP (601; 67% instances), PART (156; 17% instances), PUNCT (68; 8% instances), NOUN (34; 4% instances), AUX (15; 2% instances), VERB (9; 1% instances), SCONJ (7; 1% instances), ADJ (2; 0% instances), ADV (1; 0% instances), CONJ (1; 0% instances), NUM (1; 0% instances), PRON (1; 0% instances), SYM (1; 0% instances)


PRON in other languages: [am] [ar] [bg] [bxr] [ca] [ckb] [cop] [cs] [cu] [da] [de] [el] [en] [es] [et] [eu] [fa] [fi] [fo] [fr] [ga] [gl] [got] [grc] [he] [hi] [hr] [hu] [id] [it] [ja] [kk] [kmr] [ko] [la] [lv] [mr] [nl] [no] [pl] [pt] [ro] [ru] [sa] [sk] [sla] [sl] [so] [sr] [sv] [swl] [ta] [tr] [ug] [uk] [u] [urj] [ur] [vi] [yue] [zh]