
This page still pertains to UD version 1.

PRON: pronoun

Pronouns are words that substitute for nouns or noun phrases, whose meaning is recoverable from the linguistic or extralinguistic context.

Lemmatization rules = ?

Examples

clitic pronouns: se, me, te, lhe (including reflexive pronouns)

demonstrative pronouns: isto, esse, aquilo

personal pronouns: eu, tu, ele, vocês

indefinite pronouns: um, outro, qualquer

possessive pronouns: meu, seu, dele

interrogative pronouns: que, quanto, qual

relative pronouns: que, cujo, qual

totality pronouns: todo, todas

negative pronouns: nenhum, ninguém


Treebank Statistics (UD_Portuguese)

There are 58 PRON lemmas (0%), 111 PRON types (0%) and 6925 PRON tokens (3%). Out of 17 observed tags, the rank of PRON is: 9 in number of lemmas, 8 in number of types and 9 in number of tokens.

The 10 most frequent PRON lemmas: que, se, ele, o, eu, ela, isso, quem, eles, tudo

The 10 most frequent PRON types: que, se, o, ele, isso, quem, lhe, tudo, eles, ela

The 10 most frequent ambiguous lemmas: que (PRON 2542, SCONJ 1545, ADV 86, NOUN 53, DET 18, PROPN 12, ADP 9, X 1), se (PRON 1380, SCONJ 266, ADP 2), o (DET 26587, PRON 506, PROPN 9, NOUN 4, ADP 3), ela (PRON 167, NOUN 1), isso (PRON 160, NOUN 1), quem (PRON 133, ADV 1), tudo (PRON 106, DET 3), outro (DET 266, PRON 94, NOUN 1), este (DET 557, PRON 86), qual (PRON 84, DET 17, ADV 1, INTJ 1)

The 10 most frequent ambiguous types: que (PRON 2536, SCONJ 1537, ADV 86, NOUN 53, DET 15, PROPN 12, ADP 9, X 1), se (PRON 1362, SCONJ 180, ADP 2), o (DET 9876, PRON 494, PROPN 9, NOUN 4), isso (PRON 147, NOUN 1), quem (PRON 101, ADV 1), tudo (PRON 89, DET 3), ela (PRON 75, NOUN 1), a (DET 9174, ADP 3836, PRON 88, PROPN 14, NOUN 3, ADV 2), me (PRON 82, PROPN 2), os (DET 3204, PRON 76, ADP 5, PROPN 4)
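The ambiguity lists above pair each lemma (or type) with the tags it was observed with and their counts, and the entries appear to be ordered by their PRON count. A minimal sketch of how such a list could be tallied, assuming tokens have already been reduced to (form, lemma, tag) triples; the sample data and the function names are hypothetical and are not taken from the treebank or from the tool that generated this page:

```python
from collections import Counter, defaultdict

# Hypothetical sample reduced to (form, lemma, upos); not real treebank data.
TOKENS = [
    ("que", "que", "PRON"),
    ("que", "que", "SCONJ"),
    ("Que", "que", "PRON"),
    ("se", "se", "PRON"),
    ("o", "o", "DET"),
    ("o", "o", "DET"),
    ("o", "o", "PRON"),
    ("ninguém", "ninguém", "PRON"),
]


def ambiguous_lemmas(tokens, upos="PRON"):
    """List (lemma, tag counts) for lemmas seen with `upos` and at least one other tag.

    The result is sorted by the `upos` count, highest first.
    """
    tags_by_lemma = defaultdict(Counter)
    for _form, lemma, tag in tokens:
        tags_by_lemma[lemma][tag] += 1
    ambiguous = [
        (lemma, tags)
        for lemma, tags in tags_by_lemma.items()
        if upos in tags and len(tags) > 1
    ]
    return sorted(ambiguous, key=lambda item: -item[1][upos])


def format_entry(lemma, tags):
    """Render one entry with tags in descending order of frequency."""
    parts = ", ".join(f"{tag} {count}" for tag, count in tags.most_common())
    return f"{lemma} ({parts})"


for lemma, tags in ambiguous_lemmas(TOKENS):
    print(format_entry(lemma, tags))
```

Replacing the lemma with the form in the tally would give the corresponding list of ambiguous types.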

Morphology

The form / lemma ratio of PRON is 1.913793 (the average of all parts of speech is 1.425915).
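The reported ratio is consistent with dividing the number of PRON types by the number of PRON lemmas given in the statistics above (111 and 58). A short arithmetic check, where the function name is only illustrative and the exact counting rules of the generating tool are an assumption:

```python
def form_lemma_ratio(n_types: int, n_lemmas: int) -> float:
    """Average number of distinct word forms per lemma."""
    if n_lemmas <= 0:
        raise ValueError("n_lemmas must be positive")
    return n_types / n_lemmas


# Counts taken from the UD_Portuguese statistics: 111 PRON types, 58 PRON lemmas.
print(f"{form_lemma_ratio(111, 58):.6f}")  # 1.913793
```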

The 1st highest number of forms (9) was observed with the lemma “ele”: Ihe, ela, elas, ele, eles, lhe, lo, no, o.

The 2nd highest number of forms (6) was observed with the lemma “ela”: a, ela, la, las, lhe, na.

The 3rd highest number of forms (5) was observed with the lemma “elas”: as, elas, las, lhes, nas.

PRON occurs with 7 features: pt-feat/PronType (6925; 100% instances), pt-feat/Gender (6900; 100% instances), pt-feat/Number (6613; 95% instances), pt-feat/Case (2420; 35% instances), pt-feat/Person (2307; 33% instances), pt-feat/Definite (22; 0% instances), pt-feat/VerbForm (1; 0% instances)

PRON occurs with 22 feature-value pairs: Case=Acc, Case=Dat, Case=Nom, Definite=Def, Gender=Fem, Gender=Masc, Gender=Unsp, Number=Plur, Number=Sing, Number=Unsp, Person=1, Person=2, Person=3, PronType=Art, PronType=Dem, PronType=Ind, PronType=Int, PronType=Neg, PronType=Prs, PronType=Rel, PronType=Tot, VerbForm=Ger

PRON occurs with 108 feature combinations. The most frequent feature combination is Gender=Masc|Number=Sing|PronType=Rel (1261 tokens). Examples: que, quem, qual, tudo, quanto, Nada

Relations

PRON nodes are attached to their parents using 24 different relations: pt-dep/nsubj (2879; 42% instances), pt-dep/obj (1081; 16% instances), pt-dep/expl (917; 13% instances), pt-dep/obl (605; 9% instances), pt-dep/det (292; 4% instances), pt-dep/iobj (225; 3% instances), pt-dep/fixed (189; 3% instances), pt-dep/nmod (180; 3% instances), pt-dep/nsubj:pass (164; 2% instances), pt-dep/root (123; 2% instances), pt-dep/conj (85; 1% instances), pt-dep/mark (36; 1% instances), pt-dep/dep (34; 0% instances), pt-dep/xcomp (30; 0% instances), pt-dep/ccomp (16; 0% instances), pt-dep/acl:relcl (14; 0% instances), pt-dep/appos (14; 0% instances), pt-dep/parataxis (14; 0% instances), pt-dep/nmod:npmod (11; 0% instances), pt-dep/obl:agent (6; 0% instances), pt-dep/advcl (5; 0% instances), pt-dep/advmod (3; 0% instances), pt-dep/csubj (1; 0% instances), pt-dep/dislocated (1; 0% instances)

Parents of PRON nodes belong to 14 different parts of speech: VERB (5535; 80% instances), NOUN (516; 7% instances), PRON (360; 5% instances), ADJ (222; 3% instances), ROOT (123; 2% instances), ADV (72; 1% instances), PROPN (32; 0% instances), NUM (31; 0% instances), ADP (14; 0% instances), DET (12; 0% instances), SYM (4; 0% instances), X (2; 0% instances), INTJ (1; 0% instances), SCONJ (1; 0% instances)

5204 (75%) PRON nodes are leaves.

900 (13%) PRON nodes have one child.

529 (8%) PRON nodes have two children.

292 (4%) PRON nodes have three or more children.

The highest child degree of a PRON node is 12.
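The four child-count figures above partition the PRON tokens: 5204 + 900 + 529 + 292 = 6925. The sketch below shows one way such buckets could be derived from head indices and then checks the reported percentages. The sample sentence, the bucket labels, and the function name are hypothetical; only the four reported counts come from this page.

```python
from collections import Counter

# Hypothetical sentence as (id, head, upos); head 0 marks the root. Not treebank data.
SENTENCE = [
    (1, 3, "PRON"),  # no children: a leaf
    (2, 3, "AUX"),
    (3, 0, "PRON"),  # children 1, 2 and 4
    (4, 3, "PRON"),  # child 5
    (5, 4, "ADP"),
]

LABELS = {0: "leaf", 1: "one child", 2: "two children"}


def child_degree_buckets(nodes, upos="PRON"):
    """Count nodes tagged `upos` by how many children they have."""
    n_children = Counter(head for _node_id, head, _tag in nodes)
    buckets = Counter()
    for node_id, _head, tag in nodes:
        if tag == upos:
            label = LABELS.get(n_children[node_id], "three or more children")
            buckets[label] += 1
    return buckets


print(child_degree_buckets(SENTENCE))

# Counts reported above for UD_Portuguese.
REPORTED = {
    "leaf": 5204,
    "one child": 900,
    "two children": 529,
    "three or more children": 292,
}
total = sum(REPORTED.values())
percentages = {label: round(100 * n / total) for label, n in REPORTED.items()}
print(total, percentages)
```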

Children of PRON nodes are attached using 24 different relations: pt-dep/case (1010; 32% instances), pt-dep/punct (398; 13% instances), pt-dep/det (388; 12% instances), pt-dep/nmod (302; 9% instances), pt-dep/acl:relcl (220; 7% instances), pt-dep/cop (176; 6% instances), pt-dep/nsubj (136; 4% instances), pt-dep/fixed (132; 4% instances), pt-dep/advmod (112; 4% instances), pt-dep/cc (83; 3% instances), pt-dep/conj (47; 1% instances), pt-dep/acl (29; 1% instances), pt-dep/appos (26; 1% instances), pt-dep/mark (23; 1% instances), pt-dep/advcl (22; 1% instances), pt-dep/amod (18; 1% instances), pt-dep/aux (13; 0% instances), pt-dep/dep (10; 0% instances), pt-dep/nmod:npmod (10; 0% instances), pt-dep/parataxis (9; 0% instances), pt-dep/xcomp (9; 0% instances), pt-dep/csubj (4; 0% instances), pt-dep/ccomp (3; 0% instances), pt-dep/obj (2; 0% instances)

Children of PRON nodes belong to 13 different parts of speech: ADP (1025; 32% instances), PUNCT (398; 13% instances), PRON (360; 11% instances), NOUN (359; 11% instances), VERB (291; 9% instances), DET (219; 7% instances), AUX (189; 6% instances), ADV (116; 4% instances), CCONJ (80; 3% instances), PROPN (65; 2% instances), ADJ (51; 2% instances), SCONJ (18; 1% instances), NUM (11; 0% instances)


Treebank Statistics (UD_Portuguese-BR)

There is 1 PRON lemma (5%), 131 PRON types (0%) and 6681 PRON tokens (2%). Out of 14 observed tags, the rank of PRON is: 10 in number of lemmas, 10 in number of types and 11 in number of tokens.

The 10 most frequent PRON lemmas: _

The 10 most frequent PRON types: que, se, ele, isso, o, ela, um, eu, eles, quem

The 10 most frequent ambiguous lemmas: _ (NOUN 51670, PUNCT 37916, PROPN 29660, ADP 27823, VERB 26752, DET 23518, ADJ 13618, CCONJ 9896, ADV 8825, NUM 7639, PRON 6681, AUX 4729, PART 687, X 472)

The 10 most frequent ambiguous types: que (PRON 2673, CCONJ 2020, ADP 107, DET 6, NOUN 3, X 1), se (PRON 687, PART 362, CCONJ 173, ADP 3, PROPN 1), o (DET 14809, PRON 204, PROPN 1, X 1, ADP 1), ela (PRON 122, NOUN 3), um (DET 1545, PRON 154, NUM 105, NOUN 1), você (PRON 80, PROPN 1), uma (DET 1467, NUM 82, PRON 78), qual (PRON 72, DET 8), me (PRON 64, ADP 1), nós (PRON 35, NOUN 7)

Morphology

The form / lemma ratio of PRON is 131.000000 (the average of all parts of speech is 1740.105263).

The 1st highest number of forms (131) was observed with the lemma “_”: Agra, Almeida, Big, Como, Elano, Gu, Hiato, Lynn, Maxim, Merss, Mosquini, OQ, Odenville, PMs, Paraisópolis, Quantos, Sharapova, Tidico, Vos, Xandele, a, algo, alguma, algumas, alguns, alguém, ambas, ambos, ao, aquela, aquelas, aquele, aqueles, aquilo, as, bastante, cada, de, demais, dessa, diferencial, duque, ela, elas, ele, eles, elle, essa, essas, esse, esses, esta, estas, este, estes, eu, hoc, isso, isto, la, las, latim, lhe, lhes, lo, los, me, mesma, mesmas, mesmo, mesmos, mim, minha, muitas, muito, muitos, nada, nenhum, nenhuma, ninguém, no, nos, nossa, nosso, nós, o, os, outos, outra, outras, outro, outros, poucas, pouco, poucos, próprio, quais, qual, qualquer, quanto, que, quebra, quem, quê, se, seu, si, sua, tais, tal, tanto, te, that, they, toda, todas, todo, todos, tudo, ue, um, uma, umas, uns, vive, você, vocês, várias, vários, vós, which.

PRON does not occur with any features.

Relations

PRON nodes are attached to their parents using 24 different relations: pt-dep/nsubj (3565; 53% instances), pt-dep/obj (1201; 18% instances), pt-dep/nmod (754; 11% instances), pt-dep/nsubj:pass (279; 4% instances), pt-dep/expl:pv (244; 4% instances), pt-dep/root (150; 2% instances), pt-dep/appos (106; 2% instances), pt-dep/iobj (104; 2% instances), pt-dep/conj (74; 1% instances), pt-dep/ccomp (44; 1% instances), pt-dep/dep (44; 1% instances), pt-dep/mark (38; 1% instances), pt-dep/acl:relcl (20; 0% instances), pt-dep/det (17; 0% instances), pt-dep/parataxis (11; 0% instances), pt-dep/advcl (10; 0% instances), pt-dep/acl:part (6; 0% instances), pt-dep/det:poss (3; 0% instances), pt-dep/fixed (3; 0% instances), pt-dep/advmod (2; 0% instances), pt-dep/cc (2; 0% instances), pt-dep/csubj (2; 0% instances), pt-dep/flat (1; 0% instances), pt-dep/xcomp (1; 0% instances)

Parents of PRON nodes belong to 13 different parts of speech: VERB (5666; 85% instances), NOUN (609; 9% instances), ROOT (150; 2% instances), PRON (97; 1% instances), ADJ (68; 1% instances), PROPN (56; 1% instances), ADV (14; 0% instances), NUM (7; 0% instances), PART (5; 0% instances), ADP (4; 0% instances), AUX (3; 0% instances), DET (1; 0% instances), X (1; 0% instances)

5041 (75%) PRON nodes are leaves.

772 (12%) PRON nodes have one child.

484 (7%) PRON nodes have two children.

384 (6%) PRON nodes have three or more children.

The highest child degree of a PRON node is 20.

Children of PRON nodes are attached using 28 different relations: pt-dep/case (782; 24% instances), pt-dep/punct (604; 18% instances), pt-dep/nmod (472; 14% instances), pt-dep/det (329; 10% instances), pt-dep/cop (238; 7% instances), pt-dep/acl:relcl (219; 7% instances), pt-dep/nsubj (163; 5% instances), pt-dep/advmod (79; 2% instances), pt-dep/conj (76; 2% instances), pt-dep/cc (74; 2% instances), pt-dep/appos (52; 2% instances), pt-dep/amod (46; 1% instances), pt-dep/acl:part (38; 1% instances), pt-dep/csubj (34; 1% instances), pt-dep/mark (27; 1% instances), pt-dep/advcl (25; 1% instances), pt-dep/aux (11; 0% instances), pt-dep/acl:inf (8; 0% instances), pt-dep/parataxis (8; 0% instances), pt-dep/ccomp (4; 0% instances), pt-dep/expl:pv (4; 0% instances), pt-dep/obj (4; 0% instances), pt-dep/aux:pass (3; 0% instances), pt-dep/dep (3; 0% instances), pt-dep/fixed (2; 0% instances), pt-dep/nsubj:pass (2; 0% instances), pt-dep/xcomp:adj (2; 0% instances), pt-dep/det:poss (1; 0% instances)

Children of PRON nodes belong to 14 different parts of speech: ADP (779; 24% instances), PUNCT (604; 18% instances), VERB (592; 18% instances), NOUN (524; 16% instances), DET (317; 10% instances), PROPN (114; 3% instances), PRON (97; 3% instances), CCONJ (90; 3% instances), ADV (87; 3% instances), ADJ (48; 1% instances), X (33; 1% instances), AUX (14; 0% instances), NUM (6; 0% instances), PART (5; 0% instances)


PRON in other languages: [am] [ar] [bg] [bxr] [ca] [ckb] [cop] [cs] [cu] [da] [de] [el] [en] [es] [et] [eu] [fa] [fi] [fo] [fr] [ga] [gl] [got] [grc] [he] [hi] [hr] [hu] [id] [it] [ja] [kk] [kmr] [ko] [la] [lv] [mr] [nl] [no] [pl] [pt] [ro] [ru] [sa] [sk] [sla] [sl] [so] [sr] [sv] [swl] [ta] [tr] [ug] [uk] [u] [urj] [ur] [vi] [yue] [zh]