home es/pos edit page issue tracker

This page still pertains to UD version 1.

NOUN: noun

Definition

Nouns are a part of speech typically denoting a person, place, thing, animal or idea.

The NOUN tag is intended for common nouns only. See PROPN for proper nouns and PRON for pronouns.

Spanish nouns have the lexical feature es-feat/Gender. Furthermore, the nouns inflect for es-feat/Number.

Examples


Treebank Statistics (UD_Spanish)

There are 10285 NOUN lemmas (26%), 12787 NOUN types (25%) and 75295 NOUN tokens (18%). Out of 16 observed tags, the rank of NOUN is: 2 in number of lemmas, 2 in number of types and 1 in number of tokens.

The 10 most frequent NOUN lemmas: año, parte, población, ciudad, vez, persona, estado, municipio, familia, país

The 10 most frequent NOUN types: años, año, parte, población, ciudad, personas, municipio, estado, km, familia

The 10 most frequent ambiguous lemmas: año (NOUN 1022, PROPN 6), parte (NOUN 461, VERB 2, PROPN 1), ciudad (NOUN 391, PROPN 39), vez (NOUN 374, X 1, PROPN 1), persona (NOUN 361, PROPN 2), estado (NOUN 317, PROPN 78, VERB 4), familia (NOUN 312, PROPN 6), país (NOUN 295, PROPN 23), día (NOUN 282, PROPN 12), equipo (NOUN 281, PROPN 2)

The 10 most frequent ambiguous types: años (NOUN 511, PROPN 3), año (NOUN 500, PROPN 1), parte (NOUN 417, VERB 2), ciudad (NOUN 328, PROPN 2), estado (NOUN 208, VERB 18, AUX 11, PROPN 1), km (NOUN 280, SYM 120), familia (NOUN 273, PROPN 2), forma (NOUN 199, VERB 25), día (NOUN 167, PROPN 1), embargo (NOUN 171, ADV 3)

Morphology

The form / lemma ratio of NOUN is 1.243267 (the average of all parts of speech is 1.255824).

The 1st highest number of forms (4) was observed with the lemma “conocido”: conocida, conocidas, conocido, conocidos.

The 2nd highest number of forms (4) was observed with the lemma “ganador”: ganador, ganadora, ganadoras, ganadores.

The 3rd highest number of forms (4) was observed with the lemma “medio”: media, medias, medio, medios.

NOUN occurs with 3 features: es-feat/Number (71392; 95% instances), es-feat/Gender (68418; 91% instances), es-feat/VerbForm (435; 1% instances)

NOUN occurs with 7 feature-value pairs: Gender=Fem, Gender=Masc, Number=Plur, Number=Sing, VerbForm=Ger, VerbForm=Inf, VerbForm=Part

NOUN occurs with 15 feature combinations. The most frequent feature combination is Gender=Masc|Number=Sing (25388 tokens). Examples: año, municipio, nombre, lugar, equipo, estado, grupo, tiempo, país, embargo

Relations

NOUN nodes are attached to their parents using 29 different relations: es-dep/nmod (23426; 31% instances), es-dep/obl (17337; 23% instances), es-dep/obj (10365; 14% instances), es-dep/nsubj (9413; 13% instances), es-dep/conj (6799; 9% instances), es-dep/root (2588; 3% instances), es-dep/appos (2177; 3% instances), es-dep/nsubj:pass (797; 1% instances), es-dep/iobj (545; 1% instances), es-dep/fixed (256; 0% instances), es-dep/parataxis (219; 0% instances), es-dep/acl:relcl (177; 0% instances), es-dep/nummod (175; 0% instances), es-dep/amod (153; 0% instances), es-dep/compound (133; 0% instances), es-dep/advcl (132; 0% instances), es-dep/ccomp (128; 0% instances), es-dep/dep (113; 0% instances), es-dep/acl (107; 0% instances), es-dep/case (89; 0% instances), es-dep/xcomp (59; 0% instances), es-dep/csubj (31; 0% instances), es-dep/mark (25; 0% instances), es-dep/cop (23; 0% instances), es-dep/advmod (12; 0% instances), es-dep/flat (9; 0% instances), es-dep/cc (3; 0% instances), es-dep/aux (2; 0% instances), es-dep/det (2; 0% instances)

Parents of NOUN nodes belong to 17 different parts of speech: VERB (37296; 50% instances), NOUN (28271; 38% instances), ADJ (2819; 4% instances), ROOT (2588; 3% instances), PROPN (1979; 3% instances), PRON (983; 1% instances), ADP (285; 0% instances), ADV (249; 0% instances), SYM (245; 0% instances), NUM (205; 0% instances), X (197; 0% instances), DET (92; 0% instances), CCONJ (42; 0% instances), AUX (34; 0% instances), SCONJ (5; 0% instances), PART (3; 0% instances), PUNCT (2; 0% instances)

1789 (2%) NOUN nodes are leaves.

15636 (21%) NOUN nodes have one child.

24574 (33%) NOUN nodes have two children.

33296 (44%) NOUN nodes have three or more children.

The highest child degree of a NOUN node is 27.

Children of NOUN nodes are attached using 29 different relations: es-dep/det (49693; 26% instances), es-dep/case (42111; 22% instances), es-dep/nmod (28619; 15% instances), es-dep/amod (18639; 10% instances), es-dep/punct (12744; 7% instances), es-dep/conj (6613; 3% instances), es-dep/appos (5138; 3% instances), es-dep/nummod (5056; 3% instances), es-dep/cc (5019; 3% instances), es-dep/acl:relcl (4060; 2% instances), es-dep/cop (3344; 2% instances), es-dep/acl (2367; 1% instances), es-dep/nsubj (2352; 1% instances), es-dep/advmod (1336; 1% instances), es-dep/advcl (1084; 1% instances), es-dep/mark (505; 0% instances), es-dep/dep (426; 0% instances), es-dep/parataxis (256; 0% instances), es-dep/fixed (167; 0% instances), es-dep/csubj (157; 0% instances), es-dep/compound (139; 0% instances), es-dep/aux (121; 0% instances), es-dep/aux:pass (58; 0% instances), es-dep/obj (49; 0% instances), es-dep/ccomp (37; 0% instances), es-dep/flat (31; 0% instances), es-dep/iobj (26; 0% instances), es-dep/nsubj:pass (25; 0% instances), es-dep/xcomp (6; 0% instances)

Children of NOUN nodes belong to 16 different parts of speech: DET (49474; 26% instances), ADP (41777; 22% instances), NOUN (28271; 15% instances), ADJ (19797; 10% instances), PUNCT (12723; 7% instances), PROPN (12023; 6% instances), VERB (11242; 6% instances), CCONJ (4985; 3% instances), NUM (4964; 3% instances), ADV (1787; 1% instances), PRON (1112; 1% instances), X (744; 0% instances), SYM (668; 0% instances), SCONJ (407; 0% instances), AUX (191; 0% instances), PART (13; 0% instances)


Treebank Statistics (UD_Spanish-AnCora)

There are 8639 NOUN lemmas (31%), 11289 NOUN types (27%) and 91130 NOUN tokens (18%). Out of 17 observed tags, the rank of NOUN is: 2 in number of lemmas, 1 in number of types and 1 in number of tokens.

The 10 most frequent NOUN lemmas: año, país, presidente, millón, partido, día, equipo, parte, vez, grupo

The 10 most frequent NOUN types: años, presidente, millones, año, equipo, partido, país, parte, vez, día

The 10 most frequent ambiguous lemmas: país (NOUN 630, PROPN 1), día (NOUN 515, PROPN 3), parte (NOUN 424, PROPN 1, AUX 1), vez (NOUN 374, PROPN 1), mes (NOUN 341, PROPN 1), caso (NOUN 309, PROPN 6), hora (NOUN 287, PROPN 1), punto (NOUN 283, AUX 1), tiempo (NOUN 277, PROPN 2), mundo (NOUN 250, PROPN 2)

The 10 most frequent ambiguous types: partido (NOUN 396, VERB 1), país (NOUN 384, PROPN 1), parte (NOUN 380, VERB 3, AUX 2, PROPN 1), vez (NOUN 304, PROPN 1), día (NOUN 289, PROPN 2), mundo (NOUN 244, PROPN 2), caso (NOUN 238, PROPN 6), vida (NOUN 239, PROPN 5), tiempo (NOUN 231, PROPN 2), días (NOUN 225, PROPN 1)

Morphology

The form / lemma ratio of NOUN is 1.306748 (the average of all parts of speech is 1.500342).

The 1st highest number of forms (4) was observed with the lemma “petrolero”: petrolera, petroleras, petrolero, petroleros.

The 2nd highest number of forms (3) was observed with the lemma “_”: cargo, prueba, sonar.

The 3rd highest number of forms (3) was observed with the lemma “candidato”: CANDIDATAS, candidato, candidatos.

NOUN occurs with 5 features: es-feat/Number (83083; 91% instances), es-feat/Gender (79551; 87% instances), es-feat/AdvType (1691; 2% instances), es-feat/NumForm (602; 1% instances), es-feat/VerbForm (1; 0% instances)

NOUN occurs with 7 feature-value pairs: AdvType=Tim, Gender=Fem, Gender=Masc, NumForm=Digit, Number=Plur, Number=Sing, VerbForm=Part

NOUN occurs with 13 feature combinations. The most frequent feature combination is Gender=Masc|Number=Sing (28494 tokens). Examples: presidente, equipo, partido, país, año, ministro, grupo, mundo, día, tiempo

Relations

NOUN nodes are attached to their parents using 27 different relations: es-dep/nmod (24951; 27% instances), es-dep/obj (19701; 22% instances), es-dep/obl (16413; 18% instances), es-dep/nsubj (13961; 15% instances), es-dep/conj (5106; 6% instances), es-dep/fixed (3135; 3% instances), es-dep/appos (2763; 3% instances), es-dep/root (1364; 1% instances), es-dep/compound (1326; 1% instances), es-dep/iobj (464; 1% instances), es-dep/ccomp (431; 0% instances), es-dep/advcl (310; 0% instances), es-dep/case (300; 0% instances), es-dep/acl (280; 0% instances), es-dep/advmod (144; 0% instances), es-dep/mark (141; 0% instances), es-dep/dep (85; 0% instances), es-dep/xcomp (53; 0% instances), es-dep/cc (45; 0% instances), es-dep/parataxis (42; 0% instances), es-dep/csubj (39; 0% instances), es-dep/nsubj:pass (26; 0% instances), es-dep/det (23; 0% instances), es-dep/orphan (19; 0% instances), es-dep/cop (6; 0% instances), es-dep/csubj:pass (1; 0% instances), es-dep/nummod (1; 0% instances)

Parents of NOUN nodes belong to 17 different parts of speech: VERB (44893; 49% instances), NOUN (30349; 33% instances), ADJ (4967; 5% instances), ADP (2958; 3% instances), PROPN (1787; 2% instances), ROOT (1364; 1% instances), ADV (1310; 1% instances), AUX (1083; 1% instances), NUM (962; 1% instances), PRON (777; 1% instances), DET (260; 0% instances), CCONJ (166; 0% instances), SYM (124; 0% instances), PART (67; 0% instances), SCONJ (46; 0% instances), PUNCT (15; 0% instances), INTJ (2; 0% instances)

5370 (6%) NOUN nodes are leaves.

22174 (24%) NOUN nodes have one child.

28730 (32%) NOUN nodes have two children.

34856 (38%) NOUN nodes have three or more children.

The highest child degree of a NOUN node is 16.

Children of NOUN nodes are attached using 26 different relations: es-dep/det (55019; 26% instances), es-dep/case (46366; 22% instances), es-dep/nmod (30724; 15% instances), es-dep/amod (24749; 12% instances), es-dep/punct (14623; 7% instances), es-dep/acl (8076; 4% instances), es-dep/appos (5517; 3% instances), es-dep/conj (5067; 2% instances), es-dep/cc (4709; 2% instances), es-dep/nummod (3976; 2% instances), es-dep/cop (2272; 1% instances), es-dep/advmod (2050; 1% instances), es-dep/mark (1740; 1% instances), es-dep/nsubj (1499; 1% instances), es-dep/fixed (729; 0% instances), es-dep/compound (723; 0% instances), es-dep/aux (556; 0% instances), es-dep/obl (478; 0% instances), es-dep/advcl (296; 0% instances), es-dep/csubj (104; 0% instances), es-dep/obj (101; 0% instances), es-dep/parataxis (74; 0% instances), es-dep/dep (43; 0% instances), es-dep/orphan (21; 0% instances), es-dep/ccomp (5; 0% instances), es-dep/xcomp (4; 0% instances)

Children of NOUN nodes belong to 17 different parts of speech: DET (55090; 26% instances), ADP (46790; 22% instances), NOUN (30349; 14% instances), ADJ (25416; 12% instances), PUNCT (14638; 7% instances), PROPN (11780; 6% instances), VERB (8118; 4% instances), NUM (4655; 2% instances), CCONJ (4602; 2% instances), AUX (2939; 1% instances), ADV (2016; 1% instances), SCONJ (1762; 1% instances), PRON (1203; 1% instances), SYM (131; 0% instances), PART (20; 0% instances), INTJ (11; 0% instances), X (1; 0% instances)


NOUN in other languages: [am] [ar] [bg] [bxr] [ca] [ckb] [cop] [cs] [cu] [da] [de] [el] [en] [es] [et] [eu] [fa] [fi] [fo] [fr] [ga] [gl] [got] [grc] [he] [hi] [hr] [hu] [id] [it] [ja] [kk] [kmr] [ko] [la] [lv] [mr] [nl] [no] [pl] [pt] [ro] [ru] [sa] [sk] [sla] [sl] [so] [sr] [sv] [swl] [ta] [tr] [ug] [uk] [u] [urj] [ur] [vi] [yue] [zh]