Gender
: gender
Masc
: masculine gender
Nouns denoting male persons are masculine. Other nouns may be also grammatically masculine, without any relation to sex.
Examples
- castello “castle”
Fem
: feminine gender
Nouns denoting female persons are feminine. Other nouns may be also grammatically feminine, without any relation to sex.
Examples
- nave “ship”
Neut
: neuter gender
Not used.
Com
: common gender
Not used.
Treebank Statistics (UD_Italian)
This feature is universal.
It occurs with 2 different values: Fem
, Masc
.
117224 tokens (41%) have a non-empty value of Gender
.
14552 types (54%) occur at least once with a non-empty value of Gender
.
9767 lemmas (54%) occur at least once with a non-empty value of Gender
.
The feature is used with 10 part-of-speech tags: it-pos/NOUN (54468; 19% instances), it-pos/DET (39424; 14% instances), it-pos/ADJ (11593; 4% instances), it-pos/VERB (8098; 3% instances), it-pos/PRON (2921; 1% instances), it-pos/AUX (715; 0% instances), it-pos/ADP (2; 0% instances), it-pos/ADV (1; 0% instances), it-pos/PROPN (1; 0% instances), it-pos/X (1; 0% instances).
NOUN
54468 it-pos/NOUN tokens (97% of all NOUN
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which NOUN
and Gender
co-occurred: Number=Sing (35519; 65%).
NOUN
tokens may have the following values of Gender
:
Fem
(24397; 45% of non-emptyGender
): città, parte, società, legge, persone, proprietà, attività, vita, commissione, servitùMasc
(30071; 55% of non-emptyGender
): anni, presidente, fondo, diritto, anno, proprietario, film, stato, mondo, casoEMPTY
(1804): presidente, onorevole, abitanti, giovani, grazie, leader, rappresentanti, fronte, enfiteuta, partecipanti
Paradigm proprietario | Masc | Fem |
---|---|---|
Number=Sing | proprietario | proprietaria |
Number=Plur | proprietari |
Gender
seems to be lexical feature of NOUN
. 98% lemmas (6519) occur only with one value of Gender
.
DET
39424 it-pos/DET tokens (86% of all DET
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which DET
and Gender
co-occurred: PronType=Art (35664; 90%), Definite=Def (31258; 79%), Number=Sing (27610; 70%).
DET
tokens may have the following values of Gender
:
Fem
(16492; 42% of non-emptyGender
): la, le, una, sua, un’, questa, sue, queste, tutte, molteMasc
(22932; 58% of non-emptyGender
): il, i, un, gli, lo, suo, questo, tutti, suoi, alcuniEMPTY
(6534): l’, quale, ogni, loro, che, l’, qualche, tale, qualsiasi, tali
Paradigm il | Masc | Fem |
---|---|---|
Definite=Def | l’ | |
Definite=Def|Number=Sing|PronType=Art | il, lo, l’, l', lu, i1 | la, l', l’, le, il |
Definite=Def|Number=Plur|PronType=Art | i, gli, il | le, l’ |
Number=Sing | il | la |
ADJ
11593 it-pos/ADJ tokens (63% of all ADJ
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which ADJ
and Gender
co-occurred: Number=Sing (7747; 67%).
ADJ
tokens may have the following values of Gender
:
Fem
(5187; 45% of non-emptyGender
): prima, italiana, altra, stessa, altre, nuova, nuove, economica, seconda, altaMasc
(6406; 55% of non-emptyGender
): primo, nuovo, altri, altro, stesso, vero, europeo, secondo, terzo, pubblicoEMPTY
(6664): grande, presente, comune, ex, internazionale, maggiore, nazionale, mondiale, possibile, sociale
Paradigm primo | Masc | Fem |
---|---|---|
Number=Sing | primo | prima |
Number=Sing|NumType=Ord | primo, 1º | prima |
Number=Plur | prime | |
Number=Plur|NumType=Ord | primi | prime |
VERB
8098 it-pos/VERB tokens (33% of all VERB
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which VERB
and Gender
co-occurred: Tense=Past (8098; 100%), Person=EMPTY (8098; 100%), Mood=EMPTY (8098; 100%), VerbForm=Part (8097; 100%), Number=Sing (5916; 73%).
VERB
tokens may have the following values of Gender
:
Fem
(2415; 30% of non-emptyGender
): fatta, stabilite, dovuta, fatte, vista, considerata, costituita, fondata, nata, chiamataMasc
(5683; 70% of non-emptyGender
): fatto, visto, vinto, avuto, tenuto, detto, nato, ricevuto, dato, messoEMPTY
(16363): ha, è, hanno, fare, far, trova, sono, fa, chiama, vedere
Paradigm avere | Masc | Fem |
---|---|---|
avuto | avuta |
PRON
2921 it-pos/PRON tokens (27% of all PRON
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which PRON
and Gender
co-occurred: Number=Sing (2116; 72%), Clitic=EMPTY (2106; 72%), Person=EMPTY (1791; 61%).
PRON
tokens may have the following values of Gender
:
Fem
(730; 25% of non-emptyGender
): la, le, quella, quelle, una, questa, essa, esse, altra, leiMasc
(2191; 75% of non-emptyGender
): lo, quello, uno, questo, li, gli, lui, tutto, ciò, tuttiEMPTY
(7852): si, che, chi, ci, cui, ne, qual, mi, c’, quale
Paradigm lo | Masc | Fem |
---|---|---|
Number=Sing|Person=3 | lo, gli | la |
Number=Sing | lo | |
Number=Plur|Person=3 | li | le |
Number=Plur | le | |
Person=3 | le |
AUX
715 it-pos/AUX tokens (6% of all AUX
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which AUX
and Gender
co-occurred: VerbForm=Part (715; 100%), Tense=Past (715; 100%), Person=EMPTY (715; 100%), Mood=EMPTY (715; 100%), Number=Sing (544; 76%).
AUX
tokens may have the following values of Gender
:
Fem
(219; 31% of non-emptyGender
): stata, state, potuta, andata, fattaMasc
(496; 69% of non-emptyGender
): stato, stati, potuto, dovuto, voluto, andato, fattoEMPTY
(10343): è, sono, ha, può, essere, hanno, era, deve, possono, sia
Paradigm essere | Masc | Fem |
---|---|---|
Number=Sing | stato | stata |
Number=Plur | stati | state |
ADP
2 it-pos/ADP tokens (0% of all ADP
tokens) have a non-empty value of Gender
.
ADP
tokens may have the following values of Gender
:
Masc
(2; 100% of non-emptyGender
): del, duEMPTY
(42940): di, a, in, da, per, con, su, come, ad, tra
ADV
1 it-pos/ADV tokens (0% of all ADV
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which ADV
and Gender
co-occurred: PronType=EMPTY (1; 100%).
ADV
tokens may have the following values of Gender
:
Masc
(1; 100% of non-emptyGender
): pochissimoEMPTY
(10815): non, più, anche, dove, come, quando, solo, prima, sempre, poi
PROPN
1 it-pos/PROPN tokens (0% of all PROPN
tokens) have a non-empty value of Gender
.
PROPN
tokens may have the following values of Gender
:
Fem
(1; 100% of non-emptyGender
): hyeEMPTY
(13891): Italia, Shakespeare, Balzac, Europa, San, Roma, Stati, Uniti, Albania, Marco
X
1 it-pos/X tokens (0% of all X
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which X
and Gender
co-occurred: Foreign=Yes (1; 100%).
X
tokens may have the following values of Gender
:
Masc
(1; 100% of non-emptyGender
): mixerEMPTY
(237): a, b, Illusions, perdues, De, ad, f, home, la, Come
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender
:
NOUN –[det]–> DET (32309; 84%),
NOUN –[amod]–> ADJ (9111; 63%),
NOUN –[conj]–> NOUN (2333; 55%),
NOUN –[acl]–> VERB (1931; 64%),
VERB –[nsubj:pass]–> NOUN (1403; 81%),
NOUN –[det:poss]–> DET (1396; 79%),
VERB –[conj]–> VERB (455; 53%),
ADJ –[conj]–> ADJ (362; 53%),
ADJ –[nsubj]–> NOUN (354; 57%),
NOUN –[det:predet]–> DET (344; 96%).
Treebank Statistics (UD_Italian-ParTUT)
This feature is universal.
It occurs with 2 different values: Fem
, Masc
.
18506 tokens (43%) have a non-empty value of Gender
.
4235 types (57%) occur at least once with a non-empty value of Gender
.
3110 lemmas (60%) occur at least once with a non-empty value of Gender
.
The feature is used with 8 part-of-speech tags: it-pos/NOUN (8458; 20% instances), it-pos/DET (6323; 15% instances), it-pos/ADJ (1990; 5% instances), it-pos/VERB (1101; 3% instances), it-pos/PRON (519; 1% instances), it-pos/AUX (113; 0% instances), it-pos/ADP (1; 0% instances), it-pos/PROPN (1; 0% instances).
NOUN
8458 it-pos/NOUN tokens (97% of all NOUN
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which NOUN
and Gender
co-occurred: Number=Sing (5422; 64%).
NOUN
tokens may have the following values of Gender
:
Fem
(4044; 48% of non-emptyGender
): società, opere, parte, commissione, opera, vita, crescita, attività, sicurezza, direttivaMasc
(4414; 52% of non-emptyGender
): anni, lavoro, paesi, modo, parlamento, diritto, sviluppo, euro, periodo, programmaEMPTY
(251): presidente, onorevole, account, commissario, rappresentanti, fine, grazie, partecipanti, collega, consulenti
Paradigm signore | Masc | Fem |
---|---|---|
signor | signora |
Gender
seems to be lexical feature of NOUN
. 99% lemmas (2017) occur only with one value of Gender
.
DET
6323 it-pos/DET tokens (86% of all DET
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which DET
and Gender
co-occurred: PronType=Art (5511; 87%), Definite=Def (4747; 75%), Number=Sing (4329; 68%).
DET
tokens may have the following values of Gender
:
Fem
(2761; 44% of non-emptyGender
): la, le, una, sua, un’, sue, questa, queste, nostra, alcunaMasc
(3562; 56% of non-emptyGender
): il, i, un, gli, suo, lo, questo, suoi, tutti, alcuniEMPTY
(992): l’, ogni, loro, tale, tali, qualsiasi, più, tal, cui, qualche
Paradigm il | Masc | Fem |
---|---|---|
Number=Sing | il, lo, l' | la |
Number=Plur | i, gli | le |
ADJ
1990 it-pos/ADJ tokens (61% of all ADJ
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which ADJ
and Gender
co-occurred: Number=Sing (1236; 62%).
ADJ
tokens may have the following values of Gender
:
Fem
(884; 44% of non-emptyGender
): economica, altre, prima, pericolose, relative, nuova, stessa, umana, direttrici, europeaMasc
(1106; 56% of non-emptyGender
): altri, primo, europeo, nuovo, stesso, finanziario, altro, nuovi, ultimi, necessarioEMPTY
(1270): presente, sociale, grande, importante, maggiore, internazionale, importanti, principali, strutturali, intellettuale
Paradigm altro | Masc | Fem |
---|---|---|
Number=Sing | altro | altra |
Number=Plur | altri | altre |
VERB
1101 it-pos/VERB tokens (30% of all VERB
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which VERB
and Gender
co-occurred: Person=EMPTY (1101; 100%), Tense=Past (1101; 100%), Mood=EMPTY (1101; 100%), VerbForm=Part (1101; 100%), Number=Sing (726; 66%).
VERB
tokens may have the following values of Gender
:
Fem
(359; 33% of non-emptyGender
): presentata, data, messe, presentate, pubblicate, volta, limitata, previste, scritte, sostenutaMasc
(742; 67% of non-emptyGender
): considerato, fatto, dato, avuto, visto, portato, svolto, previsto, riconosciuto, scrittoEMPTY
(2515): ha, è, scrisse, hanno, far, fare, garantire, rappresenta, creare, dare
Paradigm avere | Masc | Fem |
---|---|---|
avuto | avuta |
PRON
519 it-pos/PRON tokens (38% of all PRON
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which PRON
and Gender
co-occurred: Clitic=EMPTY (413; 80%), Number=Sing (383; 74%), Person=EMPTY (338; 65%).
PRON
tokens may have the following values of Gender
:
Fem
(142; 27% of non-emptyGender
): quella, la, le, lei, questa, una, molte, essa, esse, quelleMasc
(377; 73% of non-emptyGender
): lo, ciò, quanto, quello, uno, altri, questo, lui, alcuni, gliEMPTY
(849): che, si, cui, ci, ne, mi, vi, noi, c’, quale
Paradigm quello | Masc | Fem |
---|---|---|
Number=Sing | quello, quel | quella |
Number=Plur | quelli | quelle |
AUX
113 it-pos/AUX tokens (7% of all AUX
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which AUX
and Gender
co-occurred: Tense=Past (113; 100%), Person=EMPTY (113; 100%), VerbForm=Part (113; 100%), Mood=EMPTY (113; 100%), Number=Sing (78; 69%).
AUX
tokens may have the following values of Gender
:
Fem
(32; 28% of non-emptyGender
): stata, state, andata, potutaMasc
(81; 72% of non-emptyGender
): stato, stati, potuto, dovuto, andatoEMPTY
(1420): è, sono, ha, essere, era, hanno, può, fu, sia, possono
Paradigm essere | Masc | Fem |
---|---|---|
Number=Sing | stato | stata |
Number=Plur | stati | state |
PROPN
1 it-pos/PROPN tokens (0% of all PROPN
tokens) have a non-empty value of Gender
.
PROPN
tokens may have the following values of Gender
:
Fem
(1; 100% of non-emptyGender
): hyeEMPTY
(1724): Shakespeare, Balzac, Ucraina, Europa, Facebook, Pericle, Stati, Uniti, Cina, John
ADP
1 it-pos/ADP tokens (0% of all ADP
tokens) have a non-empty value of Gender
.
ADP
tokens may have the following values of Gender
:
Masc
(1; 100% of non-emptyGender
): duEMPTY
(6857): di, a, in, per, da, su, con, come, ad, tra
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender
:
NOUN –[det]–> DET (5160; 85%),
NOUN –[amod]–> ADJ (1604; 60%),
NOUN –[det:poss]–> DET (385; 86%),
NOUN –[conj]–> NOUN (368; 56%),
NOUN –[acl]–> VERB (319; 57%),
VERB –[nsubj:pass]–> NOUN (227; 95%),
PRON –[nmod]–> NOUN (68; 73%),
NOUN –[det:predet]–> DET (65; 100%),
ADJ –[conj]–> ADJ (62; 53%),
NOUN –[nsubj]–> NOUN (50; 56%).
Gender in other languages: [am] [ar] [bg] [bxr] [ca] [ckb] [cop] [cs] [cu] [da] [de] [el] [en] [es] [et] [eu] [fa] [fo] [fr] [ga] [gl] [got] [grc] [he] [hi] [hr] [hu] [id] [it] [ja] [kk] [kmr] [ko] [la] [lv] [mr] [nl] [no] [pl] [pt] [ro] [ru] [sa] [sk] [sla] [sl] [so] [sr] [sv] [swl] [ta] [tr] [u] [ug] [uk] [ur] [vi] [yue] [zh]