Gender
: gender
In French, the possible values of Gender are masculine and feminine. The feature occurs with nouns, adjectives, past participles, determiners and pronouns. Words from other languages can have neuter gender (Neut).
Treebank Statistics (UD_French)
This feature is universal.
It occurs with 3 different values: Fem, Masc, Neut.
171614 tokens (44%) have a non-empty value of Gender.
20761 types (49%) occur at least once with a non-empty value of Gender.
13751 lemmas (42%) occur at least once with a non-empty value of Gender.
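The token, type and lemma shares above can be approximated directly from a CoNLL-U file. The following is a minimal sketch, not the script that generated this page: `read_conllu`, `gender_coverage` and `SAMPLE` are hypothetical names, and it assumes that a "type" is a case-sensitive word form and that multiword-token ranges and empty nodes are excluded from the counts.

```python
def read_conllu(lines):
    """Yield (form, lemma, upos, feats) for each word line of a CoNLL-U stream."""
    for line in lines:
        line = line.rstrip("\n")
        if not line or line.startswith("#"):
            continue  # sentence boundary or comment
        cols = line.split("\t")
        # Assumption: skip multiword-token ranges ("1-2") and empty nodes ("1.1").
        if "-" in cols[0] or "." in cols[0]:
            continue
        feats = {}
        if cols[5] != "_":
            feats = dict(pair.split("=", 1) for pair in cols[5].split("|"))
        yield cols[1], cols[2], cols[3], feats


def gender_coverage(words):
    """Return the shares of tokens, types and lemmas that carry a Gender value."""
    n_tokens = n_gendered = 0
    types, gendered_types = set(), set()
    lemmas, gendered_lemmas = set(), set()
    for form, lemma, _upos, feats in words:
        n_tokens += 1
        types.add(form)
        lemmas.add(lemma)
        if "Gender" in feats:
            n_gendered += 1
            gendered_types.add(form)
            gendered_lemmas.add(lemma)
    return (n_gendered / n_tokens,
            len(gendered_types) / len(types),
            len(gendered_lemmas) / len(lemmas))


# Toy sentence for illustration only; it is not taken from the treebank.
SAMPLE = "\n".join([
    "# text = La ville est grande",
    "1\tLa\tle\tDET\t_\tDefinite=Def|Gender=Fem|Number=Sing|PronType=Art\t2\tdet\t_\t_",
    "2\tville\tville\tNOUN\t_\tGender=Fem|Number=Sing\t4\tnsubj\t_\t_",
    "3\test\têtre\tAUX\t_\tMood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin\t4\tcop\t_\t_",
    "4\tgrande\tgrand\tADJ\t_\tGender=Fem|Number=Sing\t0\troot\t_\t_",
])

token_share, type_share, lemma_share = gender_coverage(read_conllu(SAMPLE.splitlines()))
print(f"tokens {token_share:.0%}, types {type_share:.0%}, lemmas {lemma_share:.0%}")
```

To run it on a real treebank, pass an open file handle to `read_conllu` instead of the toy sample.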
The feature is used with 8 part-of-speech tags: fr-pos/NOUN (72710; 19% instances), fr-pos/DET (57159; 15% instances), fr-pos/ADJ (22137; 6% instances), fr-pos/VERB (10913; 3% instances), fr-pos/PRON (7772; 2% instances), fr-pos/AUX (912; 0% instances), fr-pos/NUM (8; 0% instances), fr-pos/PROPN (3; 0% instances).
NOUN
72710 fr-pos/NOUN tokens (99% of all NOUN tokens) have a non-empty value of Gender.
The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (54660; 75%).
NOUN tokens may have the following values of Gender:
Fem (32298; 44% of non-empty Gender): ville, partie, région, commune, fois, années, famille, année, fin, guerre
Masc (40411; 56% of non-empty Gender): ans, pays, monde, nom, temps, groupe, siècle, état, cours, lieu
Neut (1; 0% of non-empty Gender): Museum
EMPTY (495): A, Co., league, world, Association, Company, Mt, Panther, Trail, blackface
Paradigm enfant | Masc | Fem |
---|---|---|
Number=Sing | enfant | enfant |
Number=Plur | enfants |
Gender seems to be a lexical feature of NOUN. 98% of lemmas (9120) occur with only one value of Gender.
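The "lexical feature" observation can be checked by collecting, for each lemma of a given part of speech, the set of Gender values it occurs with. The sketch below is illustrative only: `single_gender_lemma_share` and the toy `words` list are hypothetical, and it assumes that only lemmas with at least one non-empty Gender value enter the denominator, which may differ from how the published percentage was computed.

```python
from collections import defaultdict


def single_gender_lemma_share(words, upos):
    """Count lemmas of `upos` that occur with exactly one Gender value.

    `words` is an iterable of (lemma, upos, gender) tuples, where gender is
    None when the token has no Gender feature.
    Returns (single-valued lemmas, lemmas with any Gender, share).
    """
    values = defaultdict(set)
    for lemma, tag, gender in words:
        if tag == upos and gender is not None:
            values[lemma].add(gender)
    single = sum(1 for seen in values.values() if len(seen) == 1)
    return single, len(values), single / len(values)


# Toy data for illustration; "enfant" occurs with both values, as in the paradigm above.
words = [
    ("ville", "NOUN", "Fem"),
    ("ville", "NOUN", "Fem"),
    ("pays", "NOUN", "Masc"),
    ("enfant", "NOUN", "Masc"),
    ("enfant", "NOUN", "Fem"),
    ("grand", "ADJ", "Fem"),
]

single, total, share = single_gender_lemma_share(words, "NOUN")
print(f"{share:.0%} of lemmas ({single} of {total}) occur with only one value of Gender")
```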
DET
57159 fr-pos/DET tokens (95% of all DET tokens) have a non-empty value of Gender.
The most frequent other feature values with which DET and Gender co-occurred: PronType=Art (49211; 86%), Number=Sing (44559; 78%), Definite=Def (39410; 69%).
DET tokens may have the following values of Gender:
Fem (25470; 45% of non-empty Gender): la, une, les, l’, sa, cette, des, ses, son, leur
Masc (31689; 55% of non-empty Gender): le, les, un, l’, son, des, ce, ses, de, ces
EMPTY (3134): les, l’, the, des, son, d’, de, ses, a, chaque
Paradigm le | Masc | Fem |
---|---|---|
Number=Sing | la | |
Number=Sing\|PronType=Art | le, l', l | la, l', l, là, Les |
Number=Plur\|PronType=Art | les | les, L |
ADJ
22137 fr-pos/ADJ tokens (99% of all ADJ tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (16053; 73%).
ADJ tokens may have the following values of Gender:
Fem (10463; 47% of non-empty Gender): première, française, grande, même, nouvelle, nombreuses, nationale, autres, seule, internationale
Masc (11673; 53% of non-empty Gender): premier, français, autres, grand, nouveau, même, dernier, nombreux, seul, ancien
Neut (1; 0% of non-empty Gender): Koninklijk
EMPTY (177): National, live, American, Blue, complete, Last, new, Black, Dead, Global
Paradigm premier | Masc | Fem |
---|---|---|
Number=Sing | premier, 1er, Ier, 1e, 1 | première, 1ère, 1re |
Number=Plur | premiers | premières |
VERB
10913 fr-pos/VERB tokens (36% of all VERB tokens) have a non-empty value of Gender.
The most frequent other feature values with which VERB and Gender co-occurred: VerbForm=Part (10913; 100%), Person=EMPTY (10913; 100%), Tense=Past (10913; 100%), Mood=EMPTY (10913; 100%), Number=Sing (8770; 80%).
VERB tokens may have the following values of Gender:
Fem (3192; 29% of non-empty Gender): située, née, créée, appelée, utilisée, connue, construite, mise, publiée, nommée
Masc (7721; 71% of non-empty Gender): né, fait, situé, eu, mort, connu, nommé, réalisé, mis, utilisé
EMPTY (19603): a, fait, faire, est, partir, trouve, devient, ont, permet, voir
Paradigm faire | Masc | Fem |
---|---|---|
Number=Sing | fait, fais | faite |
Number=Plur | faits | faites |
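The co-occurrence figures for VERB (every gendered verb token is a past participle with no Mood or Person) can be reproduced in spirit by tallying the other feature values on tokens that carry Gender. This is a minimal sketch under stated assumptions: `cooccurring_values` and the toy `tokens` list are hypothetical, and the set of features to track is supplied by the caller, with a missing feature recorded as EMPTY.

```python
from collections import Counter


def cooccurring_values(tokens, upos, tracked):
    """Tally values of `tracked` features on `upos` tokens that have Gender.

    `tokens` is an iterable of (upos, feats) pairs, where feats is a dict.
    Returns a list of (feature=value, count, rounded percentage) tuples,
    most frequent first.
    """
    counts = Counter()
    total = 0
    for tag, feats in tokens:
        if tag != upos or "Gender" not in feats:
            continue
        total += 1
        for name in tracked:
            # A feature absent from the token is reported as EMPTY.
            counts[f"{name}={feats.get(name, 'EMPTY')}"] += 1
    return [(fv, n, round(100 * n / total)) for fv, n in counts.most_common()]


# Toy tokens for illustration (né, située, faits, trouve, ville).
tokens = [
    ("VERB", {"Gender": "Masc", "Number": "Sing", "Tense": "Past", "VerbForm": "Part"}),
    ("VERB", {"Gender": "Fem", "Number": "Sing", "Tense": "Past", "VerbForm": "Part"}),
    ("VERB", {"Gender": "Masc", "Number": "Plur", "Tense": "Past", "VerbForm": "Part"}),
    ("VERB", {"Mood": "Ind", "Number": "Sing", "Person": "3", "Tense": "Pres", "VerbForm": "Fin"}),
    ("NOUN", {"Gender": "Fem", "Number": "Sing"}),
]

for fv, n, pct in cooccurring_values(tokens, "VERB", ("VerbForm", "Tense", "Mood", "Number")):
    print(f"{fv} ({n}; {pct}%)")
```

The finite form without Gender and the noun are ignored, so only the three participles contribute to the tallies.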
PRON
7772 fr-pos/PRON tokens (44% of all PRON tokens) have a non-empty value of Gender.
The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (6723; 87%), Person=3 (6474; 83%), PronType=Prs (5674; 73%).
PRON tokens may have the following values of Gender:
Fem (1708; 22% of non-empty Gender): elle, elles, une, la, celle, laquelle, celles, -elle, celle-ci, lesquelles
Masc (6064; 78% of non-empty Gender): il, on, ils, le, un, -il, lequel, celui, tout, ceux
EMPTY (9714): qui, se, s’, c’, lui, ce, dont, où, nous, je
Paradigm il | Masc | Fem |
---|---|---|
Number=Sing\|Person=2 | -Tu | |
Number=Sing\|Person=3 | il, -il, Lui, t-il | -elle, elle |
Number=Sing | Lui | |
Number=Plur\|Person=3 | ils, -ils | elles, -elles |
AUX
912 fr-pos/AUX tokens (7% of all AUX tokens) have a non-empty value of Gender.
The most frequent other feature values with which AUX and Gender co-occurred: Mood=EMPTY (912; 100%), Person=EMPTY (912; 100%), Tense=Past (912; 100%), VerbForm=Part (912; 100%), Number=Sing (911; 100%).
AUX tokens may have the following values of Gender:
Fem (1; 0% of non-empty Gender): allée
Masc (911; 100% of non-empty Gender): été, pu, dû, voulu, su, fallu, censé, fait, pû, resté
EMPTY (12586): est, a, sont, ont, était, fut, être, peut, avait, avoir
Gender seems to be a lexical feature of AUX. 100% of lemmas (12) occur with only one value of Gender.
NUM
8 fr-pos/NUM tokens (0% of all NUM tokens) have a non-empty value of Gender.
NUM tokens may have the following values of Gender:
Fem (8; 100% of non-empty Gender): 00H30, 12H30, 14h25, 15H00, 20h40, 22h, 23h, 48H
EMPTY (10425): deux, trois, 2, 3, 5, quatre, 4, 2010, 2009, 2008
PROPN
3 fr-pos/PROPN tokens (0% of all PROPN tokens) have a non-empty value of Gender.
PROPN tokens may have the following values of Gender:
Fem (1; 33% of non-empty Gender): Italie
Masc (2; 67% of non-empty Gender): Palais, mémorique
EMPTY (29912): France, Paris, Europe, États-Unis, Jean, Maroc, Espagne, la, New, York
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
NOUN –[det]–> DET (48477; 99%),
NOUN –[amod]–> ADJ (18023; 99%),
NOUN –[nmod:poss]–> DET (4175; 99%),
NOUN –[conj]–> NOUN (3312; 63%),
NOUN –[acl]–> VERB (2894; 70%),
VERB –[nsubj:pass]–> NOUN (1597; 96%),
ADJ –[conj]–> ADJ (877; 97%),
NOUN –[appos]–> NOUN (872; 58%),
ADJ –[nsubj]–> NOUN (866; 97%),
NOUN –[nsubj]–> NOUN (575; 62%).
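Agreement statistics of this kind can be sketched by walking every dependency edge and comparing the Gender values of parent and child. The code below is only an illustration: `gender_agreement` and the toy sentence are hypothetical, and it assumes that the percentage is taken over edges where both nodes carry a Gender value, which may not match the denominator used for the figures above.

```python
from collections import Counter


def gender_agreement(sentences):
    """Count Gender agreement per (parent UPOS, relation, child UPOS).

    Each sentence is a list of token dicts with the keys
    "id", "head", "upos", "deprel" and "gender" (None when absent).
    Returns {key: (agreeing edges, share of agreeing edges)}.
    """
    agree, total = Counter(), Counter()
    for sent in sentences:
        by_id = {tok["id"]: tok for tok in sent}
        for child in sent:
            parent = by_id.get(child["head"])  # None for the root (head 0)
            if parent is None:
                continue
            # Assumption: edges with a missing Gender on either side are skipped.
            if child["gender"] is None or parent["gender"] is None:
                continue
            key = (parent["upos"], child["deprel"], child["upos"])
            total[key] += 1
            if parent["gender"] == child["gender"]:
                agree[key] += 1
    return {key: (agree[key], agree[key] / total[key]) for key in total}


# Toy sentence "la ville et le pays", for illustration only.
sentence = [
    {"id": 1, "head": 2, "upos": "DET", "deprel": "det", "gender": "Fem"},
    {"id": 2, "head": 0, "upos": "NOUN", "deprel": "root", "gender": "Fem"},
    {"id": 3, "head": 5, "upos": "CCONJ", "deprel": "cc", "gender": None},
    {"id": 4, "head": 5, "upos": "DET", "deprel": "det", "gender": "Masc"},
    {"id": 5, "head": 2, "upos": "NOUN", "deprel": "conj", "gender": "Masc"},
]

for (parent, rel, child), (n, share) in gender_agreement([sentence]).items():
    print(f"{parent} -[{rel}]-> {child} ({n}; {share:.0%})")
```

The coordinated nouns illustrate why relations such as conj show much lower agreement than det or amod: conjuncts are not required to share a gender.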
Treebank Statistics (UD_French-ParTUT)
This feature is universal.
It occurs with 2 different values: Fem, Masc.
7036 tokens (39%) have a non-empty value of Gender.
2017 types (62%) occur at least once with a non-empty value of Gender.
1596 lemmas (65%) occur at least once with a non-empty value of Gender.
The feature is used with 6 part-of-speech tags: fr-pos/NOUN (3730; 21% instances), fr-pos/DET (1740; 10% instances), fr-pos/ADJ (801; 4% instances), fr-pos/VERB (458; 3% instances), fr-pos/PRON (270; 2% instances), fr-pos/AUX (37; 0% instances).
NOUN
3730 fr-pos/NOUN tokens (99% of all NOUN tokens) have a non-empty value of Gender.
The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (2580; 69%).
NOUN tokens may have the following values of Gender:
Fem (1976; 53% of non-empty Gender): commission, sécurité, oeuvre, directive, mesures, protection, madame, mme, question, matière
Masc (1754; 47% of non-empty Gender): parlement, pays, droit, programme, membres, états, cas, conseil, rapport, monsieur
EMPTY (34): commissaire, coopération, gens, intermédiaire, responsables, adultes, collègue, fantômes, intermédiaires, jeunes
Paradigm président | Masc | Fem |
---|---|---|
Number=Sing | président | présidente |
Number=Plur | présidents |
Gender seems to be a lexical feature of NOUN. 97% of lemmas (1031) occur with only one value of Gender.
DET
1740 fr-pos/DET tokens (58% of all DET tokens) have a non-empty value of Gender.
The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (1528; 88%), PronType=Art (1216; 70%), Definite=Def (932; 54%).
DET tokens may have the following values of Gender:
Fem (915; 53% of non-empty Gender): la, une, cette, des, toute, sa, leur, aucune, notre, toutes
Masc (825; 47% of non-empty Gender): le, un, ce, des, son, tous, cet, mon, tout, votre
EMPTY (1244): les, l’, le, ces, des, quelques, chaque, d’, plusieurs, ce
Paradigm le | Masc | Fem |
---|---|---|
Definite=Def\|Number=Sing | le | la |
Definite=Def\|Number=Plur | les | |
Number=Sing | le | la |
ADJ
801 fr-pos/ADJ tokens (70% of all ADJ tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (510; 64%).
ADJ tokens may have the following values of Gender:
Fem (409; 51% of non-empty Gender): présente, dangereuses, grande, telle, sociale, première, collective, dernière, directrices, dérivée
Masc (392; 49% of non-empty Gender): européen, présent, structurels, faux, public, premier, important, international, nouveau, social
EMPTY (348): possible, technique, autres, communautaire, mêmes, nécessaires, responsable, applicables, communautaires, même
Paradigm présent | Masc | Fem |
---|---|---|
| | présent | présente |
VERB
458 fr-pos/VERB tokens (29% of all VERB tokens) have a non-empty value of Gender.
The most frequent other feature values with which VERB and Gender co-occurred: Person=EMPTY (458; 100%), VerbForm=Part (456; 100%), Mood=EMPTY (456; 100%), Tense=Past (455; 99%), Number=Sing (289; 63%).
VERB tokens may have the following values of Gender:
Fem (160; 35% of non-empty Gender): dite, concernées, imposées, incorporée, limitée, prévues, rendues, transportées, accompagnée, accordée
Masc (298; 65% of non-empty Gender): fait, compris, tenu, soumis, donné, mis, nommés, établi, dit, utilisés
EMPTY (1100): a, faire, concernant, est, convient, fait, ont, dire, font, pense
Paradigm dire | Masc | Fem |
---|---|---|
Number=Sing | dit | dite |
Number=Plur | dites |
PRON
270 fr-pos/PRON tokens (27% of all PRON tokens) have a non-empty value of Gender.
The most frequent other feature values with which PRON and Gender co-occurred: Person=3 (219; 81%), Number=Sing (203; 75%), PronType=Prs (197; 73%).
PRON tokens may have the following values of Gender:
Fem (55; 20% of non-empty Gender): elles, elle, celle, laquelle, une, celle-ci, auxquelles, elle-même, la, aucune
Masc (215; 80% of non-empty Gender): il, on, ils, le, ceux, chacun, tous, Nul, -il, celui
EMPTY (713): nous, qui, je, vous, ce, se, c’, s’, que, l’
Paradigm le | Masc | Fem |
---|---|---|
| | le | la |
AUX
37 fr-pos/AUX tokens (6% of all AUX tokens) have a non-empty value of Gender.
The most frequent other feature values with which AUX and Gender co-occurred: Number=Sing (37; 100%), Tense=Past (37; 100%), Mood=EMPTY (37; 100%), Person=EMPTY (37; 100%), VerbForm=Part (37; 100%).
AUX tokens may have the following values of Gender:
Masc (37; 100% of non-empty Gender): été, pu
EMPTY (623): est, a, sont, ont, être, peut, voudrais, devrait, doit, soient
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
NOUN –[det]–> DET (1421; 54%),
NOUN –[amod]–> ADJ (611; 70%),
NOUN –[conj]–> NOUN (185; 55%),
NOUN –[nmod:poss]–> DET (165; 93%),
NOUN –[acl]–> VERB (137; 50%),
VERB –[nsubj:pass]–> NOUN (75; 90%),
NOUN –[compound]–> NOUN (38; 93%),
ADJ –[conj]–> ADJ (35; 66%),
ADJ –[nsubj]–> NOUN (23; 57%),
NOUN –[nsubj]–> NOUN (14; 54%).
Treebank Statistics (UD_French-Sequoia)
This feature is universal.
It occurs with 2 different values: Fem, Masc.
23584 tokens (39%) have a non-empty value of Gender.
5559 types (64%) occur at least once with a non-empty value of Gender.
4026 lemmas (64%) occur at least once with a non-empty value of Gender.
The feature is used with 9 part-of-speech tags: fr-pos/NOUN (12128; 20% instances), fr-pos/DET (5091; 8% instances), fr-pos/ADJ (2366; 4% instances), fr-pos/VERB (1837; 3% instances), fr-pos/PROPN (1369; 2% instances), fr-pos/PRON (720; 1% instances), fr-pos/AUX (65; 0% instances), fr-pos/ADP (7; 0% instances), fr-pos/NUM (1; 0% instances).
NOUN
12128 fr-pos/NOUN tokens (94% of all NOUN tokens) have a non-empty value of Gender.
The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (8530; 70%).
NOUN tokens may have the following values of Gender:
Fem (5696; 47% of non-empty Gender): affaire, bivalirudine, perfusion, solution, administration, dose, étude, fois, maladie, guerre
Masc (6432; 53% of non-empty Gender): %, patients, mg, ans, cas, traitement, président, effets, cours, M.
EMPTY (767): h, kg, enfants, HLM, hui, D, collègues, ICP, intermédiaires, ACT
Paradigm patient | Masc | Fem |
---|---|---|
Number=Sing | patient | patiente |
Number=Plur | patients | patientes |
Gender seems to be a lexical feature of NOUN. 99% of lemmas (2527) occur with only one value of Gender.
DET
5091 fr-pos/DET tokens (57% of all DET tokens) have a non-empty value of Gender.
The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (5055; 99%), PronType=Art (4649; 91%), Definite=Def (3531; 69%).
DET tokens may have the following values of Gender:
Fem (2378; 47% of non-empty Gender): la, une, cette, sa, aucune, toute, certaines, ma, quelles, toutes
Masc (2713; 53% of non-empty Gender): le, un, ce, les, cet, aucun, tout, du, quel, certains
EMPTY (3874): les, l’, des, son, ces, ses, votre, de, d’, leur
Paradigm le | Masc | Fem |
---|---|---|
Definite=Def\|PronType=Art | le, les, l' | la, l' |
la |
ADJ
2366 fr-pos/ADJ tokens (63% of all ADJ tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (1540; 65%).
ADJ tokens may have the following values of Gender:
Fem (1101; 47% of non-empty Gender): première, européenne, rénale, française, toutes, nouvelle, nationale, seule, intraveineuse, osseuse
Masc (1265; 53% of non-empty Gender): français, ancien, osseux, tous, premier, faux, compris, nombreux, dernier, général
EMPTY (1409): autres, indésirables, autre, zolédronique, même, politique, clinique, politiques, deuxième, cliniques
Paradigm tout | Masc | Fem |
---|---|---|
Number=Sing | tout | toute |
Number=Plur | tous | toutes |
VERB
1837 fr-pos/VERB tokens (39% of all VERB tokens) have a non-empty value of Gender.
The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (1837; 100%), Tense=Past (1837; 100%), Person=EMPTY (1837; 100%), VerbForm=Part (1837; 100%), Number=Sing (1281; 70%), Voice=EMPTY (1205; 66%).
VERB tokens may have the following values of Gender:
Fem (564; 31% of non-empty Gender): observée, recommandée, administrée, destinée, rapportées, versées, menée, traitées, maintenue, associée
Masc (1273; 69% of non-empty Gender): mis, traités, utilisé, eu, atteints, administré, fait, reçu, pris, présenté
EMPTY (2909): voir, doit, a, faire, faut, est, concernant, prendre, agit, utiliser
Paradigm avoir | Masc | Fem |
---|---|---|
| | eu | eue |
PROPN
1369 fr-pos/PROPN tokens (45% of all PROPN tokens) have a non-empty value of Gender.
The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (1338; 98%).
PROPN tokens may have the following values of Gender:
Fem (398; 29% of non-empty Gender): commission, France, Paget, Europe, Denise, Christine, Chine, Jean, Afrique, Blanche
Masc (971; 71% of non-empty Gender): paris, Jacques, Taïwan, Chirac, Michel, conseil, Parlement, Hauts-de-Seine, Alain, Didier
EMPTY (1656): Aclasta, Angiox, Union, RPR, Halphen, Jean-Claude, Thomson, Méry, Éric, Francis
Paradigm Jean | Masc | Fem |
---|---|---|
| | Jean | Jean |
Gender seems to be a lexical feature of PROPN. 100% of lemmas (436) occur with only one value of Gender.
PRON
720 fr-pos/PRON tokens (29% of all PRON tokens) have a non-empty value of Gender.
The most frequent other feature values with which PRON and Gender co-occurred: Reflex=EMPTY (720; 100%), Person=3 (580; 81%), Number=Sing (571; 79%), PronType=EMPTY (541; 75%).
PRON tokens may have the following values of Gender:
Fem (151; 21% of non-empty Gender): elle, laquelle, la, elles, lesquelles, celle-ci, celle, celles, une, chacune
Masc (569; 79% of non-empty Gender): il, ils, un, le, -il, eux, lui, ceux, lequel, tous
EMPTY (1756): qui, nous, se, je, s’, vous, ce, que, c’, y
Paradigm il | Masc | Fem |
---|---|---|
Number=Sing | il, -il, On, -on | elle, -elle |
Number=Plur | ils, -ils | elles |
AUX
65 fr-pos/AUX tokens (3% of all AUX tokens) have a non-empty value of Gender.
The most frequent other feature values with which AUX and Gender co-occurred: Person=EMPTY (65; 100%), VerbForm=Part (65; 100%), Tense=Past (65; 100%), Mood=EMPTY (65; 100%), Number=Sing (50; 77%).
AUX tokens may have the following values of Gender:
Fem (22; 34% of non-empty Gender): appelée, avérées, considérée, devenue, appelées, avérée, classée, codée, considérées, devenues
Masc (43; 66% of non-empty Gender): dû, appelé, considérés, intitulé, voulu, considéré, resté, souhaité, Interrogé, apparu
EMPTY (2271): est, a, été, ont, être, sont, était, avait, avoir, peut
Paradigm appeler | Masc | Fem |
---|---|---|
Number=Sing | appelé | appelée |
Number=Sing\|Voice=Pass | appelé | |
Number=Plur | appelés | appelées |
ADP
7 fr-pos/ADP tokens (0% of all ADP tokens) have a non-empty value of Gender.
ADP tokens may have the following values of Gender:
Fem (2; 29% of non-empty Gender): à
Masc (5; 71% of non-empty Gender): à, de
EMPTY (9764): de, à, d’, en, pour, dans, par, sur, avec, chez
Paradigm à | Masc | Fem |
---|---|---|
| | à | à |
NUM
1 fr-pos/NUM token (0% of all NUM tokens) has a non-empty value of Gender.
The most frequent other feature values with which NUM and Gender co-occurred: NumType=Card (1; 100%).
NUM tokens may have the following values of Gender:
Masc (1; 100% of non-empty Gender): 17
EMPTY (1401): deux, trois, 5, 2, 2006, 10, 3, 30, 1, 4
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
NOUN –[det]–> DET (4336; 59%),
NOUN –[amod]–> ADJ (1839; 61%),
NOUN –[acl]–> VERB (528; 61%),
NOUN –[conj]–> NOUN (501; 55%),
PROPN –[det]–> DET (322; 66%),
VERB –[nsubj:pass]–> NOUN (294; 91%),
NOUN –[appos]–> NOUN (105; 55%),
VERB –[conj]–> VERB (90; 53%),
ADJ –[nsubj]–> NOUN (80; 60%),
VERB –[nsubj:pass]–> PRON (64; 60%).