
This page still pertains to UD version 1.

Gender: gender

The possible values of Gender in French are masculine and feminine. The feature occurs with nouns, adjectives, past participles, determiners and pronouns. Words from other languages can have neuter gender (Neut).


Treebank Statistics (UD_French)

This feature is universal. It occurs with 3 different values: Fem, Masc, Neut.

171614 tokens (44%) have a non-empty value of Gender. 20761 types (49%) occur at least once with a non-empty value of Gender. 13751 lemmas (42%) occur at least once with a non-empty value of Gender. The feature is used with 8 part-of-speech tags: fr-pos/NOUN (72710; 19% of all tokens), fr-pos/DET (57159; 15%), fr-pos/ADJ (22137; 6%), fr-pos/VERB (10913; 3%), fr-pos/PRON (7772; 2%), fr-pos/AUX (912; 0%), fr-pos/NUM (8; 0%), fr-pos/PROPN (3; 0%).
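The token counts above can be approximated from any CoNLL-U file. The following is a minimal sketch, not the script that generated this page: the sample sentence is hand-made, the helper names (`parse_feats`, `read_tokens`, `gender_coverage`) are hypothetical, and the per-tag percentage is assumed to be a share of all tokens.

```python
from collections import Counter

# Tiny hand-made CoNLL-U sample (hypothetical, not taken from UD_French).
SAMPLE = """\
1\tLa\tle\tDET\t_\tDefinite=Def|Gender=Fem|Number=Sing|PronType=Art\t2\tdet\t_\t_
2\tmaison\tmaison\tNOUN\t_\tGender=Fem|Number=Sing\t4\tnsubj\t_\t_
3\test\têtre\tAUX\t_\tMood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin\t4\tcop\t_\t_
4\tgrande\tgrand\tADJ\t_\tGender=Fem|Number=Sing\t0\troot\t_\t_
5\t.\t.\tPUNCT\t_\t_\t4\tpunct\t_\t_
"""


def parse_feats(feats):
    """Turn a FEATS string such as 'Gender=Fem|Number=Sing' into a dict."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def read_tokens(text):
    """Yield (form, lemma, upos, feats_dict) for each basic token line."""
    for line in text.splitlines():
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        # Skip multiword token ranges (e.g. 1-2) and empty nodes (e.g. 1.1).
        if not cols[0].isdigit():
            continue
        yield cols[1], cols[2], cols[3], parse_feats(cols[5])


def gender_coverage(text):
    """Count all tokens, tokens with Gender, and tokens with Gender per UPOS."""
    total = 0
    with_gender = 0
    per_pos = Counter()
    for _form, _lemma, upos, feats in read_tokens(text):
        total += 1
        if "Gender" in feats:
            with_gender += 1
            per_pos[upos] += 1
    return total, with_gender, per_pos


total, with_gender, per_pos = gender_coverage(SAMPLE)
print(f"{with_gender} of {total} tokens have a non-empty value of Gender")
for upos, n in per_pos.most_common():
    print(f"{upos}: {n} tokens with Gender")
```

Counting types and lemmas would work the same way, collecting the distinct forms and lemmas into sets instead of incrementing a counter.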

NOUN

72710 fr-pos/NOUN tokens (99% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (54660; 75%).

NOUN tokens may have the following values of Gender:

Paradigm enfant (Masc, Fem)
Number=Sing: enfant (Masc); enfant (Fem)
Number=Plur: enfants

Gender seems to be a lexical feature of NOUN: 98% of lemmas (9120) occur with only one value of Gender.
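The "lexical feature" observation amounts to counting how many lemmas of a given part of speech are attested with exactly one Gender value. A rough sketch of that check follows; the token triples are invented for illustration and the function names are hypothetical.

```python
from collections import defaultdict

# Hypothetical (lemma, UPOS, FEATS) triples standing in for treebank tokens.
TOKENS = [
    ("maison", "NOUN", "Gender=Fem|Number=Sing"),
    ("maison", "NOUN", "Gender=Fem|Number=Plur"),
    ("livre", "NOUN", "Gender=Masc|Number=Sing"),
    ("enfant", "NOUN", "Gender=Masc|Number=Sing"),
    ("enfant", "NOUN", "Gender=Fem|Number=Sing"),
    ("grand", "ADJ", "Gender=Fem|Number=Sing"),
]


def gender_of(feats):
    """Return the Gender value in a FEATS string, or None if absent."""
    for pair in feats.split("|"):
        name, _, value = pair.partition("=")
        if name == "Gender":
            return value
    return None


def single_gender_share(tokens, upos):
    """Return (lemmas with exactly one Gender value, lemmas with any Gender value)."""
    values = defaultdict(set)
    for lemma, pos, feats in tokens:
        gender = gender_of(feats)
        if pos == upos and gender is not None:
            values[lemma].add(gender)
    single = sum(1 for seen in values.values() if len(seen) == 1)
    return single, len(values)


single, total = single_gender_share(TOKENS, "NOUN")
print(f"{single} of {total} NOUN lemmas occur with only one value of Gender")
```

In this toy data `enfant` is the only noun lemma seen with both values, which mirrors the paradigm shown for it on this page.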

DET

57159 fr-pos/DET tokens (95% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: PronType=Art (49211; 86%), Number=Sing (44559; 78%), Definite=Def (39410; 69%).
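Co-occurrence figures of this kind can be obtained by tallying every other feature=value pair on tokens that carry Gender. The sketch below assumes that reading; the FEATS strings are made up, `cooccurring_values` is a hypothetical helper, and absent features (shown as EMPTY elsewhere on this page) are not counted.

```python
from collections import Counter

# Hypothetical FEATS strings for DET tokens.
DET_FEATS = [
    "Definite=Def|Gender=Masc|Number=Sing|PronType=Art",
    "Definite=Def|Gender=Fem|Number=Sing|PronType=Art",
    "Definite=Ind|Gender=Fem|Number=Sing|PronType=Art",
    "Gender=Masc|Number=Sing|Poss=Yes",
    "Number=Plur|PronType=Art",  # no Gender, so this token is ignored
]


def cooccurring_values(feats_list, feature="Gender"):
    """Count other feature=value pairs on tokens that have `feature`."""
    counts = Counter()
    n_with_feature = 0
    for feats in feats_list:
        pairs = [] if feats == "_" else feats.split("|")
        names = {pair.split("=", 1)[0] for pair in pairs}
        if feature not in names:
            continue
        n_with_feature += 1
        counts.update(p for p in pairs if not p.startswith(feature + "="))
    return n_with_feature, counts


n, counts = cooccurring_values(DET_FEATS)
for value, count in counts.most_common(3):
    print(f"{value} ({count}; {100 * count // n}%)")
```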

DET tokens may have the following values of Gender:

Paradigm le (Masc, Fem)
Number=Sing: la (Fem)
Number=Sing|PronType=Art: le, l', l (Masc); la, l', l, là, Les (Fem)
Number=Plur|PronType=Art: les (Masc); les, L (Fem)

ADJ

22137 fr-pos/ADJ tokens (99% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (16053; 73%).

ADJ tokens may have the following values of Gender:

Paradigm premier (Masc, Fem)
Number=Sing: premier, 1er, Ier, 1e, 1 (Masc); première, 1ère, 1re (Fem)
Number=Plur: premiers (Masc); premières (Fem)
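A paradigm table like the one above groups the attested word forms of one lemma by their remaining features (rows) and by Gender value (columns). The following sketch shows one way to build such a grouping; the token list is illustrative and `paradigm` is a hypothetical helper, not the tool used for this page.

```python
from collections import defaultdict

# Hypothetical (form, lemma, FEATS) triples for the lemma "premier".
TOKENS = [
    ("premier", "premier", "Gender=Masc|Number=Sing"),
    ("1er", "premier", "Gender=Masc|Number=Sing"),
    ("première", "premier", "Gender=Fem|Number=Sing"),
    ("premiers", "premier", "Gender=Masc|Number=Plur"),
    ("premières", "premier", "Gender=Fem|Number=Plur"),
    ("premier", "premier", "Gender=Masc|Number=Sing"),  # repeated form
]


def paradigm(tokens, lemma):
    """Map row label (other features) -> Gender value -> list of distinct forms."""
    table = defaultdict(lambda: defaultdict(list))
    for form, tok_lemma, feats in tokens:
        if tok_lemma != lemma or feats == "_":
            continue
        pairs = dict(pair.split("=", 1) for pair in feats.split("|"))
        gender = pairs.pop("Gender", None)
        if gender is None:
            continue
        # The row label is built from whatever features remain besides Gender.
        row = "|".join(f"{name}={value}" for name, value in sorted(pairs.items()))
        if form not in table[row][gender]:
            table[row][gender].append(form)
    return table


table = paradigm(TOKENS, "premier")
for row, cells in table.items():
    masc = ", ".join(cells.get("Masc", []))
    fem = ", ".join(cells.get("Fem", []))
    print(f"{row}: {masc} (Masc); {fem} (Fem)")
```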

VERB

10913 fr-pos/VERB tokens (36% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: VerbForm=Part (10913; 100%), Person=EMPTY (10913; 100%), Tense=Past (10913; 100%), Mood=EMPTY (10913; 100%), Number=Sing (8770; 80%).

VERB tokens may have the following values of Gender:

Paradigm faire (Masc, Fem)
Number=Sing: fait, fais (Masc); faite (Fem)
Number=Plur: faits (Masc); faites (Fem)

PRON

7772 fr-pos/PRON tokens (44% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (6723; 87%), Person=3 (6474; 83%), PronType=Prs (5674; 73%).

PRON tokens may have the following values of Gender:

Paradigm il (Masc, Fem)
Number=Sing|Person=2: -Tu
Number=Sing|Person=3: il, -il, Lui, t-il (Masc); -elle, elle (Fem)
Number=Sing: Lui
Number=Plur|Person=3: ils, -ils (Masc); elles, -elles (Fem)

AUX

912 fr-pos/AUX tokens (7% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Mood=EMPTY (912; 100%), Person=EMPTY (912; 100%), Tense=Past (912; 100%), VerbForm=Part (912; 100%), Number=Sing (911; 100%).

AUX tokens may have the following values of Gender:

Gender seems to be a lexical feature of AUX: 100% of lemmas (12) occur with only one value of Gender.

NUM

8 fr-pos/NUM tokens (0% of all NUM tokens) have a non-empty value of Gender.

NUM tokens may have the following values of Gender:

PROPN

3 fr-pos/PROPN tokens (0% of all PROPN tokens) have a non-empty value of Gender.

PROPN tokens may have the following values of Gender:

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[det]–> DET (48477; 99%), NOUN –[amod]–> ADJ (18023; 99%), NOUN –[nmod:poss]–> DET (4175; 99%), NOUN –[conj]–> NOUN (3312; 63%), NOUN –[acl]–> VERB (2894; 70%), VERB –[nsubj:pass]–> NOUN (1597; 96%), ADJ –[conj]–> ADJ (877; 97%), NOUN –[appos]–> NOUN (872; 58%), ADJ –[nsubj]–> NOUN (866; 97%), NOUN –[nsubj]–> NOUN (575; 62%).
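Agreement figures of this kind can be approximated by walking the dependency tree and comparing the Gender value of each child with that of its parent. The sketch below is only an illustration: the sentence is invented, the function names are hypothetical, and the percentage is assumed to be taken over all parent-child pairs of the same type, which this page does not state explicitly.

```python
from collections import Counter

# Hypothetical sentence as (id, UPOS, FEATS, head, deprel) tuples.
SENTENCE = [
    (1, "DET", "Gender=Fem|Number=Sing", 2, "det"),
    (2, "NOUN", "Gender=Fem|Number=Sing", 0, "root"),
    (3, "ADJ", "Gender=Fem|Number=Sing", 2, "amod"),
    (4, "ADJ", "Gender=Masc|Number=Sing", 2, "amod"),
]


def gender_of(feats):
    """Return the Gender value in a FEATS string, or None if absent."""
    for pair in feats.split("|"):
        name, _, value = pair.partition("=")
        if name == "Gender":
            return value
    return None


def gender_agreement(sentence):
    """Count agreeing and total pairs per (parent UPOS, deprel, child UPOS)."""
    by_id = {token[0]: token for token in sentence}
    agree, total = Counter(), Counter()
    for _tid, upos, feats, head, deprel in sentence:
        if head == 0:  # the root has no parent node
            continue
        parent = by_id[head]
        key = (parent[1], deprel, upos)
        total[key] += 1
        child_gender = gender_of(feats)
        if child_gender is not None and child_gender == gender_of(parent[2]):
            agree[key] += 1
    return agree, total


agree, total = gender_agreement(SENTENCE)
for (parent_pos, deprel, child_pos), n in total.items():
    hits = agree[(parent_pos, deprel, child_pos)]
    print(f"{parent_pos} -[{deprel}]-> {child_pos} ({hits}; {100 * hits // n}%)")
```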


Treebank Statistics (UD_French-ParTUT)

This feature is universal. It occurs with 2 different values: Fem, Masc.
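The inventory of values differs between treebanks (three values in UD_French, two here), so it can be useful to list the values actually attested in a given file. A minimal sketch, assuming the FEATS column has already been extracted into a list of strings; the data and the function name `gender_values` are hypothetical.

```python
def gender_values(feats_column):
    """Return the sorted list of distinct Gender values in a FEATS column."""
    values = set()
    for feats in feats_column:
        if feats == "_":
            continue
        for pair in feats.split("|"):
            name, _, value = pair.partition("=")
            if name == "Gender":
                values.add(value)
    return sorted(values)


# Hypothetical FEATS strings, one per token.
FEATS = [
    "Gender=Fem|Number=Sing",
    "_",
    "Gender=Masc|Number=Plur",
    "Number=Sing",
    "Gender=Fem|Number=Plur",
]

values = gender_values(FEATS)
print(f"It occurs with {len(values)} different values: {', '.join(values)}.")
```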

7036 tokens (39%) have a non-empty value of Gender. 2017 types (62%) occur at least once with a non-empty value of Gender. 1596 lemmas (65%) occur at least once with a non-empty value of Gender. The feature is used with 6 part-of-speech tags: fr-pos/NOUN (3730; 21% of all tokens), fr-pos/DET (1740; 10%), fr-pos/ADJ (801; 4%), fr-pos/VERB (458; 3%), fr-pos/PRON (270; 2%), fr-pos/AUX (37; 0%).

NOUN

3730 fr-pos/NOUN tokens (99% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (2580; 69%).

NOUN tokens may have the following values of Gender:

Paradigm président (Masc, Fem)
Number=Sing: président (Masc); présidente (Fem)
Number=Plur: présidents (Masc)

Gender seems to be a lexical feature of NOUN: 97% of lemmas (1031) occur with only one value of Gender.

DET

1740 fr-pos/DET tokens (58% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (1528; 88%), PronType=Art (1216; 70%), Definite=Def (932; 54%).

DET tokens may have the following values of Gender:

Paradigm le (Masc, Fem)
Definite=Def|Number=Sing: le (Masc); la (Fem)
Definite=Def|Number=Plur: les
Number=Sing: le (Masc); la (Fem)

ADJ

801 fr-pos/ADJ tokens (70% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (510; 64%).

ADJ tokens may have the following values of Gender:

Paradigm présent (Masc, Fem)
présent (Masc); présente (Fem)

VERB

458 fr-pos/VERB tokens (29% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Person=EMPTY (458; 100%), VerbForm=Part (456; 100%), Mood=EMPTY (456; 100%), Tense=Past (455; 99%), Number=Sing (289; 63%).

VERB tokens may have the following values of Gender:

Paradigm dire (Masc, Fem)
Number=Sing: dit (Masc); dite (Fem)
Number=Plur: dites (Fem)

PRON

270 fr-pos/PRON tokens (27% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Person=3 (219; 81%), Number=Sing (203; 75%), PronType=Prs (197; 73%).

PRON tokens may have the following values of Gender:

Paradigm le (Masc, Fem)
le (Masc); la (Fem)

AUX

37 fr-pos/AUX tokens (6% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Number=Sing (37; 100%), Tense=Past (37; 100%), Mood=EMPTY (37; 100%), Person=EMPTY (37; 100%), VerbForm=Part (37; 100%).

AUX tokens may have the following values of Gender:

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[det]–> DET (1421; 54%), NOUN –[amod]–> ADJ (611; 70%), NOUN –[conj]–> NOUN (185; 55%), NOUN –[nmod:poss]–> DET (165; 93%), NOUN –[acl]–> VERB (137; 50%), VERB –[nsubj:pass]–> NOUN (75; 90%), NOUN –[compound]–> NOUN (38; 93%), ADJ –[conj]–> ADJ (35; 66%), ADJ –[nsubj]–> NOUN (23; 57%), NOUN –[nsubj]–> NOUN (14; 54%).


Treebank Statistics (UD_French-Sequoia)

This feature is universal. It occurs with 2 different values: Fem, Masc.

23584 tokens (39%) have a non-empty value of Gender. 5559 types (64%) occur at least once with a non-empty value of Gender. 4026 lemmas (64%) occur at least once with a non-empty value of Gender. The feature is used with 9 part-of-speech tags: fr-pos/NOUN (12128; 20% of all tokens), fr-pos/DET (5091; 8%), fr-pos/ADJ (2366; 4%), fr-pos/VERB (1837; 3%), fr-pos/PROPN (1369; 2%), fr-pos/PRON (720; 1%), fr-pos/AUX (65; 0%), fr-pos/ADP (7; 0%), fr-pos/NUM (1; 0%).

NOUN

12128 fr-pos/NOUN tokens (94% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (8530; 70%).

NOUN tokens may have the following values of Gender:

Paradigm patient (Masc, Fem)
Number=Sing: patient (Masc); patiente (Fem)
Number=Plur: patients (Masc); patientes (Fem)

Gender seems to be a lexical feature of NOUN: 99% of lemmas (2527) occur with only one value of Gender.

DET

5091 fr-pos/DET tokens (57% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (5055; 99%), PronType=Art (4649; 91%), Definite=Def (3531; 69%).

DET tokens may have the following values of Gender:

Paradigm le (Masc, Fem)
Definite=Def|PronType=Art: le, les, l' (Masc); la, l' (Fem)
la (Fem)

ADJ

2366 fr-pos/ADJ tokens (63% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (1540; 65%).

ADJ tokens may have the following values of Gender:

Paradigm tout (Masc, Fem)
Number=Sing: tout (Masc); toute (Fem)
Number=Plur: tous (Masc); toutes (Fem)

VERB

1837 fr-pos/VERB tokens (39% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (1837; 100%), Tense=Past (1837; 100%), Person=EMPTY (1837; 100%), VerbForm=Part (1837; 100%), Number=Sing (1281; 70%), Voice=EMPTY (1205; 66%).

VERB tokens may have the following values of Gender:

Paradigm avoir (Masc, Fem)
eu (Masc); eue (Fem)

PROPN

1369 fr-pos/PROPN tokens (45% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (1338; 98%).

PROPN tokens may have the following values of Gender:

Paradigm Jean (Masc, Fem)
Jean (Masc); Jean (Fem)

Gender seems to be a lexical feature of PROPN: 100% of lemmas (436) occur with only one value of Gender.

PRON

720 fr-pos/PRON tokens (29% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Reflex=EMPTY (720; 100%), Person=3 (580; 81%), Number=Sing (571; 79%), PronType=EMPTY (541; 75%).

PRON tokens may have the following values of Gender:

Paradigm il (Masc, Fem)
Number=Sing: il, -il, On, -on (Masc); elle, -elle (Fem)
Number=Plur: ils, -ils (Masc); elles (Fem)

AUX

65 fr-pos/AUX tokens (3% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Person=EMPTY (65; 100%), VerbForm=Part (65; 100%), Tense=Past (65; 100%), Mood=EMPTY (65; 100%), Number=Sing (50; 77%).

AUX tokens may have the following values of Gender:

Paradigm appeler (Masc, Fem)
Number=Sing: appelé (Masc); appelée (Fem)
Number=Sing|Voice=Pass: appelé (Masc)
Number=Plur: appelés (Masc); appelées (Fem)

ADP

7 fr-pos/ADP tokens (0% of all ADP tokens) have a non-empty value of Gender.

ADP tokens may have the following values of Gender:

Paradigm à (Masc, Fem)
à (Masc); à (Fem)

NUM

1 fr-pos/NUM token (0% of all NUM tokens) has a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumType=Card (1; 100%).

NUM tokens may have the following values of Gender:

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[det]–> DET (4336; 59%), NOUN –[amod]–> ADJ (1839; 61%), NOUN –[acl]–> VERB (528; 61%), NOUN –[conj]–> NOUN (501; 55%), PROPN –[det]–> DET (322; 66%), VERB –[nsubj:pass]–> NOUN (294; 91%), NOUN –[appos]–> NOUN (105; 55%), VERB –[conj]–> VERB (90; 53%), ADJ –[nsubj]–> NOUN (80; 60%), VERB –[nsubj:pass]–> PRON (64; 60%).


Gender in other languages: [am] [ar] [bg] [bxr] [ca] [ckb] [cop] [cs] [cu] [da] [de] [el] [en] [es] [et] [eu] [fa] [fo] [fr] [ga] [gl] [got] [grc] [he] [hi] [hr] [hu] [id] [it] [ja] [kk] [kmr] [ko] [la] [lv] [mr] [nl] [no] [pl] [pt] [ro] [ru] [sa] [sk] [sla] [sl] [so] [sr] [sv] [swl] [ta] [tr] [u] [ug] [uk] [ur] [vi] [yue] [zh]