home en/feat edit page issue tracker

This page still pertains to UD version 1.

Number: number

In English, Number is a feature of nouns and other parts of speech that mark agreement with nouns, i.e. personal pronouns, verbs, and some determiners.

Sing: singular

A singular noun denotes one person, animal or thing. Every noun with the PTB tag NN or NNP is marked with this feature.

Examples:

Pronouns that refer to a single person, an animal or a thing are also marked with this feature.

We also mark all verbs with the PTB tag VBZ with this feature.

Examples:

Further, we mark inflections of be that can only have a singular noun or pronoun in subject position with this feature.

Demonstrative determiners of singular nouns and demonstrative pronouns that refer to singular nouns are also marked with this feature.

Plur: plural

A plural noun denotes several persons, animals or things. Every noun with the PTB tag NNS or NNPS is marked with this feature.

Examples:

Pronouns that refer to a single person, an animal or a thing are also marked with this feature.

Demonstrative determiners of plural nouns and demonstrative pronouns that refer to plural nouns are also marked with this feature.

We currently don’t mark plurale tantum or collective/mass nouns.


Treebank Statistics (UD_English)

This feature is universal. It occurs with 2 different values: Plur, Sing.

77145 tokens (34%) have a non-empty value of Number. 13413 types (74%) occur at least once with a non-empty value of Number. 10992 lemmas (73%) occur at least once with a non-empty value of Number. The feature is used with 11 part-of-speech tags: en-pos/NOUN (38975; 17% instances), en-pos/PRON (15176; 7% instances), en-pos/PROPN (14821; 6% instances), en-pos/AUX (4707; 2% instances), en-pos/VERB (2141; 1% instances), en-pos/DET (1278; 1% instances), en-pos/SYM (38; 0% instances), en-pos/ADJ (5; 0% instances), en-pos/X (2; 0% instances), en-pos/ADP (1; 0% instances), en-pos/NUM (1; 0% instances).

NOUN

38975 en-pos/NOUN tokens (100% of all NOUN tokens) have a non-empty value of Number.

NOUN tokens may have the following values of Number:

Paradigm timeSingPlur
timetimes

PRON

15176 en-pos/PRON tokens (73% of all PRON tokens) have a non-empty value of Number.

The most frequent other feature values with which PRON and Number co-occurred: PronType=Prs (13693; 90%), Poss=EMPTY (12603; 83%), Gender=EMPTY (10746; 71%), Case=Nom (8668; 57%).

PRON tokens may have the following values of Number:

Number seems to be lexical feature of PRON. 100% lemmas (36) occur only with one value of Number.

PROPN

14821 en-pos/PROPN tokens (100% of all PROPN tokens) have a non-empty value of Number.

PROPN tokens may have the following values of Number:

Paradigm StatesSingPlur
StatesStates

Number seems to be lexical feature of PROPN. 100% lemmas (4854) occur only with one value of Number.

AUX

4707 en-pos/AUX tokens (34% of all AUX tokens) have a non-empty value of Number.

The most frequent other feature values with which AUX and Number co-occurred: Mood=Ind (4707; 100%), VerbForm=Fin (4707; 100%), Person=3 (4371; 93%), Tense=Pres (3463; 74%).

AUX tokens may have the following values of Number:

VERB

2141 en-pos/VERB tokens (8% of all VERB tokens) have a non-empty value of Number.

The most frequent other feature values with which VERB and Number co-occurred: VerbForm=Fin (2136; 100%), Mood=Ind (2136; 100%), Tense=Pres (2016; 94%).

VERB tokens may have the following values of Number:

Number seems to be lexical feature of VERB. 100% lemmas (419) occur only with one value of Number.

DET

1278 en-pos/DET tokens (7% of all DET tokens) have a non-empty value of Number.

The most frequent other feature values with which DET and Number co-occurred: Definite=EMPTY (1278; 100%), PronType=Dem (1277; 100%).

DET tokens may have the following values of Number:

SYM

38 en-pos/SYM tokens (6% of all SYM tokens) have a non-empty value of Number.

SYM tokens may have the following values of Number:

ADJ

5 en-pos/ADJ tokens (0% of all ADJ tokens) have a non-empty value of Number.

The most frequent other feature values with which ADJ and Number co-occurred: Degree=EMPTY (5; 100%).

ADJ tokens may have the following values of Number:

X

2 en-pos/X tokens (0% of all X tokens) have a non-empty value of Number.

X tokens may have the following values of Number:

NUM

1 en-pos/NUM tokens (0% of all NUM tokens) have a non-empty value of Number.

The most frequent other feature values with which NUM and Number co-occurred: NumType=EMPTY (1; 100%).

NUM tokens may have the following values of Number:

ADP

1 en-pos/ADP tokens (0% of all ADP tokens) have a non-empty value of Number.

ADP tokens may have the following values of Number:

Relations with Agreement in Number

The 10 most frequent relations where parent and child node agree in Number: NOUN –[compound]–> NOUN (3513; 71%), NOUN –[nmod]–> NOUN (2801; 61%), PROPN –[compound]–> PROPN (2273; 91%), NOUN –[conj]–> NOUN (1744; 79%), NOUN –[nmod:poss]–> PRON (1650; 50%), PROPN –[flat]–> PROPN (1559; 99%), NOUN –[cop]–> AUX (1083; 60%), NOUN –[nmod]–> PROPN (990; 71%), NOUN –[compound]–> PROPN (809; 72%), PROPN –[conj]–> PROPN (767; 95%).


Treebank Statistics (UD_English-ParTUT)

This feature is universal. It occurs with 2 different values: Plur, Sing.

12037 tokens (32%) have a non-empty value of Number. 3313 types (54%) occur at least once with a non-empty value of Number. 2501 lemmas (49%) occur at least once with a non-empty value of Number. The feature is used with 6 part-of-speech tags: en-pos/NOUN (7961; 21% instances), en-pos/DET (1178; 3% instances), en-pos/PRON (989; 3% instances), en-pos/AUX (958; 3% instances), en-pos/VERB (936; 2% instances), en-pos/ADJ (15; 0% instances).

NOUN

7961 en-pos/NOUN tokens (99% of all NOUN tokens) have a non-empty value of Number.

NOUN tokens may have the following values of Number:

Paradigm workSingPlur
workworks

DET

1178 en-pos/DET tokens (29% of all DET tokens) have a non-empty value of Number.

The most frequent other feature values with which DET and Number co-occurred: Poss=EMPTY (1067; 91%), PronType=Art (773; 66%), Definite=Ind (766; 65%).

DET tokens may have the following values of Number:

Paradigm thisSingPlur
thisthese

PRON

989 en-pos/PRON tokens (66% of all PRON tokens) have a non-empty value of Number.

The most frequent other feature values with which PRON and Number co-occurred: PronType=Prs (841; 85%), Gender=EMPTY (794; 80%), Person=3 (555; 56%).

PRON tokens may have the following values of Number:

Paradigm thatSingPlur
Person=3thatthose
thatthose

AUX

958 en-pos/AUX tokens (53% of all AUX tokens) have a non-empty value of Number.

The most frequent other feature values with which AUX and Number co-occurred: VerbForm=Fin (951; 99%), Mood=Ind (948; 99%), Tense=Pres (737; 77%), Person=3 (589; 61%).

AUX tokens may have the following values of Number:

Paradigm beSingPlur
Mood=Ind|Person=1|Tense=Pres|VerbForm=Finam, 'm
Mood=Ind|Person=2|Tense=Past|VerbForm=Finwere
Mood=Ind|Person=2|Tense=Pres|VerbForm=Finare
Mood=Ind|Person=3|Tense=Past|VerbForm=Finwas
Mood=Ind|Person=3|Tense=Pres|VerbForm=Finis, 's
Mood=Ind|Tense=Past|VerbForm=Finwere
Mood=Ind|Tense=Pres|VerbForm=Finare, 're
Tense=Presbe
Tense=Pres|VerbForm=Partbeing

VERB

936 en-pos/VERB tokens (26% of all VERB tokens) have a non-empty value of Number.

The most frequent other feature values with which VERB and Number co-occurred: Tense=Pres (929; 99%), VerbForm=Fin (768; 82%), Mood=Ind (763; 82%), Person=EMPTY (521; 56%).

VERB tokens may have the following values of Number:

Paradigm haveSingPlur
Mood=Ind|Person=1|VerbForm=Finhave
Mood=Ind|Person=3|VerbForm=Finhas
Mood=Ind|VerbForm=Finhave
VerbForm=Parthaving

ADJ

15 en-pos/ADJ tokens (0% of all ADJ tokens) have a non-empty value of Number.

The most frequent other feature values with which ADJ and Number co-occurred: Degree=EMPTY (15; 100%).

ADJ tokens may have the following values of Number:

Relations with Agreement in Number

The 10 most frequent relations where parent and child node agree in Number: NOUN –[nmod]–> NOUN (1455; 60%), NOUN –[conj]–> NOUN (522; 80%), NOUN –[cop]–> AUX (180; 76%), NOUN –[nsubj]–> NOUN (75; 78%), NOUN –[nsubj]–> PRON (56; 64%), NOUN –[compound]–> NOUN (39; 59%), NOUN –[appos]–> NOUN (31; 78%), NOUN –[obj]–> NOUN (13; 93%), NOUN –[amod]–> NOUN (6; 75%), NOUN –[vocative]–> NOUN (5; 100%).


Number in other languages: [am] [ar] [bg] [bxr] [ca] [ckb] [cop] [cs] [cu] [da] [de] [el] [en] [es] [et] [eu] [fa] [fi] [fo] [fr] [ga] [gl] [got] [grc] [he] [hi] [hr] [hu] [id] [it] [ja] [kk] [kmr] [ko] [la] [lv] [mr] [nl] [no] [pl] [pt] [ro] [ru] [sa] [sk] [sla] [sl] [so] [sr] [sv] [swl] [ta] [tr] [u] [ug] [uk] [ur] [urj] [vi] [yue] [zh]