home eu/feat edit page issue tracker

This page still pertains to UD version 1.

Number: number

The Number feature for Basque follows the standard UD guidelines for nouns, adjectives, determiners and adverbs. However, finite verbs contain agreement features on number for the subject, object and indirect object, so the Basque treebank follows the UD description for language-specific features, defining Number[erg]=Sing,Plur, Number[abs]=Sing,Plur, and Number[dat]=Sing,Plur.


Treebank Statistics (UD_Basque)

This feature is universal. It occurs with 2 different values: Plur, Sing.

This is a layered feature with the following layers: Number, Number[abs], Number[dat], Number[erg].

25829 tokens (27%) have a non-empty value of Number. 12090 types (57%) occur at least once with a non-empty value of Number. 5681 lemmas (58%) occur at least once with a non-empty value of Number. The feature is used with 11 part-of-speech tags: eu-pos/NOUN (13672; 14% instances), eu-pos/PROPN (5051; 5% instances), eu-pos/ADJ (3082; 3% instances), eu-pos/DET (2153; 2% instances), eu-pos/ADP (1129; 1% instances), eu-pos/VERB (537; 1% instances), eu-pos/AUX (158; 0% instances), eu-pos/PRON (22; 0% instances), eu-pos/ADV (12; 0% instances), eu-pos/SYM (12; 0% instances), eu-pos/NUM (1; 0% instances).

NOUN

13672 eu-pos/NOUN tokens (58% of all NOUN tokens) have a non-empty value of Number.

The most frequent other feature values with which NOUN and Number co-occurred: Definite=Def (13669; 100%), Animacy=Inan (7699; 56%).

NOUN tokens may have the following values of Number:

Paradigm taldeSingPlur
Animacy=Inan|Case=Abltaldetiktaldeetatik
Animacy=Inan|Case=Abstaldeataldeak, taldeok
Animacy=Inan|Case=Alltalderataldeetara
Animacy=Inan|Case=Comtaldearekin
Animacy=Inan|Case=Dattaldearitaldeei
Animacy=Inan|Case=Ergtaldeaktaldeek, taldekoek, taldeok
Animacy=Inan|Case=Gentaldearentaldeen
Animacy=Inan|Case=Inetaldean, taldearengantaldeetan
Animacy=Inan|Case=Loctaldeko, talderakotaldeetako
Case=AbsTaldeaTaldeak
Case=GenTaldearen

PROPN

5051 eu-pos/PROPN tokens (63% of all PROPN tokens) have a non-empty value of Number.

The most frequent other feature values with which PROPN and Number co-occurred: Definite=Def (5046; 100%).

PROPN tokens may have the following values of Number:

Paradigm SydneySingPlur
Case=AbsSydneykoak
Case=AllSydneyra
Case=IneSydneyn
Case=LocSydneyko

Number seems to be lexical feature of PROPN. 100% lemmas (1708) occur only with one value of Number.

ADJ

3082 eu-pos/ADJ tokens (65% of all ADJ tokens) have a non-empty value of Number.

The most frequent other feature values with which ADJ and Number co-occurred: Definite=Def (3081; 100%), Case=Abs (1966; 64%).

ADJ tokens may have the following values of Number:

Paradigm handiSingPlur
Case=Abshandiahandiak, handikoak
Case=Abs|Degree=Cmphandiagoa, haundiagoahandiagoak, haundiagoak
Case=Abs|Degree=Suphandienahandienak
Case=Abs|Degree=Abshandiegia
Case=All|Degree=Cmphandiagora
Case=Cauhandiagatikhandiengatik
Case=Comhandiarekinhandiekin
Case=Com|Degree=Cmphandiagoarekin
Case=Erghandiek
Case=Erg|Degree=Suphandienek
Case=Genhandien
Case=Gen|Degree=Cmphandiagoaren
Case=Inehandianhandietan
Case=Lochandikohandietako
Case=Loc|Degree=Cmphandiagoko
Case=Loc|Degree=Suphandienekohandienetariko

DET

2153 eu-pos/DET tokens (66% of all DET tokens) have a non-empty value of Number.

The most frequent other feature values with which DET and Number co-occurred: Definite=Def (1839; 85%).

DET tokens may have the following values of Number:

Paradigm beraSingPlur
Case=Ablberetik
Case=Abs|Definite=Defbera, berekoabereak, berekoak
Case=Abs|Definite=Indbere
Case=Ben|Definite=Defberarentzat
Case=Cau|Definite=Defberagatik
Case=Com|Definite=Defberarekin
Case=Dat|Definite=Defberari
Case=Erg|Definite=Defberak
Case=Genbere
Case=Gen|Definite=Defberaren
Case=Ineberean
Case=Ine|Definite=Defberarengan

ADP

1129 eu-pos/ADP tokens (77% of all ADP tokens) have a non-empty value of Number.

The most frequent other feature values with which ADP and Number co-occurred: Definite=Def (1073; 95%), Animacy=EMPTY (755; 67%).

ADP tokens may have the following values of Number:

Paradigm arteSingPlur
Animacy=Anim|Case=Ine|Definite=Defartean
Animacy=Anim|Case=Loc|Definite=Defartekoarteko
Animacy=Inan|Case=Abs|Definite=Defarte
Animacy=Inan|Case=Ine|Definite=Defartean
Animacy=Inan|Case=Loc|Definite=Defartekoarteko
Case=Abs|Definite=Defartearte
Case=Ineartean
Case=Ine|Definite=Defarteanartean
Case=Ine|Definite=Def|Degree=Supartean
Case=Ine|Definite=Def|Person=1artean
Case=Loc|Definite=Defartekoarteko
Case=Loc|Definite=Def|Degree=Suparteko
Case=Loc|Definite=Def|Person=3arteko

VERB

537 eu-pos/VERB tokens (4% of all VERB tokens) have a non-empty value of Number.

The most frequent other feature values with which VERB and Number co-occurred: Number[abs]=EMPTY (476; 89%), Mood=EMPTY (476; 89%), Person[abs]=EMPTY (476; 89%), Aspect=EMPTY (475; 88%), VerbForm=Part (343; 64%), Case=Abs (309; 58%).

VERB tokens may have the following values of Number:

Paradigm izanSingPlur
Aspect=Prog|Case=Abl|Mood=Ind|Person[abs]=1ginenekotik
Aspect=Prog|Case=Abs|Mood=Ind|Person[abs]=3direnazirenak
Aspect=Prog|Case=Abs|Mood=Ind|Person[abs]=3|Person[dat]=3zaizkienak
Aspect=Prog|Case=Dat|Mood=Ind|Person[abs]=3zirenei
Aspect=Prog|Case=Gen|Mood=Ind|Person[abs]=3zenaren
Aspect=Prog|Case=Insdenez
Case=Abs|VerbForm=Partizanaizanak
Case=Datizateari
Case=Ergizateak
Case=Erg|VerbForm=Partizanak
Case=Gen|VerbForm=Partizanaren
Case=Insizateaz
Case=Ins|VerbForm=Partizanaz

AUX

158 eu-pos/AUX tokens (2% of all AUX tokens) have a non-empty value of Number.

The most frequent other feature values with which AUX and Number co-occurred: Aspect=EMPTY (143; 91%), Person[abs]=3 (135; 85%), Mood=Ind (132; 84%), Number[abs]=Sing (103; 65%), Person[erg]=3 (81; 51%).

AUX tokens may have the following values of Number:

Paradigm *edunSingPlur
Case=Abl|Mood=Ind|Person[abs]=3|Person[erg]=3dutenenetatik
Case=Abs|Mood=Cnd|Person[abs]=3|Person[erg]=3lukeena
Case=Abs|Mood=Ind|Person[abs]=1|Person[erg]=3gaituztenak
Case=Abs|Mood=Ind|Person[abs]=3|Person[dat]=1|Person[erg]=3didana, zidatenazidatenak
Case=Abs|Mood=Ind|Person[abs]=3|Person[dat]=3|Person[erg]=3diona, zizkiona, ziona, diotenadienak, diotenak
Case=Abs|Mood=Ind|Person[abs]=3|Person[erg]=1nuena
Case=Abs|Mood=Ind|Person[abs]=3|Person[erg]=3duena, zuena, dutena, dituena, zutena, dutenetakoa, dutenenadutenak, dituztenak, dituenak, zituztenak
Case=Ben|Mood=Ind|Person[abs]=3|Person[erg]=3zuenarentzat
Case=Cau|Mood=Ind|Person[abs]=1|Person[erg]=3gintuenagatik
Case=Com|Mood=Ind|Person[abs]=3|Person[dat]=3|Person[erg]=3zizkiotenekin
Case=Com|Mood=Ind|Person[abs]=3|Person[erg]=1dudanarekin
Case=Com|Mood=Ind|Person[abs]=3|Person[erg]=2duzunarekin
Case=Com|Mood=Ind|Person[abs]=3|Person[erg]=3dituztenekin
Case=Dat|Mood=Ind|Person[abs]=3|Person[erg]=1dugunari
Case=Dat|Mood=Ind|Person[abs]=3|Person[erg]=3duenarizituenei
Case=Erg|Mood=Ind|Person[abs]=3|Person[dat]=3|Person[erg]=3ziotenakdiotenek
Case=Erg|Mood=Ind|Person[abs]=3|Person[erg]=1dudanak
Case=Erg|Mood=Ind|Person[abs]=3|Person[erg]=3duenak, zuenakdituztenek, dutenek
Case=Gen|Mood=Ind|Person[abs]=3|Person[erg]=3dutenaren, duenarendutenen, dituztenen
Case=Loc|Mood=Ind|Person[abs]=3|Person[erg]=3zuteneko, duteneko

PRON

22 eu-pos/PRON tokens (4% of all PRON tokens) have a non-empty value of Number.

The most frequent other feature values with which PRON and Number co-occurred: Definite=Def (22; 100%), PronType=EMPTY (22; 100%).

PRON tokens may have the following values of Number:

SYM

12 eu-pos/SYM tokens (92% of all SYM tokens) have a non-empty value of Number.

The most frequent other feature values with which SYM and Number co-occurred: Definite=Def (12; 100%), Case=Abs (10; 83%), Animacy=EMPTY (7; 58%).

SYM tokens may have the following values of Number:

Paradigm kVSingPlur
KVkv

ADV

12 eu-pos/ADV tokens (0% of all ADV tokens) have a non-empty value of Number.

ADV tokens may have the following values of Number:

Paradigm samarSingPlur
Case=Abssamarra
Case=Inesamarreansamarretan

NUM

1 eu-pos/NUM tokens (0% of all NUM tokens) have a non-empty value of Number.

The most frequent other feature values with which NUM and Number co-occurred: NumType=Card (1; 100%).

NUM tokens may have the following values of Number:

Relations with Agreement in Number

The 10 most frequent relations where parent and child node agree in Number: NOUN –[nmod]–> DET (207; 52%), ADJ –[nsubj]–> NOUN (166; 69%), PROPN –[nmod]–> PROPN (52; 65%), PROPN –[appos]–> PROPN (43; 68%), NOUN –[nsubj]–> DET (41; 59%), ADJ –[conj]–> ADJ (33; 51%), PROPN –[appos]–> NOUN (32; 64%), NOUN –[conj]–> PROPN (27; 53%), ADJ –[nsubj]–> DET (23; 79%), DET –[nmod]–> DET (8; 80%).


Number in other languages: [am] [ar] [bg] [bxr] [ca] [ckb] [cop] [cs] [cu] [da] [de] [el] [en] [es] [et] [eu] [fa] [fi] [fo] [fr] [ga] [gl] [got] [grc] [he] [hi] [hr] [hu] [id] [it] [ja] [kk] [kmr] [ko] [la] [lv] [mr] [nl] [no] [pl] [pt] [ro] [ru] [sa] [sk] [sla] [sl] [so] [sr] [sv] [swl] [ta] [tr] [u] [ug] [uk] [ur] [urj] [vi] [yue] [zh]