Number: number
In Latvian, number is marked on nouns (NOUN, PROPN), adjectives (ADJ), some numerals (NUM), verbs (VERB) and some pronouns (PRON and DET).
Values used:
Sing (singular number)
Plur (plural number)
Ptan (plurale tantum)
Coll (collective / mass / singulare tantum) - may be annotated unevenly, as it is hard to distinguish from Sing
Theoretically possible values:
Dual (dual number) - older texts feature forms like abi roki “both hands”, but the dual is so rare in the contemporary language that it is absent from the corpus.
Values not present in Latvian:
Tri (trial number)
Pauc (paucal number)
Grpa (greater paucal number)
Grpl (greater plural number)
Inv (inverse number)
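The value inventory above can be turned into a small check on CoNLL-U `FEATS` strings. The following is a minimal sketch, not part of the treebank tooling: the function names `parse_feats` and `classify_number` and the three group names are hypothetical, and the only thing taken from this page is the list of values in each group.

```python
# Value groups copied from the lists on this page; the group names are illustrative.
USED = {"Sing", "Plur", "Ptan", "Coll"}
THEORETICAL = {"Dual"}
ABSENT = {"Tri", "Pauc", "Grpa", "Grpl", "Inv"}


def parse_feats(feats):
    """Parse a CoNLL-U FEATS string such as 'Case=Nom|Number=Sing' into a dict."""
    if feats in ("", "_"):
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def classify_number(feats):
    """Return 'used', 'theoretical', 'absent', 'unknown' or 'empty' for the Number value."""
    value = parse_feats(feats).get("Number")
    if value is None:
        return "empty"
    if value in USED:
        return "used"
    if value in THEORETICAL:
        return "theoretical"
    if value in ABSENT:
        return "absent"
    return "unknown"


print(classify_number("Case=Nom|Gender=Masc|Number=Sing"))
```

A check like this would flag a token annotated with, say, `Number=Pauc` as carrying a value that the page lists as not present in Latvian.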
Treebank Statistics (UD_Latvian)
This feature is universal.
It occurs with 4 different values: Coll, Plur, Ptan, Sing.
21478 tokens (48%) have a non-empty value of Number.
9418 types (79%) occur at least once with a non-empty value of Number.
5066 lemmas (73%) occur at least once with a non-empty value of Number.
The feature is used with 7 part-of-speech tags: lv-pos/NOUN (11520; 26% instances), lv-pos/PRON (2553; 6% instances), lv-pos/VERB (2317; 5% instances), lv-pos/ADJ (2244; 5% instances), lv-pos/PROPN (1516; 3% instances), lv-pos/DET (1010; 2% instances), lv-pos/NUM (318; 1% instances).
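Counts of this kind can be approximated from a CoNLL-U file with a few lines of code. The sketch below is illustrative only: `read_tokens`, `number_of` and `number_stats` are hypothetical names, the three-token sample is a made-up toy input rather than treebank data, and treating "types" as case-sensitive word forms is an assumption about how the published figures were produced.

```python
from collections import Counter


def read_tokens(conllu_text):
    """Yield (form, lemma, upos, feats) for each regular token line of CoNLL-U text."""
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        # Skip multiword-token ranges (e.g. 1-2) and empty nodes (e.g. 1.1).
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        yield cols[1], cols[2], cols[3], cols[5]


def number_of(feats):
    """Return the value of Number in a FEATS string, or None if it is absent."""
    for pair in feats.split("|"):
        if pair.startswith("Number="):
            return pair[len("Number="):]
    return None


def number_stats(tokens):
    """Count tokens, types and lemmas that occur with a non-empty Number value."""
    tokens = list(tokens)
    marked = [t for t in tokens if number_of(t[3]) is not None]

    def count_and_share(part, whole):
        return len(part), (round(100 * len(part) / len(whole)) if whole else 0)

    return {
        "tokens": count_and_share(marked, tokens),
        "types": count_and_share({t[0] for t in marked}, {t[0] for t in tokens}),
        "lemmas": count_and_share({t[1] for t in marked}, {t[1] for t in tokens}),
        "by_pos": Counter(t[2] for t in marked).most_common(),
    }


# Toy input for illustration only (not taken from UD_Latvian).
SAMPLE = "\n".join([
    "# text = gads ir liels",
    "1\tgads\tgads\tNOUN\t_\tCase=Nom|Gender=Masc|Number=Sing\t3\tnsubj\t_\t_",
    "2\tir\tbūt\tVERB\t_\tMood=Ind|Person=3|Tense=Pres|VerbForm=Fin\t3\tcop\t_\t_",
    "3\tliels\tliels\tADJ\t_\tCase=Nom|Degree=Pos|Gender=Masc|Number=Sing\t0\troot\t_\t_",
])

stats = number_stats(read_tokens(SAMPLE))
print(stats)
```

Each entry pairs a raw count with a rounded percentage, mirroring the "count (share)" layout of the statistics above.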
NOUN
11520 lv-pos/NOUN tokens (100% of all NOUN tokens) have a non-empty value of Number.
The most frequent other feature values with which NOUN and Number co-occurred: Gender=Fem (6032; 52%).
NOUN tokens may have the following values of Number:
Coll (26; 0% of non-empty Number): vidū, zelta, cilvēces, zelts, Miers, intelekta, interneta, internetu, medus, mierā
Plur (3102; 27% of non-empty Number): darbinieku, latu, nagu, skolotāju, skolēnu, acis, dalībvalstis, iestāžu, mājās, rokas
Ptan (293; 3% of non-empty Number): atkritumu, finanšu, attiecību, atkritumus, durvis, resursu, svētki, atkritumiem, atmiņas, datiem
Sing (8099; 70% of non-empty Number): gada, gadā, valsts, darba, izglītības, laikā, skaitu, pasaules, uzņēmuma, skaita
EMPTY (15): eiro, kino, Sanī, foto, Cukini, alibi, auto
Paradigm gads | Sing | Plur |
---|---|---|
Case=Acc | gadu | gadus |
Case=Dat | gadam | gadiem |
Case=Gen | gada | gadu |
Case=Loc | gadā | gados |
Case=Nom | gads | gadi |
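A paradigm table like the one above can be assembled by grouping the attested forms of one lemma by their remaining features (rows) and by Number (columns). This is a rough sketch under assumptions: `build_paradigm` is a hypothetical helper, the input tuples are a toy sample built from forms shown on this page, and using all non-Number features as the row key is a simplification (the published tables evidently show only some of them).

```python
from collections import defaultdict


def build_paradigm(tokens, lemma):
    """Group forms of `lemma` into {row_features: {number_value: sorted forms}}.

    `tokens` is an iterable of (form, lemma, feats) tuples, where feats is a
    CoNLL-U FEATS string. Tokens without a Number value are ignored.
    """
    table = defaultdict(lambda: defaultdict(set))
    for form, tok_lemma, feats in tokens:
        if tok_lemma != lemma:
            continue
        pairs = [p for p in feats.split("|") if p and p != "_"]
        number = next(
            (p.split("=", 1)[1] for p in pairs if p.startswith("Number=")), None
        )
        if number is None:
            continue
        # Everything except Number identifies the row of the table.
        row = "|".join(p for p in pairs if not p.startswith("Number="))
        table[row][number].add(form)
    return {
        row: {num: sorted(forms) for num, forms in cols.items()}
        for row, cols in sorted(table.items())
    }


# Toy sample for illustration only.
tokens = [
    ("gads", "gads", "Case=Nom|Number=Sing"),
    ("gadi", "gads", "Case=Nom|Number=Plur"),
    ("gadā", "gads", "Case=Loc|Number=Sing"),
    ("valsts", "valsts", "Case=Gen|Number=Sing"),
]
paradigm = build_paradigm(tokens, "gads")
for row, cols in paradigm.items():
    print(row, "|", ", ".join(cols.get("Sing", [])), "|", ", ".join(cols.get("Plur", [])), "|")
```

Cells stay empty when a form is simply unattested in the data, which is why corpus-derived paradigms such as those on this page can have gaps.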
PRON
2553 lv-pos/PRON tokens (84% of all PRON tokens) have a non-empty value of Number.
The most frequent other feature values with which PRON and Number co-occurred: PronType=Prs (1492; 58%), Case=Nom (1442; 56%).
PRON tokens may have the following values of Number:
Plur (499; 20% of non-empty Number): mēs, viņi, mums, tās, mūsu, jūs, tiem, tie, abi, visi
Sing (2054; 80% of non-empty Number): es, viņa, to, tas, viņš, man, tu, tā, viņu, mani
EMPTY (482): kas, ko, sevi, sev, kam, sevis, kā, sevī, nekā, viskautko
Paradigm tas | Sing | Plur |
---|---|---|
Case=Acc|Person=3 | to | |
Case=Acc | to | tos |
Case=Dat|Person=3 | tam | |
Case=Dat | tam | tiem |
Case=Gen|Person=3 | tā | |
Case=Gen | tā | to |
Case=Loc | tajā, tai | tajos |
Case=Nom|Person=3 | tas | |
Case=Nom | tas | tie |
VERB
2317 lv-pos/VERB tokens (32% of all VERB tokens) have a non-empty value of Number.
The most frequent other feature values with which VERB and Number co-occurred: Reflex=EMPTY (2092; 90%), Evident=EMPTY (1468; 63%), Tense=Past (1452; 63%), Polarity=EMPTY (1394; 60%), Mood=EMPTY (1394; 60%), Person=EMPTY (1394; 60%), VerbForm=Part (1382; 60%), Degree=Pos (1381; 60%), Voice=EMPTY (1177; 51%), Aspect=Perf (1176; 51%).
VERB tokens may have the following values of Number:
Plur (665; 29% of non-empty Number): esam, zinām, bijām, redzam, runājam, skatāmies, varam, dzīvojam, pieņemti, atvērsim
Sing (1652; 71% of non-empty Number): esmu, esi, bijis, biju, teicu, neesmu, saistīts, domāju, cēlusies, gribu
EMPTY (4882): ir, bija, nav, var, būs, nebija, varētu, būtu, tiek, būt
Paradigm būt | Sing | Plur |
---|---|---|
Aspect=Imp|Case=Gen|Definite=Def|Degree=Pos|Gender=Masc|Tense=Pres|VerbForm=Part|Voice=Pass | esošo | |
Aspect=Imp|Case=Loc|Definite=Ind|Degree=Pos|Gender=Fem|Tense=Pres|VerbForm=Part|Voice=Pass | esošās | |
Aspect=Imp|Case=Nom|Definite=Def|Degree=Pos|Gender=Masc|Tense=Pres|VerbForm=Part|Voice=Pass | esošais | |
Aspect=Imp|Case=Nom|Definite=Def|Degree=Pos|Gender=Fem|Tense=Pres|VerbForm=Part|Voice=Pass | esošā | |
Aspect=Perf|Case=Acc|Definite=Def|Degree=Pos|Gender=Masc|Tense=Past|VerbForm=Part | bijušo | |
Aspect=Perf|Case=Dat|Definite=Def|Degree=Pos|Gender=Fem|Tense=Past|VerbForm=Part | bijušajām | |
Aspect=Perf|Case=Nom|Definite=Ind|Degree=Pos|Gender=Masc|Tense=Past|VerbForm=Part | bijis | bijuši |
Aspect=Perf|Case=Nom|Definite=Ind|Degree=Pos|Gender=Fem|Tense=Past|VerbForm=Part | bijusi | bijušas |
Case=Nom|Definite=Ind|Gender=Masc|VerbForm=Conv|Voice=Pass | būdami | |
Evident=Fh,Nfh|Mood=Ind|Person=1|Polarity=Neg|Tense=Pres|VerbForm=Fin|Voice=Act | neesam | |
Evident=Fh,Nfh|Mood=Ind|Person=1|Polarity=Pos|Tense=Fut|VerbForm=Fin|Voice=Act | būšu | |
Evident=Fh,Nfh|Mood=Ind|Person=1|Polarity=Pos|Tense=Past|VerbForm=Fin|Voice=Act | biju | bijām |
Evident=Fh,Nfh|Mood=Ind|Person=1|Polarity=Pos|Tense=Pres|VerbForm=Fin|Voice=Act | esmu | esam |
Evident=Fh,Nfh|Mood=Ind|Person=2|Polarity=Pos|Tense=Fut|VerbForm=Fin|Voice=Act | būsi | |
Evident=Fh,Nfh|Mood=Ind|Person=2|Polarity=Pos|Tense=Pres|VerbForm=Fin|Voice=Act | esi | esat |
ADJ
2244 lv-pos/ADJ tokens (87% of all ADJ tokens) have a non-empty value of Number.
The most frequent other feature values with which ADJ and Number co-occurred: NumType=EMPTY (2174; 97%), Degree=Pos (2047; 91%), Gender=Masc (1178; 52%).
ADJ tokens may have the following values of Number:
Plur (724; 32% of non-empty Number): pedagoģisko, sabiedrisko, dažādu, papildu, lielas, pāris, nepieciešamo, dažādas, dažādi, ekonomisko
Sing (1520; 68% of non-empty Number): nacionālās, iespējams, liela, liels, nepieciešams, lielā, jaunu, lielu, otrās, vispārējās
EMPTY (325): 2012., 1., 2., 3., 2010., 2011., 2007., 4., 2013., 7.
Paradigm liels | Sing | Plur |
---|---|---|
Case=Acc|Degree=Pos | lielu, lielo | lielus, lielos |
Case=Acc|Degree=Cmp | lielāku, Lielāko | |
Case=Dat|Degree=Pos | lielajam | lieliem, lielajiem |
Case=Dat|Degree=Cmp | lielākajiem | |
Case=Gen|Degree=Pos | lielā, liela | lielu |
Case=Gen|Degree=Cmp | lielāku | |
Case=Loc|Degree=Pos | lielā | lielos |
Case=Loc|Degree=Cmp | lielākā | |
Case=Nom|Degree=Pos | liels, lielais | |
Case=Nom|Degree=Cmp | lielākais, lielāks | lielākie |
PROPN
1516 lv-pos/PROPN tokens (84% of all PROPN tokens) have a non-empty value of Number.
The most frequent other feature values with which PROPN and Number co-occurred: Abbr=EMPTY (1516; 100%), Gender=Fem (859; 57%).
PROPN tokens may have the following values of Number:
Plur (12; 1% of non-empty Number): Bondaru, Knāgi, Kopienu, Livanoviču, Luīzi, Rietumu-Austrumu, Tartu, frančiem, ilgos, miljonu
Ptan (36; 2% of non-empty Number): Ādažu, Pļaviņu, Ziemassvētku, Ķemeros, Allažu, Brocēnos, Bulduri, Dzintari, Jurģus, Jāņiem
Sing (1468; 97% of non-empty Number): Latvijas, Sofija, Eiropas, Latvijā, Andris, Rīgas, Jelgavas, Vilks, Pillar, Sanī
EMPTY (286): SIA, ZAAO, IKP, UNESCO, DUS, LETA, ST, EEK, ES, AS
Paradigm Bondars | Sing | Plur |
---|---|---|
Case=Acc | Bondaru | |
Case=Gen | Bondaru | |
Case=Nom | Bondars, BONDARS | |
Number seems to be a lexical feature of PROPN. 99% of lemmas (524) occur with only one value of Number.
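The "lexical feature" observation rests on a simple count: for each lemma, collect the set of Number values it occurs with, then see how many lemmas have exactly one. A minimal sketch of that count follows; `single_value_lemma_share` is a hypothetical name, the input pairs are a toy sample, and the exact procedure behind the published percentage is not stated on this page.

```python
from collections import defaultdict


def single_value_lemma_share(pairs):
    """Return (lemmas_with_one_value, all_lemmas) for (lemma, number_value) pairs.

    Only tokens with a non-empty Number value should be passed in.
    """
    values = defaultdict(set)
    for lemma, number in pairs:
        values[lemma].add(number)
    single = sum(1 for seen in values.values() if len(seen) == 1)
    return single, len(values)


# Toy sample for illustration only.
pairs = [
    ("Latvija", "Sing"),
    ("Latvija", "Sing"),
    ("Ādaži", "Ptan"),
    ("Bondars", "Sing"),
    ("Bondars", "Plur"),
]
single, total = single_value_lemma_share(pairs)
print(f"{single} of {total} lemmas occur with only one value of Number")
```

A share close to 100% suggests that the value is fixed per lemma rather than inflected, which is the sense in which the feature is called lexical here.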
DET
1010 lv-pos/DET tokens (100% of all DET tokens) have a non-empty value of Number.
The most frequent other feature values with which DET and Number co-occurred: Poss=EMPTY (815; 81%), Gender=Masc (535; 53%).
DET tokens may have the following values of Number:
Plur (316; 31% of non-empty Number): to, visas, savas, visiem, kuru, vairāki, visus, šīs, daudzas, daudzi
Sing (694; 69% of non-empty Number): savu, šo, šī, tā, kādu, šīs, šajā, kāda, savā, mans
EMPTY (1): kā
Paradigm šī | Sing | Plur |
---|---|---|
Case=Acc|Person=3 | šo | |
Case=Acc | šo | šīs |
Case=Dat | šai | šīm |
Case=Gen|Person=3 | šīs | |
Case=Gen | šīs | šo |
Case=Loc|Person=3 | šai | |
Case=Loc | šajā, šai | šajās |
Case=Nom | šī | šīs |
NUM
318 lv-pos/NUM tokens (50% of all NUM tokens) have a non-empty value of Number.
The most frequent other feature values with which NUM and Number co-occurred: NumType=Card (317; 100%).
NUM tokens may have the following values of Number:
Plur (171; 54% of non-empty Number): trīs, divas, desmit, divi, divus, divām, tūkstošiem, piecdesmit, divdesmit, piecos
Sing (147; 46% of non-empty Number): viens, vienu, viena, vienā, otra, vienam, otru, vienai, otras, vienas
EMPTY (322): 25, 3, 2, 2007, 4, 80, 987, 5, 20, 50
Paradigm otra | Sing | Plur |
---|---|---|
Case=Acc | otru | |
Case=Dat | otrai | otrām |
Case=Gen | otras, otrās | |
Case=Loc | otrā | |
Case=Nom | otra | |
Number seems to be a lexical feature of NUM. 97% of lemmas (33) occur with only one value of Number.
Relations with Agreement in Number
The 10 most frequent relations where parent and child node agree in Number:
NOUN –[nmod]–> NOUN (1896; 57%),
NOUN –[amod]–> ADJ (1548; 83%),
NOUN –[det]–> DET (925; 96%),
NOUN –[conj]–> NOUN (519; 74%),
NOUN –[amod]–> VERB (389; 93%),
NOUN –[nmod]–> PROPN (367; 67%),
PROPN –[flat:name]–> PROPN (230; 98%),
NOUN –[nummod]–> NUM (213; 65%),
NOUN –[acl]–> NOUN (199; 64%),
VERB –[nsubj:pass]–> NOUN (177; 96%).
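Agreement counts of this kind can be sketched by walking every dependency edge and comparing the Number values of the parent and child. The code below is an illustration under assumptions: `agreement_counts` and the token dictionaries are hypothetical, the two toy sentences are built from word forms listed on this page, and the denominator used here (edges where both nodes carry Number) may differ from the one behind the published percentages.

```python
from collections import Counter


def number_of(feats):
    """Return the value of Number in a FEATS string, or None if it is absent."""
    for pair in feats.split("|"):
        if pair.startswith("Number="):
            return pair[len("Number="):]
    return None


def agreement_counts(sentences):
    """Count edges (parent UPOS, deprel, child UPOS) that agree in Number.

    Each sentence is a list of dicts with keys id, head, upos, deprel, feats.
    Returns (agree, total), both keyed by the edge triple; `total` covers only
    edges where parent and child both have a Number value.
    """
    agree, total = Counter(), Counter()
    for sent in sentences:
        by_id = {tok["id"]: tok for tok in sent}
        for child in sent:
            parent = by_id.get(child["head"])
            if parent is None:  # the root has head 0 and no parent token
                continue
            p_num = number_of(parent["feats"])
            c_num = number_of(child["feats"])
            if p_num is None or c_num is None:
                continue
            key = (parent["upos"], child["deprel"], child["upos"])
            total[key] += 1
            if p_num == c_num:
                agree[key] += 1
    return agree, total


# Toy sentences for illustration only.
sentences = [
    [  # "lielos gados": adjective and noun agree (both Plur)
        {"id": 1, "head": 2, "upos": "ADJ", "deprel": "amod", "feats": "Case=Loc|Number=Plur"},
        {"id": 2, "head": 0, "upos": "NOUN", "deprel": "root", "feats": "Case=Loc|Number=Plur"},
    ],
    [  # "darbinieku skaitu": Plur genitive modifier of a Sing noun
        {"id": 1, "head": 2, "upos": "NOUN", "deprel": "nmod", "feats": "Case=Gen|Number=Plur"},
        {"id": 2, "head": 0, "upos": "NOUN", "deprel": "root", "feats": "Case=Acc|Number=Sing"},
    ],
]
agree, total = agreement_counts(sentences)
for key, n in total.items():
    print(key, agree[key], "of", n)
```

The second toy sentence shows why a relation such as nmod agrees far less often than amod or det: a genitive modifier keeps its own number regardless of the head noun.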