DET

home sl/pos edit page issue tracker

This page still pertains to UD version 1.

`DET`: determiner

Definition

Determiners are words that modify nouns or noun phrases and express the reference of the noun phrase in context. That is, a determiner may indicate whether the noun is referring to a definite or indefinite element of a class, to a closer or more distant element, to an element belonging to a specified person or thing, to a particular number or quantity, etc.

The traditional grammar of Slovenian does not define determiners as a separate word class. Instead, words that perform the syntactic function of determiners are either categorizied as adverbs (nekaj “some”, veliko “a lot of”, dovolj “enough of” etc.) or pronouns (ta “this”, ves “all”, moj “my”, vsak “each” etc.), regardless of whether they are used as attributives (To.DET besedilo je nerazumljivo. “This text is incomprehensible.”) or substantives (To.PRON sem že slišal. “I have heard this before.”).

Conversion from JOS

Since JOS morphosyntactic specifications do not distinguish substantive and attributive pronouns or quantifying and other adverbs, the conversion is done based on syntactic information. The pronouns modifying a noun are thus marked as DET, otherwise they are marked as PRON. Similarly, the list of adverbs modifying a noun was manually validated to define a closed set of quantifying adverbs marked as DET.

Examples

njegov “his”, njen “her”, naš “our”, njihov “their”, _moj “my”, _vaš “your” etc. (JOS possessive pronouns)
ta “this”, tisti “that”, takšen “such”, tak “such”, _tolikšen “so big” etc. (JOS demonstrative pronouns)
ves “all”, vsak “each”, oba “both”, vsakršen “any” (JOS general pronouns)
svoj “one’s own” (JOS reflexive pronouns)
nekateri “some”, nek “some kind”, isti “identical”, enak “same”, mnog “many” (JOS indefinite pronouns)
kakšen “what kind”, kateri “what type”, čigav “whose” (JOS interrogative pronouns)
noben “no one”, nikakršen “no kind”, nič “nothing” (JOS negative pronouns)
kakršenkoli “any kind of”, katerikoli “any type of”, čigar “whose” (JOS relative pronouns)
nekaj “some”, več “more”, veliko “a lot of”, dovolj “enough of”, pol “half of”, malo “little of” (JOS adverbs)

Treebank Statistics (UD_Slovenian)

There are 63 DET lemmas (0%), 312 DET types (1%) and 4711 DET tokens (4%). Out of 16 observed tags, the rank of DET is: 8 in number of lemmas, 7 in number of types and 10 in number of tokens.

The 10 most frequent DET lemmas: ta, ves, svoj, kateri, njegov, nekaj, več, vsak, naš, njen

The 10 most frequent DET types: to, tem, vse, nekaj, ta, več, tega, svoje, te, veliko

The 10 most frequent ambiguous lemmas: ves (DET 439, ADV 1), nekaj (DET 171, PRON 60), več (DET 158, PART 74), pol (DET 25, NOUN 1), par (NOUN 17, DET 4), četrt (DET 2, NOUN 1)

The 10 most frequent ambiguous types: tem (DET 244, NOUN 1, ADV 1), vse (DET 148, ADV 34), nekaj (DET 152, PRON 53), več (DET 152, PART 74), te (DET 77, PRON 14), veliko (DET 81, ADJ 34), malo (DET 42, ADJ 5), pol (DET 25, ADV 3), ti (PRON 14, DET 14), neki (DET 11, PRON 1)

tem
- DET 244: S tem nikakor ni zmanjšan pomen enotne volje državljanov Slovenije .
- NOUN 1: Dijaki v njem prikažejo svoje poznavanje obravnavanih tem iz književnosti .
- ADV 1: Čim boljše bo v Sloveniji znanje tujih jezikov , tem bolje se bo Slovenija sporazumevala s svetom .
vse
- DET 148: Na vse prireditve je vstop prost .
- ADV 34: Slišim namreč vse več glasov o nepravilnostih .
nekaj
- DET 152: Simpatična uradna stran z nekaj prav zanimivimi rubrikami .
- PRON 53: A nekaj v meni mi ni dovolilo , da bi zaploskala .
več
- DET 152: Za zadovoljitev pomembne želje so pripravljeni vložiti več truda .
- PART 74: Tragika te ženske : na koncu ji noben zdravnik ni več verjel .
te
- DET 77: Če te ne bi bilo , ne bi pomagal niti izredno ugoden splet okoliščin .
- PRON 14: Ne , če te hočejo pokončati , te bodo našli kjerkoli .
veliko
- DET 81: Ali ima center veliko dela spričo tako hudih medvrstniških obračunavanj ?
- ADJ 34: Na zadnji hrbtni bodici ima veliko črno piko .
malo
- DET 42: Potem bi imela lep vzrok , da bi šla malo mižat pod tisti balvan .
- ADJ 5: Vse bolj pogumno pa se malo gospodarstvo razvija tudi na področju negospodarstva in prav tu je slutiti nadaljnji razvoj .
pol
- DET 25: V zadnjih petih urah sva se premaknila za slabe pol milje .
- ADV 3: — Ma , morš izpast totalno navdušen , sam pol pa vseen zajebat .
ti
- PRON 14: » Oho , ti si pa poln keša ! « ga je občudujoče pogledala Karmen .
- DET 14: V zadnjih 20 letih so se ti cilji stalno spreminjali .
neki
- DET 11: Na to me je opozoril neki nadarjen avstralski filozof .
- PRON 1: Tolk me imobilizira , da sam sedim v fotelju pa mi gre program na televiziji ful na kurac , sam se mi zdi , ko da seu neki groznga zgodl , če ga probam prešaltat .

Morphology

The form / lemma ratio of DET is 4.952381 (the average of all parts of speech is 1.870691).

The 1st highest number of forms (12) was observed with the lemma “svoj”: svoj, svoja, svoje, svojega, svojem, svojemu, svoji, svojih, svojim, svojimi, svojmu, svojo.

The 2nd highest number of forms (12) was observed with the lemma “tisti”: tist, tista, tiste, tistega, tistem, tistemu, tisti, tistih, tistim, tistimi, tistmu, tisto.

The 3rd highest number of forms (11) was observed with the lemma “kakšen”: kakšen, kakšenmu, kakšna, kakšne, kakšnega, kakšnem, kakšni, kakšnih, kakšnim, kakšnimi, kakšno.

DET occurs with 9 features: sl-feat/PronType (4711; 100% instances), sl-feat/Case (3994; 85% instances), sl-feat/Gender (3994; 85% instances), sl-feat/Number (3994; 85% instances), sl-feat/Poss (1161; 25% instances), sl-feat/Number[psor] (733; 16% instances), sl-feat/Person (732; 16% instances), sl-feat/Reflex (429; 9% instances), sl-feat/Gender[psor] (336; 7% instances)

DET occurs with 30 feature-value pairs: Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Gender=Fem, Gender=Masc, Gender=Neut, Gender[psor]=Fem, Gender[psor]=Masc, Gender[psor]=Neut, Number=Dual, Number=Plur, Number=Sing, Number[psor]=Dual, Number[psor]=Plur, Number[psor]=Sing, Person=1, Person=2, Person=3, Poss=Yes, PronType=Dem, PronType=Ind, PronType=Int, PronType=Neg, PronType=Prs, PronType=Rel, PronType=Tot, Reflex=Yes

DET occurs with 408 feature combinations. The most frequent feature combination is PronType=Ind (666 tokens). Examples: nekaj, več, veliko, manj, dovolj, malo, pol, preveč, največ, nekatere

Relations

DET nodes are attached to their parents using 16 different relations: sl-dep/det (2965; 63% instances), sl-dep/obl (544; 12% instances), sl-dep/nsubj (459; 10% instances), sl-dep/advmod (319; 7% instances), sl-dep/obj (180; 4% instances), sl-dep/nmod (142; 3% instances), sl-dep/conj (32; 1% instances), sl-dep/root (27; 1% instances), sl-dep/iobj (11; 0% instances), sl-dep/acl (7; 0% instances), sl-dep/ccomp (6; 0% instances), sl-dep/fixed (6; 0% instances), sl-dep/advcl (5; 0% instances), sl-dep/xcomp (5; 0% instances), sl-dep/csubj (2; 0% instances), sl-dep/parataxis (1; 0% instances)

Parents of DET nodes belong to 11 different parts of speech: NOUN (3180; 68% instances), VERB (1093; 23% instances), ADJ (252; 5% instances), DET (48; 1% instances), ADV (32; 1% instances), ROOT (27; 1% instances), PRON (26; 1% instances), PROPN (25; 1% instances), NUM (23; 0% instances), ADP (4; 0% instances), X (1; 0% instances)

3766 (80%) DET nodes are leaves.

752 (16%) DET nodes have one child.

128 (3%) DET nodes have two children.

65 (1%) DET nodes have three or more children.

The highest child degree of a DET node is 7.

Children of DET nodes are attached using 21 different relations: sl-dep/case (575; 45% instances), sl-dep/acl (184; 14% instances), sl-dep/advmod (156; 12% instances), sl-dep/nmod (62; 5% instances), sl-dep/punct (62; 5% instances), sl-dep/fixed (52; 4% instances), sl-dep/cop (49; 4% instances), sl-dep/nsubj (37; 3% instances), sl-dep/cc (32; 3% instances), sl-dep/obl (28; 2% instances), sl-dep/conj (13; 1% instances), sl-dep/mark (10; 1% instances), sl-dep/aux (6; 0% instances), sl-dep/csubj (3; 0% instances), sl-dep/obj (3; 0% instances), sl-dep/parataxis (3; 0% instances), sl-dep/advcl (1; 0% instances), sl-dep/cc:preconj (1; 0% instances), sl-dep/discourse (1; 0% instances), sl-dep/expl (1; 0% instances), sl-dep/nummod (1; 0% instances)

Children of DET nodes belong to 15 different parts of speech: ADP (570; 45% instances), VERB (168; 13% instances), ADV (97; 8% instances), NOUN (90; 7% instances), SCONJ (65; 5% instances), PUNCT (62; 5% instances), AUX (55; 4% instances), DET (48; 4% instances), PART (47; 4% instances), CCONJ (36; 3% instances), ADJ (25; 2% instances), PRON (11; 1% instances), PROPN (4; 0% instances), NUM (1; 0% instances), X (1; 0% instances)

Treebank Statistics (UD_Slovenian-SST)

There are 47 DET lemmas (1%), 169 DET types (4%) and 1204 DET tokens (6%). Out of 16 observed tags, the rank of DET is: 8 in number of lemmas, 6 in number of types and 6 in number of tokens.

The 10 most frequent DET lemmas: ta, ves, tisti, nekaj, malo, kakšen, naš, nič, nek, kateri

The 10 most frequent DET types: to, ta, nekaj, vse, tem, malo, nič, tega, tisto, te

The 10 most frequent ambiguous lemmas: nič (DET 32, ADV 16), več (DET 22, PART 11), pol (ADV 78, DET 8)

The 10 most frequent ambiguous types: to (DET 350, X 1), vse (DET 37, ADV 4), malo (DET 35, ADJ 3), nič (DET 32, ADV 16), te (DET 23, PRON 10, ADV 9), več (DET 22, PART 11), ti (PRON 68, DET 11, INTJ 1, X 1), pol (ADV 78, DET 8), oni (DET 7, PRON 6), tako (ADV 155, CCONJ 37, DET 7)

to
- DET 350: zdaj mogoče to ni treba vsepovsod
- X 1: mi smo kar eee kolega ni povedal v letošnjem letu intenzivirali obnavljanje in eee eee gradnjo na progi in pri tem opravili s [gap] to [gap] opisano povečanje tovornega prometa navkljub
vse
- DET 37: vsakič znova je koristno ker vsakič je vse znova ne
- ADV 4: [gap] tole so vse karavanke in tukaj je en sicer ne to je že pohorje zdaj pa [gap]
malo
- DET 35: in spodaj so vsi komentini malo istrijani malo po hrvaško
- ADJ 3: ker nama je bilo ful všeč tako malo mestece tako mir kar pa [gap]
nič
- DET 32: pač en tak oblačen četrtek je pred nami nič hudega
- ADV 16: kaj da ne more nič ?
te
- DET 23: mhm … ja vse je do te višine a ne
- PRON 10: [gap] iščem pa [gap] ne te najdem [audience:laughter]
- ADV 9: eee to bi te bilo vse ven pobrati samo jaz sem to ne imela toliko časa
več
- DET 22: dosti je več tako ni ne videti
- PART 11: ne ne bom več [all:laughter] ampak ne ful mi je dober
ti
- PRON 68: ko bo prišel domov pa ti bo stopnice pokozlal
- DET 11: samo ti pa niso v oplotnici doma ti so jo v zrečah
- INTJ 1: pa če noče zaspati ko je še mala pa ji špilaš pred posteljico veš [audience:laughter] tako delaš ti di di pa še igraš
- X 1: in se smejem kot matasta tipo ki že costi ti ga bi bujeri ser [all:laughter]
pol
- ADV 78: pa pa so ga ljudje zajebavali evo pol vam je pa dal kajlo
- DET 8: potem mi pa potem mi boste pa pripravili na strani in pol vaš komentar
oni
- DET 7: aja oni je bil tu v zrečah nekaj pri eni teti ali kako
- PRON 6: ker oni naprej svoje terajo denar držijo
tako
- ADV 155: na vrhu je tako kot si rekla en šef lahko sta tudi dva
- CCONJ 37: tako da pazi
- DET 7: o tako obleko bi jaz imela

Morphology

The form / lemma ratio of DET is 3.595745 (the average of all parts of speech is 1.494596).

The 1st highest number of forms (10) was observed with the lemma “ta”: ta, te, tega, teh, tej, tem, temi, temu, ti, to.

The 2nd highest number of forms (9) was observed with the lemma “kateri”: katera, katere, katerega, katerem, kateri, katerih, katerim, katerimi, katero.

The 3rd highest number of forms (9) was observed with the lemma “oni”: ona, one, onega, onemu, oni, onih, onim, onimi, ono.

DET occurs with 8 features: sl-feat/PronType (1204; 100% instances), sl-feat/Case (1051; 87% instances), sl-feat/Gender (1051; 87% instances), sl-feat/Number (1051; 87% instances), sl-feat/Number[psor] (86; 7% instances), sl-feat/Person (86; 7% instances), sl-feat/Poss (86; 7% instances), sl-feat/Gender[psor] (6; 0% instances)

DET occurs with 28 feature-value pairs: Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Gender=Fem, Gender=Masc, Gender=Neut, Gender[psor]=Fem, Gender[psor]=Masc, Number=Dual, Number=Plur, Number=Sing, Number[psor]=Dual, Number[psor]=Plur, Number[psor]=Sing, Person=1, Person=2, Person=3, Poss=Yes, PronType=Dem, PronType=Ind, PronType=Int, PronType=Neg, PronType=Prs, PronType=Rel, PronType=Tot

DET occurs with 172 feature combinations. The most frequent feature combination is Case=Nom|Gender=Neut|Number=Sing|PronType=Dem (257 tokens). Examples: to, tisto, tako, tole, ono, ovo

Relations

DET nodes are attached to their parents using 23 different relations: sl-dep/det (477; 40% instances), sl-dep/nsubj (224; 19% instances), sl-dep/obj (134; 11% instances), sl-dep/advmod (104; 9% instances), sl-dep/obl (60; 5% instances), sl-dep/root (47; 4% instances), sl-dep/expl (28; 2% instances), sl-dep/reparandum (26; 2% instances), sl-dep/conj (22; 2% instances), sl-dep/nmod (18; 1% instances), sl-dep/conj:extend (15; 1% instances), sl-dep/parataxis (13; 1% instances), sl-dep/flat (8; 1% instances), sl-dep/ccomp (6; 0% instances), sl-dep/dislocated (5; 0% instances), sl-dep/acl (4; 0% instances), sl-dep/fixed (4; 0% instances), sl-dep/orphan (3; 0% instances), sl-dep/iobj (2; 0% instances), sl-dep/advcl (1; 0% instances), sl-dep/appos (1; 0% instances), sl-dep/mark (1; 0% instances), sl-dep/parataxis:restart (1; 0% instances)

Parents of DET nodes belong to 14 different parts of speech: NOUN (531; 44% instances), VERB (411; 34% instances), ADJ (77; 6% instances), DET (54; 4% instances), ROOT (47; 4% instances), PROPN (21; 2% instances), NUM (19; 2% instances), PRON (16; 1% instances), ADV (14; 1% instances), X (6; 0% instances), AUX (3; 0% instances), ADP (2; 0% instances), PART (2; 0% instances), CCONJ (1; 0% instances)

937 (78%) DET nodes are leaves.

153 (13%) DET nodes have one child.

58 (5%) DET nodes have two children.

56 (5%) DET nodes have three or more children.

The highest child degree of a DET node is 9.

Children of DET nodes are attached using 31 different relations: sl-dep/case (76; 14% instances), sl-dep/advmod (70; 13% instances), sl-dep/acl (54; 10% instances), sl-dep/cc (45; 9% instances), sl-dep/cop (36; 7% instances), sl-dep/discourse (34; 6% instances), sl-dep/reparandum (33; 6% instances), sl-dep/nsubj (32; 6% instances), sl-dep/punct (25; 5% instances), sl-dep/nmod (18; 3% instances), sl-dep/parataxis (16; 3% instances), sl-dep/conj (12; 2% instances), sl-dep/discourse:filler (11; 2% instances), sl-dep/amod (9; 2% instances), sl-dep/fixed (8; 2% instances), sl-dep/advcl (6; 1% instances), sl-dep/det (5; 1% instances), sl-dep/flat (5; 1% instances), sl-dep/obl (5; 1% instances), sl-dep/mark (4; 1% instances), sl-dep/obj (3; 1% instances), sl-dep/orphan (3; 1% instances), sl-dep/parataxis:discourse (3; 1% instances), sl-dep/appos (2; 0% instances), sl-dep/dislocated (2; 0% instances), sl-dep/nummod (2; 0% instances), sl-dep/vocative (2; 0% instances), sl-dep/aux (1; 0% instances), sl-dep/conj:extend (1; 0% instances), sl-dep/goeswith (1; 0% instances), sl-dep/parataxis:restart (1; 0% instances)

Children of DET nodes belong to 16 different parts of speech: ADP (67; 13% instances), ADV (61; 12% instances), VERB (61; 12% instances), DET (54; 10% instances), CCONJ (52; 10% instances), PART (41; 8% instances), AUX (37; 7% instances), NOUN (33; 6% instances), X (31; 6% instances), ADJ (24; 5% instances), INTJ (15; 3% instances), SCONJ (13; 2% instances), PRON (12; 2% instances), PUNCT (12; 2% instances), NUM (7; 1% instances), PROPN (5; 1% instances)

DET in other languages: [am] [ar] [bg] [bxr] [ca] [ckb] [cop] [cs] [cu] [da] [de] [el] [en] [es] [et] [eu] [fa] [fi] [fo] [fr] [ga] [gl] [got] [grc] [he] [hi] [hr] [hu] [id] [it] [ja] [kk] [kmr] [ko] [la] [lv] [mr] [nl] [no] [pl] [pt] [ro] [ru] [sa] [sk] [sla] [sl] [so] [sr] [sv] [swl] [ta] [tr] [ug] [uk] [u] [urj] [ur] [vi] [yue] [zh]

DET: determiner

Definition

Conversion from JOS

Examples

Treebank Statistics (UD_Slovenian)

Morphology

Relations

Treebank Statistics (UD_Slovenian-SST)

Morphology

Relations

`DET`: determiner