home no/pos edit page issue tracker

This page still pertains to UD version 1.

DET: determiner

####Definition Determiners are words that modify nouns or noun phrases and express the reference of the noun phrase in context. In Norwegian, most determiners agree with the nominal head in terms of gender and number, e.g. min bil “my car”, mitt barn “my child”, mine barn “my children”. We distinguish three main types of determiners: possessive, demonstrative and quantifying.

Possessive

Possessive determiners agree in gender and number with the noun the modify, their form varies depending on person. In Norwegian, possessive determiners usually precede their head noun, but may occur after the head noun when the noun is in the definite form, e.g. bilen hans “his car”

Demonstrative

Demonstrative determiners agree in gender and number with the noun the modify. This group of determiners include the interrogative hvilken “which”. The demonstratives may only precede the noun they modify.

Quantifying

The quantifying pronouns are a heterogenous group of determiners which all occur before the noun they modify. Some of these agree with their nominal head (like articles and indefinite determiners) and some do not.

####References Jan Terje Faarlund, Svein Lie and Kjell Ivar Vannebo. 1997. “Norsk referansegrammatikk”. Universitetsforlaget, Oslo, Norway.


Treebank Statistics (UD_Norwegian-Bokmaal)

There are 46 DET lemmas (0%), 84 DET types (0%) and 15013 DET tokens (5%). Out of 17 observed tags, the rank of DET is: 9 in number of lemmas, 9 in number of types and 9 in number of tokens.

The 10 most frequent DET lemmas: en, den, de, det, sin, annen, noen, all, denne, min

The 10 most frequent DET types: en, et, den, de, det, andre, sin, alle, denne, noen

The 10 most frequent ambiguous lemmas: en (DET 5621, PRON 71, X 1), den (DET 1368, PRON 408), de (PRON 1489, DET 1233, PROPN 11, X 5, ADV 1), det (PRON 4907, DET 1008, X 2), noen (DET 489, PRON 98), all (DET 428, X 2), denne (DET 316, PRON 16), min (DET 307, X 1), vår (DET 277, NOUN 27), selv (ADV 274, DET 186)

The 10 most frequent ambiguous types: en (DET 3585, PRON 63, ADP 5, SCONJ 1, X 1), et (DET 1619, PRON 1), den (DET 1162, PRON 332), de (DET 1072, PRON 960, PROPN 11, X 5, ADV 1), det (PRON 3400, DET 845, X 2), andre (DET 435, ADJ 48), alle (DET 301, PRON 126), denne (DET 272, PRON 15), noen (DET 260, PRON 79), noe (PRON 283, DET 201)

Morphology

The form / lemma ratio of DET is 1.826087 (the average of all parts of speech is 1.383513).

The 1st highest number of forms (7) was observed with the lemma “en”: at, ei, en, ens, er, et, ett.

The 2nd highest number of forms (5) was observed with the lemma “annen”: andre, andres, annen, annens, annet.

The 3rd highest number of forms (4) was observed with the lemma “all”: all, alle, alles, alt.

DET occurs with 7 features: no-feat/PronType (15013; 100% instances), no-feat/Number (14490; 97% instances), no-feat/Gender (11270; 75% instances), no-feat/Poss (1938; 13% instances), no-feat/Definite (812; 5% instances), no-feat/Polarity (178; 1% instances), no-feat/Case (44; 0% instances)

DET occurs with 19 feature-value pairs: Case=Gen, Definite=Def, Definite=Ind, Gender=Fem, Gender=Masc, Gender=Neut, Number=Plur, Number=Sing, Polarity=Neg, Poss=Yes, PronType=Art, PronType=Dem, PronType=Dem,Ind, PronType=Ind, PronType=Int, PronType=Neg, PronType=Prs, PronType=Rcp, PronType=Tot

DET occurs with 49 feature combinations. The most frequent feature combination is Gender=Masc|Number=Sing|PronType=Art (3838 tokens). Examples: en

Relations

DET nodes are attached to their parents using 21 different relations: no-dep/det (12315; 82% instances), no-dep/nmod (2053; 14% instances), no-dep/obl (204; 1% instances), no-dep/nsubj (129; 1% instances), no-dep/obj (83; 1% instances), no-dep/conj (70; 0% instances), no-dep/root (68; 0% instances), no-dep/appos (17; 0% instances), no-dep/nsubj:pass (13; 0% instances), no-dep/xcomp (13; 0% instances), no-dep/flat:name (9; 0% instances), no-dep/acl:relcl (8; 0% instances), no-dep/orphan (6; 0% instances), no-dep/ccomp (5; 0% instances), no-dep/acl (4; 0% instances), no-dep/goeswith (4; 0% instances), no-dep/iobj (4; 0% instances), no-dep/dislocated (3; 0% instances), no-dep/advcl (2; 0% instances), no-dep/compound (2; 0% instances), no-dep/expl (1; 0% instances)

Parents of DET nodes belong to 11 different parts of speech: NOUN (13123; 87% instances), ADJ (746; 5% instances), VERB (374; 2% instances), PRON (240; 2% instances), PROPN (195; 1% instances), DET (184; 1% instances), NUM (71; 0% instances), ROOT (68; 0% instances), ADV (6; 0% instances), ADP (3; 0% instances), SCONJ (3; 0% instances)

14132 (94%) DET nodes are leaves.

594 (4%) DET nodes have one child.

152 (1%) DET nodes have two children.

135 (1%) DET nodes have three or more children.

The highest child degree of a DET node is 9.

Children of DET nodes are attached using 23 different relations: no-dep/case (273; 18% instances), no-dep/nmod (226; 15% instances), no-dep/advmod (160; 11% instances), no-dep/det (146; 10% instances), no-dep/punct (130; 9% instances), no-dep/obl (117; 8% instances), no-dep/cop (90; 6% instances), no-dep/nsubj (83; 6% instances), no-dep/cc (66; 4% instances), no-dep/conj (51; 3% instances), no-dep/advcl (35; 2% instances), no-dep/acl:relcl (34; 2% instances), no-dep/amod (31; 2% instances), no-dep/mark (17; 1% instances), no-dep/acl (8; 1% instances), no-dep/expl (8; 1% instances), no-dep/aux (7; 0% instances), no-dep/appos (4; 0% instances), no-dep/nummod (4; 0% instances), no-dep/csubj (2; 0% instances), no-dep/flat:name (2; 0% instances), no-dep/orphan (1; 0% instances), no-dep/parataxis (1; 0% instances)

Children of DET nodes belong to 14 different parts of speech: ADP (296; 20% instances), NOUN (261; 17% instances), DET (184; 12% instances), PUNCT (130; 9% instances), ADV (116; 8% instances), PRON (102; 7% instances), AUX (97; 6% instances), ADJ (81; 5% instances), VERB (79; 5% instances), PROPN (68; 5% instances), CCONJ (66; 4% instances), NUM (8; 1% instances), SCONJ (7; 0% instances), X (1; 0% instances)


Treebank Statistics (UD_Norwegian-Nynorsk)

There are 52 DET lemmas (0%), 87 DET types (0%) and 15147 DET tokens (5%). Out of 17 observed tags, the rank of DET is: 9 in number of lemmas, 9 in number of types and 7 in number of tokens.

The 10 most frequent DET lemmas: ein, den, dei, det, sin, annan, all, denne, nokon, eigen

The 10 most frequent DET types: ein, den, eit, dei, ei, det, andre, alle, denne, sin

The 10 most frequent ambiguous lemmas: ein (DET 5401, PRON 767), den (DET 1785, PRON 129, X 9, PROPN 1), dei (PRON 1527, DET 1367), det (PRON 5091, DET 1207, X 15, PROPN 1), all (DET 545, ADV 2, X 2), denne (DET 402, PRON 21, X 1), nokon (DET 398, PRON 70), slik (ADV 334, DET 219), vår (DET 218, NOUN 19, X 2), same (DET 204, ADV 1)

The 10 most frequent ambiguous types: ein (DET 2201, PRON 676, ADP 1), den (DET 1543, PRON 103, X 9), dei (PRON 1314, DET 1208), ei (DET 1156, ADV 2, PRON 2, NUM 1), det (PRON 3767, DET 1056, X 15, ADV 1), andre (DET 430, ADJ 54, X 1), alle (DET 387, PRON 104), denne (DET 353, PRON 17, X 1), sine (DET 214, NOUN 1), noko (PRON 258, DET 198)

Morphology

The form / lemma ratio of DET is 1.673077 (the average of all parts of speech is 1.343969).

The 1st highest number of forms (6) was observed with the lemma “ein”: ei, ein, eir, eit, eitt, en.

The 2nd highest number of forms (5) was observed with the lemma “eigen”: egen, eiga, eige, eigen, eigne.

The 3rd highest number of forms (5) was observed with the lemma “nokon”: noka, noko, nokon, nokor, nokre.

DET occurs with 7 features: no-feat/PronType (15147; 100% instances), no-feat/Number (14496; 96% instances), no-feat/Gender (11382; 75% instances), no-feat/Poss (1429; 9% instances), no-feat/Definite (943; 6% instances), no-feat/Polarity (128; 1% instances), no-feat/Case (5; 0% instances)

DET occurs with 17 feature-value pairs: Case=Gen, Definite=Def, Definite=Ind, Gender=Fem, Gender=Masc, Gender=Neut, Number=Plur, Number=Sing, Polarity=Neg, Poss=Yes, PronType=Art, PronType=Dem, PronType=Ind, PronType=Int, PronType=Neg, PronType=Prs, PronType=Tot

DET occurs with 40 feature combinations. The most frequent feature combination is Gender=Masc|Number=Sing|PronType=Art (2347 tokens). Examples: ein, en

Relations

DET nodes are attached to their parents using 23 different relations: no-dep/det (12930; 85% instances), no-dep/nmod (1482; 10% instances), no-dep/obl (252; 2% instances), no-dep/nsubj (138; 1% instances), no-dep/obj (95; 1% instances), no-dep/conj (87; 1% instances), no-dep/root (66; 0% instances), no-dep/xcomp (20; 0% instances), no-dep/appos (19; 0% instances), no-dep/flat:name (18; 0% instances), no-dep/acl (7; 0% instances), no-dep/acl:relcl (6; 0% instances), no-dep/nsubj:pass (6; 0% instances), no-dep/ccomp (4; 0% instances), no-dep/iobj (4; 0% instances), no-dep/orphan (4; 0% instances), no-dep/advcl (2; 0% instances), no-dep/csubj (2; 0% instances), no-dep/compound (1; 0% instances), no-dep/dislocated (1; 0% instances), no-dep/expl (1; 0% instances), no-dep/goeswith (1; 0% instances), no-dep/parataxis (1; 0% instances)

Parents of DET nodes belong to 12 different parts of speech: NOUN (13200; 87% instances), ADJ (839; 6% instances), VERB (396; 3% instances), PROPN (188; 1% instances), PRON (183; 1% instances), DET (179; 1% instances), NUM (74; 0% instances), ROOT (66; 0% instances), ADV (12; 0% instances), SCONJ (8; 0% instances), ADP (1; 0% instances), X (1; 0% instances)

14006 (92%) DET nodes are leaves.

841 (6%) DET nodes have one child.

164 (1%) DET nodes have two children.

136 (1%) DET nodes have three or more children.

The highest child degree of a DET node is 10.

Children of DET nodes are attached using 23 different relations: no-dep/obl (350; 20% instances), no-dep/case (287; 16% instances), no-dep/nmod (229; 13% instances), no-dep/advmod (149; 8% instances), no-dep/punct (140; 8% instances), no-dep/det (138; 8% instances), no-dep/cop (89; 5% instances), no-dep/cc (82; 5% instances), no-dep/nsubj (73; 4% instances), no-dep/conj (50; 3% instances), no-dep/advcl (35; 2% instances), no-dep/amod (33; 2% instances), no-dep/acl:relcl (31; 2% instances), no-dep/mark (24; 1% instances), no-dep/appos (11; 1% instances), no-dep/flat:name (10; 1% instances), no-dep/expl (7; 0% instances), no-dep/orphan (6; 0% instances), no-dep/acl (5; 0% instances), no-dep/aux (5; 0% instances), no-dep/csubj (3; 0% instances), no-dep/nummod (3; 0% instances), no-dep/parataxis (3; 0% instances)

Children of DET nodes belong to 13 different parts of speech: NOUN (439; 25% instances), ADP (312; 18% instances), DET (179; 10% instances), PUNCT (140; 8% instances), PROPN (135; 8% instances), ADJ (107; 6% instances), AUX (94; 5% instances), ADV (93; 5% instances), CCONJ (82; 5% instances), PRON (80; 5% instances), VERB (78; 4% instances), NUM (13; 1% instances), SCONJ (11; 1% instances)


DET in other languages: [am] [ar] [bg] [bxr] [ca] [ckb] [cop] [cs] [cu] [da] [de] [el] [en] [es] [et] [eu] [fa] [fi] [fo] [fr] [ga] [gl] [got] [grc] [he] [hi] [hr] [hu] [id] [it] [ja] [kk] [kmr] [ko] [la] [lv] [mr] [nl] [no] [pl] [pt] [ro] [ru] [sa] [sk] [sla] [sl] [so] [sr] [sv] [swl] [ta] [tr] [ug] [uk] [u] [urj] [ur] [vi] [yue] [zh]