home no/pos edit page issue tracker

This page still pertains to UD version 1.

NOUN: noun

#####Definition Nouns are a part of speech typically denoting a person, place, thing, animal or idea. The NOUN tag is used only for common nouns, see PROPN for proper nouns.

In Norwegian, nouns inflect for definiteness (bil-bilen) and usually also for number (bil - biler).

#####Examples


Treebank Statistics (UD_Norwegian-Bokmaal)

There are 11710 NOUN lemmas (51%), 16460 NOUN types (52%) and 51775 NOUN tokens (18%). Out of 17 observed tags, the rank of NOUN is: 1 in number of lemmas, 1 in number of types and 1 in number of tokens.

The 10 most frequent NOUN lemmas: år, dag, land, gang, tid, verden, kirke, barn, folk, prosent

The 10 most frequent NOUN types: år, dag, prosent, gang, folk, tid, verden, land, del, barn

The 10 most frequent ambiguous lemmas: land (NOUN 356, X 1), tid (NOUN 323, X 1), del (NOUN 229, X 2, PROPN 1), mann (NOUN 180, X 1), problem (NOUN 151, X 1), krone (NOUN 105, VERB 2), ord (NOUN 105, PROPN 1), fall (NOUN 94, X 1), by (NOUN 90, VERB 13, X 1), rett (ADJ 113, NOUN 88)

The 10 most frequent ambiguous types: tid (NOUN 184, X 1), land (NOUN 163, X 1), del (NOUN 155, X 2, PROPN 1), landet (NOUN 113, VERB 7), kroner (NOUN 101, VERB 1), fall (NOUN 87, X 1), mann (NOUN 61, PRON 1, X 1), rekke (NOUN 60, VERB 3), leder (NOUN 58, VERB 29), bruk (NOUN 55, X 1, VERB 1)

Morphology

The form / lemma ratio of NOUN is 1.405636 (the average of all parts of speech is 1.383513).

The 1st highest number of forms (9) was observed with the lemma “tid”: tid, tida, tidas, tiden, tidene, tidenes, tider, tiders, tids.

The 2nd highest number of forms (8) was observed with the lemma “kirke”: Kirkenes, kirka, kirke, kirken, kirkene, kirkens, kirker, kirkes.

The 3rd highest number of forms (7) was observed with the lemma “bedrift”: bedrift, bedriften, bedriftene, bedriftenes, bedriftens, bedrifter, bedrifters.

NOUN occurs with 5 features: no-feat/Gender (50918; 98% instances), no-feat/Number (50446; 97% instances), no-feat/Definite (50445; 97% instances), no-feat/Case (1314; 3% instances), no-feat/Abbr (150; 0% instances)

NOUN occurs with 11 feature-value pairs: Abbr=Yes, Case=Gen, Definite=Def, Definite=Def,Ind, Definite=Ind, Gender=Fem, Gender=Masc, Gender=Neut, Number=Plur, Number=Plur,Sing, Number=Sing

NOUN occurs with 42 feature combinations. The most frequent feature combination is Definite=Ind|Gender=Masc|Number=Sing (12574 tokens). Examples: dag, gang, verden, del, grunn, måte, plass, vei, politikk, grad

Relations

NOUN nodes are attached to their parents using 22 different relations: no-dep/obl (13115; 25% instances), no-dep/nmod (10804; 21% instances), no-dep/obj (8895; 17% instances), no-dep/nsubj (7986; 15% instances), no-dep/conj (4275; 8% instances), no-dep/root (2694; 5% instances), no-dep/xcomp (1049; 2% instances), no-dep/nsubj:pass (874; 2% instances), no-dep/appos (501; 1% instances), no-dep/flat:name (412; 1% instances), no-dep/acl (231; 0% instances), no-dep/ccomp (181; 0% instances), no-dep/advcl (178; 0% instances), no-dep/acl:relcl (150; 0% instances), no-dep/iobj (135; 0% instances), no-dep/orphan (82; 0% instances), no-dep/parataxis (68; 0% instances), no-dep/dislocated (58; 0% instances), no-dep/compound (51; 0% instances), no-dep/csubj (33; 0% instances), no-dep/goeswith (2; 0% instances), no-dep/discourse (1; 0% instances)

Parents of NOUN nodes belong to 15 different parts of speech: VERB (28487; 55% instances), NOUN (14019; 27% instances), ADJ (2773; 5% instances), ROOT (2694; 5% instances), PROPN (2471; 5% instances), PRON (412; 1% instances), DET (261; 1% instances), NUM (247; 0% instances), ADV (244; 0% instances), ADP (148; 0% instances), X (7; 0% instances), SCONJ (6; 0% instances), INTJ (4; 0% instances), AUX (1; 0% instances), CCONJ (1; 0% instances)

9732 (19%) NOUN nodes are leaves.

17054 (33%) NOUN nodes have one child.

13635 (26%) NOUN nodes have two children.

11354 (22%) NOUN nodes have three or more children.

The highest child degree of a NOUN node is 19.

Children of NOUN nodes are attached using 28 different relations: no-dep/case (20958; 23% instances), no-dep/nmod (14709; 16% instances), no-dep/amod (12350; 14% instances), no-dep/det (11105; 12% instances), no-dep/punct (6878; 8% instances), no-dep/conj (4190; 5% instances), no-dep/cc (3508; 4% instances), no-dep/acl:relcl (2674; 3% instances), no-dep/cop (2422; 3% instances), no-dep/nummod (1979; 2% instances), no-dep/nsubj (1815; 2% instances), no-dep/advmod (1750; 2% instances), no-dep/mark (1502; 2% instances), no-dep/acl (1272; 1% instances), no-dep/obl (639; 1% instances), no-dep/expl (459; 1% instances), no-dep/appos (431; 0% instances), no-dep/compound (245; 0% instances), no-dep/aux (233; 0% instances), no-dep/parataxis (210; 0% instances), no-dep/advcl (182; 0% instances), no-dep/csubj (155; 0% instances), no-dep/xcomp (127; 0% instances), no-dep/orphan (94; 0% instances), no-dep/discourse (16; 0% instances), no-dep/flat:name (16; 0% instances), no-dep/obj (6; 0% instances), no-dep/goeswith (2; 0% instances)

Children of NOUN nodes belong to 17 different parts of speech: ADP (22201; 25% instances), NOUN (14019; 16% instances), ADJ (13202; 15% instances), DET (13123; 15% instances), PUNCT (6878; 8% instances), VERB (4304; 5% instances), PROPN (3800; 4% instances), CCONJ (3514; 4% instances), AUX (2655; 3% instances), NUM (2485; 3% instances), PRON (1782; 2% instances), ADV (1396; 2% instances), SCONJ (432; 0% instances), PART (73; 0% instances), SYM (32; 0% instances), INTJ (16; 0% instances), X (15; 0% instances)


Treebank Statistics (UD_Norwegian-Nynorsk)

There are 11973 NOUN lemmas (52%), 16284 NOUN types (53%) and 54902 NOUN tokens (20%). Out of 17 observed tags, the rank of NOUN is: 1 in number of lemmas, 1 in number of types and 1 in number of tokens.

The 10 most frequent NOUN lemmas: år, dag, land, tid, folk, språk, del, kommune, gong, prosent

The 10 most frequent NOUN types: år, dag, folk, tid, prosent, språk, del, kroner, landet, regjeringa

The 10 most frequent ambiguous lemmas: år (NOUN 634, X 1), folk (NOUN 321, X 2), språk (NOUN 294, X 1), del (NOUN 275, X 2), prosent (NOUN 219, X 1), bok (NOUN 193, X 1), liv (NOUN 151, X 1), mann (NOUN 175, X 1), grunn (NOUN 165, ADJ 2), verd (NOUN 147, ADJ 23)

The 10 most frequent ambiguous types: år (NOUN 453, X 1), folk (NOUN 260, X 2), prosent (NOUN 212, X 1), språk (NOUN 207, X 1), del (NOUN 177, X 2, VERB 2), grunn (NOUN 115, ADJ 1), SV (NOUN 111, X 1), arbeidet (NOUN 93, X 1), leiar (NOUN 94, VERB 3), bruk (NOUN 98, X 1, VERB 1)

Morphology

The form / lemma ratio of NOUN is 1.360060 (the average of all parts of speech is 1.343969).

The 1st highest number of forms (9) was observed with the lemma “medlem”: medlammar, medlem, medlemar, medlemene, medlemer, medlemmane, medlemmar, medlemmene, medlemmer.

The 2nd highest number of forms (8) was observed with the lemma “tid”: tid, tida, tidene, tidenes, tider, tiders, tidi, tids.

The 3rd highest number of forms (7) was observed with the lemma “dag”: dag, dagane, dagar, dagars, dagen, dagens, dags.

NOUN occurs with 5 features: no-feat/Gender (53297; 97% instances), no-feat/Number (50591; 92% instances), no-feat/Definite (50586; 92% instances), no-feat/Abbr (955; 2% instances), no-feat/Case (545; 1% instances)

NOUN occurs with 11 feature-value pairs: Abbr=Yes, Case=Gen, Definite=Def, Definite=Def,Ind, Definite=Ind, Gender=Fem, Gender=Masc, Gender=Neut, Number=Plur, Number=Plur,Sing, Number=Sing

NOUN occurs with 41 feature combinations. The most frequent feature combination is Definite=Ind|Gender=Masc|Number=Sing (10441 tokens). Examples: dag, del, gong, grunn, leiar, bruk, plass, måte, fredag, nynorsk

Relations

NOUN nodes are attached to their parents using 25 different relations: no-dep/obl (13261; 24% instances), no-dep/nmod (11404; 21% instances), no-dep/nsubj (9921; 18% instances), no-dep/obj (8818; 16% instances), no-dep/conj (4464; 8% instances), no-dep/root (2759; 5% instances), no-dep/xcomp (1007; 2% instances), no-dep/flat:name (993; 2% instances), no-dep/appos (634; 1% instances), no-dep/nsubj:pass (488; 1% instances), no-dep/acl (189; 0% instances), no-dep/advcl (180; 0% instances), no-dep/ccomp (179; 0% instances), no-dep/acl:relcl (163; 0% instances), no-dep/iobj (146; 0% instances), no-dep/orphan (96; 0% instances), no-dep/parataxis (76; 0% instances), no-dep/dislocated (50; 0% instances), no-dep/csubj (33; 0% instances), no-dep/compound (32; 0% instances), no-dep/goeswith (3; 0% instances), no-dep/discourse (2; 0% instances), no-dep/expl (2; 0% instances), no-dep/cc (1; 0% instances), no-dep/flat:foreign (1; 0% instances)

Parents of NOUN nodes belong to 13 different parts of speech: VERB (27207; 50% instances), NOUN (16123; 29% instances), ADJ (5212; 9% instances), ROOT (2759; 5% instances), PROPN (1919; 3% instances), DET (439; 1% instances), PRON (410; 1% instances), NUM (322; 1% instances), ADV (283; 1% instances), ADP (201; 0% instances), X (12; 0% instances), INTJ (9; 0% instances), SCONJ (6; 0% instances)

10159 (19%) NOUN nodes are leaves.

18170 (33%) NOUN nodes have one child.

14051 (26%) NOUN nodes have two children.

12522 (23%) NOUN nodes have three or more children.

The highest child degree of a NOUN node is 13.

Children of NOUN nodes are attached using 28 different relations: no-dep/case (22529; 23% instances), no-dep/nmod (14947; 15% instances), no-dep/amod (13465; 14% instances), no-dep/det (11688; 12% instances), no-dep/punct (7137; 7% instances), no-dep/conj (4438; 5% instances), no-dep/cc (3779; 4% instances), no-dep/cop (2532; 3% instances), no-dep/acl:relcl (2478; 3% instances), no-dep/flat:name (2195; 2% instances), no-dep/nummod (1993; 2% instances), no-dep/advmod (1958; 2% instances), no-dep/nsubj (1917; 2% instances), no-dep/mark (1422; 1% instances), no-dep/acl (1305; 1% instances), no-dep/obl (742; 1% instances), no-dep/appos (611; 1% instances), no-dep/expl (492; 1% instances), no-dep/parataxis (283; 0% instances), no-dep/aux (274; 0% instances), no-dep/advcl (176; 0% instances), no-dep/compound (138; 0% instances), no-dep/csubj (136; 0% instances), no-dep/xcomp (101; 0% instances), no-dep/orphan (96; 0% instances), no-dep/discourse (32; 0% instances), no-dep/obj (12; 0% instances), no-dep/goeswith (3; 0% instances)

Children of NOUN nodes belong to 17 different parts of speech: ADP (23791; 25% instances), NOUN (16123; 17% instances), ADJ (14417; 15% instances), DET (13200; 14% instances), PUNCT (7137; 7% instances), PROPN (5022; 5% instances), VERB (4151; 4% instances), CCONJ (3815; 4% instances), AUX (2806; 3% instances), NUM (2546; 3% instances), PRON (1765; 2% instances), ADV (1519; 2% instances), SCONJ (435; 0% instances), PART (65; 0% instances), INTJ (33; 0% instances), X (32; 0% instances), SYM (22; 0% instances)


NOUN in other languages: [am] [ar] [bg] [bxr] [ca] [ckb] [cop] [cs] [cu] [da] [de] [el] [en] [es] [et] [eu] [fa] [fi] [fo] [fr] [ga] [gl] [got] [grc] [he] [hi] [hr] [hu] [id] [it] [ja] [kk] [kmr] [ko] [la] [lv] [mr] [nl] [no] [pl] [pt] [ro] [ru] [sa] [sk] [sla] [sl] [so] [sr] [sv] [swl] [ta] [tr] [ug] [uk] [u] [urj] [ur] [vi] [yue] [zh]