NUM: numeral
Definition
A numeral is a word, functioning most typically as a determiner or pronoun, that expresses a number and a relation to the number, such as quantity, sequence, frequency or fraction.
Cardinal numerals are covered by NUM regardless of syntactic function and regardless of whether they are expressed as words (fyra “four”), digits (4) or Roman numerals (IV). By contrast, ordinal numerals are always tagged ADJ.
Note that in Swedish the decimal mark is most often a comma. Thousands are then separated by either a space or a dot.
Examples
- 0, 1, 2, 3, 2014, 1 000 000, 3.000,15, 3,14159265359
- ett “one”, två “two”, tre “three”, sjuttiosju “seventy-seven”
- I, II, III, IV, V, MMXIV
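The digit conventions noted above (comma as decimal mark, space or dot as thousands separator) can be sketched in code. The following is a minimal illustration, not part of the UD guidelines; the function name `parse_swedish_number` is hypothetical, and it assumes that every dot in the input is a thousands separator, which would misread strings that use a dot for other purposes (for example clock times).

```python
from decimal import Decimal


def parse_swedish_number(text: str) -> Decimal:
    """Convert a Swedish-formatted digit string to a Decimal.

    Assumption (not from the guidelines): a dot or any kind of space is
    always a thousands separator, and a comma is always the decimal mark.
    """
    cleaned = text.strip()
    # Drop thousands separators: plain space, no-break spaces, and dot.
    for separator in (" ", "\u00a0", "\u202f", "."):
        cleaned = cleaned.replace(separator, "")
    # Swedish decimal comma becomes the decimal point Decimal expects.
    cleaned = cleaned.replace(",", ".")
    return Decimal(cleaned)


if __name__ == "__main__":
    for example in ["2014", "1 000 000", "3.000,15", "3,14159265359"]:
        print(example, "->", parse_swedish_number(example))
```

`Decimal` is used instead of `float` so that the digits of the original token are preserved exactly.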
Treebank Statistics (UD_Swedish)
There are 403 NUM lemmas (4%), 404 NUM types (3%) and 1384 NUM tokens (2%).
Out of 16 observed tags, the rank of NUM is: 6 in number of lemmas, 6 in number of types and 11 in number of tokens.
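Counts of this kind can be reproduced from a treebank in CoNLL-U format. The sketch below is only an illustration of the idea, not the script that generated this page: the function name `upos_counts` and the three-word sample are hypothetical, and it assumes that types are counted case-sensitively from the FORM column.

```python
def upos_counts(conllu_text: str, upos: str = "NUM"):
    """Return (lemmas, types, tokens) for one UPOS tag in CoNLL-U text."""
    lemmas, types, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        if not line.strip() or line.startswith("#"):
            continue  # sentence boundary or comment line
        cols = line.split("\t")
        # Skip malformed lines, multiword ranges (1-2) and empty nodes (1.1).
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        if cols[3] == upos:  # column 4 is UPOS
            tokens += 1
            types.add(cols[1])   # column 2 is FORM
            lemmas.add(cols[2])  # column 3 is LEMMA
    return len(lemmas), len(types), tokens


# Hypothetical two-sentence sample; an empty row marks a sentence break.
ROWS = [
    ["1", "Två", "två", "NUM", "_", "_", "2", "nummod", "_", "_"],
    ["2", "bilar", "bil", "NOUN", "_", "_", "0", "root", "_", "_"],
    ["3", "och", "och", "CCONJ", "_", "_", "5", "cc", "_", "_"],
    ["4", "ett", "en", "NUM", "_", "_", "5", "nummod", "_", "_"],
    ["5", "tåg", "tåg", "NOUN", "_", "_", "2", "conj", "_", "_"],
    [],
    ["1", "en", "en", "NUM", "_", "_", "2", "nummod", "_", "_"],
    ["2", "bil", "bil", "NOUN", "_", "_", "0", "root", "_", "_"],
]
SAMPLE = "\n".join("\t".join(row) for row in ROWS)

if __name__ == "__main__":
    print(upos_counts(SAMPLE))  # (2, 3, 3)
```

In the sample, *ett* and *en* share the lemma *en*, so the lemma count is lower than the type count.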
The 10 most frequent NUM lemmas: två, en, tre, 1, 2, 20, 1970, 3, fyra, 10
The 10 most frequent NUM types: två, tre, 1, en, 2, 20, ett, 1970, 3, fyra
The 10 most frequent ambiguous lemmas: två (NUM 88, ADJ 18, PRON 1), en (DET 3049, NUM 80, ADJ 63, PRON 44), tre (NUM 59, ADJ 6), fyra (NUM 19, ADJ 8), sex (NUM 12, NOUN 1), tio (NUM 12, ADJ 2), fem (NUM 10, ADJ 3), åtta (NUM 5, ADJ 1), i (ADP 2023, NUM 1, ADJ 1, NOUN 1)
The 10 most frequent ambiguous types: en (DET 1000, NUM 50, PRON 29), ett (DET 484, NUM 25, PRON 4), sex (NUM 11, NOUN 1), I (ADP 209, NOUN 1, ADJ 1, NUM 1, PROPN 1)
- en
- ett
- sex
- I
- ADP 209: I och med att kvinnan axlar mer krävande uppgifter får hon högre lön .
- NOUN 1: På I 12 i Eksjö tog en kompanichef för ett pansarvärnskompani saken i egna händer .
- ADJ 1: 1 I Kor. 7:2 .
- NUM 1: De åldersgrupper , som lekcirkeln i första hand är tänkt för är - förutom babybarnen - småbarn ( 1-3 år ) , förskolebarn ( 4-6 år ) , skolbarn I ( 7-9 år ) och skolbarn II ( 10-12 år ) .
- PROPN 1: De nationalitetsbeteckningar som bör användas i förbindelse med postnummer till utlandet är följande : Belgien B Danmark DK Finland SF Frankrike F Italien I Liechtenstein FL Norge N Schweiz CH Västtyskland D Österrike A
Morphology
The form / lemma ratio of NUM is 1.002481 (the average of all parts of speech is 1.407742).
The 1st highest number of forms (2) was observed with the lemma “1”: 1, 1).
The 2nd highest number of forms (2) was observed with the lemma “en”: en, ett.
The 3rd highest number of forms (1) was observed with the lemma “-67”: -67.
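The reported form / lemma ratio is numerically consistent with dividing the number of NUM types by the number of NUM lemmas given in the statistics above (404 / 403). The snippet below is a small arithmetic check under that reading; the helper name `form_lemma_ratio` is hypothetical and the page itself does not state the formula.

```python
def form_lemma_ratio(n_types: int, n_lemmas: int) -> float:
    """Ratio of distinct word forms (types) to distinct lemmas."""
    if n_lemmas <= 0:
        raise ValueError("n_lemmas must be positive")
    return n_types / n_lemmas


if __name__ == "__main__":
    # 404 types and 403 lemmas are the UD_Swedish figures quoted on this page.
    print(f"{form_lemma_ratio(404, 403):.6f}")  # 1.002481
```

A ratio this close to 1 reflects how little Swedish cardinal numerals inflect: only a couple of lemmas have more than one form.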
NUM occurs with 5 features: sv-feat/NumType (1384; 100% instances), sv-feat/Case (1378; 100% instances), sv-feat/Definite (80; 6% instances), sv-feat/Gender (80; 6% instances), sv-feat/Number (80; 6% instances)
NUM occurs with 6 feature-value pairs: Case=Nom, Definite=Ind, Gender=Com, Gender=Neut, NumType=Card, Number=Sing
NUM occurs with 4 feature combinations.
The most frequent feature combination is Case=Nom|NumType=Card (1298 tokens).
Examples: två, tre, 1, 2, 20, 1970, 3, fyra, 10, 100
Relations
NUM nodes are attached to their parents using 17 different relations: sv-dep/nummod (1067; 77% instances), sv-dep/obl (125; 9% instances), sv-dep/nmod (82; 6% instances), sv-dep/conj (28; 2% instances), sv-dep/appos (19; 1% instances), sv-dep/nsubj (13; 1% instances), sv-dep/obj (13; 1% instances), sv-dep/root (13; 1% instances), sv-dep/orphan (10; 1% instances), sv-dep/list (4; 0% instances), sv-dep/advcl (2; 0% instances), sv-dep/nsubj:pass (2; 0% instances), sv-dep/xcomp (2; 0% instances), sv-dep/acl (1; 0% instances), sv-dep/ccomp (1; 0% instances), sv-dep/dislocated (1; 0% instances), sv-dep/parataxis (1; 0% instances)
Parents of NUM nodes belong to 10 different parts of speech: NOUN (988; 71% instances), VERB (186; 13% instances), NUM (99; 7% instances), ADJ (43; 3% instances), PROPN (39; 3% instances), ROOT (13; 1% instances), ADV (7; 1% instances), ADP (4; 0% instances), DET (3; 0% instances), PRON (2; 0% instances)
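Distributions like the two above (incoming relation and part of speech of the parent) can be tallied from CoNLL-U data. This is a hedged sketch rather than the tool behind these statistics; `parse_sentences`, `attachment_stats` and the sample rows are hypothetical names and data, and a head of 0 is reported as `ROOT` to mirror the listing above.

```python
from collections import Counter


def parse_sentences(conllu_text: str):
    """Yield sentences as lists of (id, upos, head, deprel) tuples."""
    sentence = []
    for line in conllu_text.splitlines():
        if not line.strip():  # blank line ends a sentence
            if sentence:
                yield sentence
                sentence = []
            continue
        if line.startswith("#"):
            continue
        cols = line.split("\t")
        # Keep only ordinary tokens (no multiword ranges or empty nodes).
        if len(cols) == 10 and cols[0].isdigit():
            sentence.append((int(cols[0]), cols[3], int(cols[6]), cols[7]))
    if sentence:
        yield sentence


def attachment_stats(conllu_text: str, upos: str = "NUM"):
    """Count incoming relations and parent UPOS tags for nodes tagged `upos`."""
    relations, parent_pos = Counter(), Counter()
    for sentence in parse_sentences(conllu_text):
        pos_by_id = {tid: tag for tid, tag, _, _ in sentence}
        for _, tag, head, deprel in sentence:
            if tag != upos:
                continue
            relations[deprel] += 1
            # Head 0 is the artificial root, which has no UPOS of its own.
            parent_pos[pos_by_id.get(head, "ROOT")] += 1
    return relations, parent_pos


# Hypothetical sample: two numeral modifiers and one numeral as root.
ROWS = [
    ["1", "Två", "två", "NUM", "_", "_", "2", "nummod", "_", "_"],
    ["2", "bilar", "bil", "NOUN", "_", "_", "0", "root", "_", "_"],
    ["3", "och", "och", "CCONJ", "_", "_", "5", "cc", "_", "_"],
    ["4", "ett", "en", "NUM", "_", "_", "5", "nummod", "_", "_"],
    ["5", "tåg", "tåg", "NOUN", "_", "_", "2", "conj", "_", "_"],
    [],
    ["1", "1970", "1970", "NUM", "_", "_", "0", "root", "_", "_"],
]
SAMPLE = "\n".join("\t".join(row) for row in ROWS)

if __name__ == "__main__":
    rels, parents = attachment_stats(SAMPLE)
    print(rels.most_common())
    print(parents.most_common())
```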
831 (60%) NUM nodes are leaves.
361 (26%) NUM nodes have one child.
117 (8%) NUM nodes have two children.
75 (5%) NUM nodes have three or more children.
The highest child degree of a NUM node is 8.
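The child-degree figures above count how many dependents each NUM node has. A minimal sketch of that computation follows, assuming sentences have already been read into `(id, upos, head)` tuples; the function name `child_degree_histogram` and the sample sentences are hypothetical.

```python
from collections import Counter


def child_degree_histogram(sentences, upos="NUM"):
    """Map child degree -> number of nodes tagged `upos` with that degree.

    `sentences` is a list of sentences, each a list of (id, upos, head).
    """
    histogram = Counter()
    for sentence in sentences:
        # Number of dependents per head id within this sentence.
        children = Counter(head for _, _, head in sentence)
        for token_id, tag, _ in sentence:
            if tag == upos:
                histogram[children[token_id]] += 1  # 0 means a leaf
    return histogram


# Hypothetical sample: "ungefär tre bilar" (numeral with one adverb child)
# and "två bilar" (numeral as a leaf).
SENTENCES = [
    [(1, "ADV", 2), (2, "NUM", 3), (3, "NOUN", 0)],
    [(1, "NUM", 2), (2, "NOUN", 0)],
]

if __name__ == "__main__":
    hist = child_degree_histogram(SENTENCES)
    print(dict(hist))
    print("highest child degree:", max(hist))
```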
Children of NUM nodes are attached using 20 different relations: sv-dep/advmod (224; 26% instances), sv-dep/nmod (182; 21% instances), sv-dep/case (157; 18% instances), sv-dep/punct (68; 8% instances), sv-dep/nummod (56; 6% instances), sv-dep/conj (47; 5% instances), sv-dep/cc (46; 5% instances), sv-dep/obl (34; 4% instances), sv-dep/det (13; 1% instances), sv-dep/nsubj (9; 1% instances), sv-dep/fixed (8; 1% instances), sv-dep/cop (7; 1% instances), sv-dep/acl:relcl (4; 0% instances), sv-dep/amod (4; 0% instances), sv-dep/appos (4; 0% instances), sv-dep/mark (3; 0% instances), sv-dep/nmod:poss (3; 0% instances), sv-dep/acl (1; 0% instances), sv-dep/advcl (1; 0% instances), sv-dep/orphan (1; 0% instances)
Children of NUM nodes belong to 13 different parts of speech: ADV (228; 26% instances), NOUN (218; 25% instances), ADP (143; 16% instances), NUM (99; 11% instances), PUNCT (68; 8% instances), CCONJ (43; 5% instances), ADJ (21; 2% instances), SYM (19; 2% instances), DET (16; 2% instances), AUX (7; 1% instances), VERB (6; 1% instances), PRON (3; 0% instances), PROPN (1; 0% instances)
Treebank Statistics (UD_Swedish-LinES)
There is 1 NUM lemma (6%), 121 NUM types (1%) and 339 NUM tokens (1%).
Out of 17 observed tags, the rank of NUM is: 9 in number of lemmas, 6 in number of types and 14 in number of tokens.
The 10 most frequent NUM lemmas: _
The 10 most frequent NUM types: två, tre, en, fem, sex, tio, 1, 2000, fyra, 2
The 10 most frequent ambiguous lemmas: _ (NOUN 11462, VERB 8134, ADP 7148, PUNCT 6980, PRON 6631, ADV 4925, ADJ 4444, DET 3508, AUX 2803, CCONJ 2463, SCONJ 2183, PROPN 2158, PART 1442, NUM 339, INTJ 143, X 15, SYM 9)
The 10 most frequent ambiguous types: en (DET 1220, PRON 40, NUM 18), 1 (NUM 10, ADJ 1), 3 (NUM 3, ADJ 1), ett (DET 560, PRON 5, NUM 2), 12 (NUM 2, ADJ 1), 30 (NUM 2, ADJ 1), 22 (ADJ 2, NUM 1), 4 (NUM 1, ADJ 1)
- en
- 1
- 3
- ett
- 12
- NUM 2: I samband med andra behandlingen bedömde jag det som lämpligt att åter ta upp 3 av de 12 ändringsförslag som avvisades av rådet .
- ADJ 1: Den 12 juli , efter raiden , anklagades Israel för att uppmuntra reaktionärerna i Rhodesia och Sydafrika genom sin demonstration av militär överlägsenhet och sitt bruk av västerländsk beväpning och teknik , för att störa det arbete som utfördes av rättsinniga människor i Paris vilka försökte skapa ett nytt klimat och behandla den tredje världens länder som jämlikar och partners .
- 30
- 22
- 4
Morphology
The form / lemma ratio of NUM is 121.000000 (the average of all parts of speech is 693.647059).
The 1st highest number of forms (121) was observed with the lemma “_”: 1, 1-100, 10, 100, 101-200, 11, 11.25, 11.30, 12, 12.00, 12.30, 13, 14, 15, 16.30, 18.30, 1857, 1875, 19.15, 1910, 1947, 1952, 1953, 1955, 1973, 1976, 1996, 1996-1997, 1997, 2, 2,6, 2000, 2000-, 2002, 2002-, 2005, 21.00, 22, 23, 25, 3, 30, 37, 38, 4, 40, 43-, 44, 46, 49, 5, 5.5, 50, 50000, 59, 6, 60, 6500, 7.0, 8, 84, 96/23, 96/96, A4-0072/97, C4-0497/98-98/0126, Etthundratjugotvå, H-0002/99, H-0045/99, H-0209/99, H-0218/97, H-0237/97, Tretti, Tvåtusen, arton, elva, en, ett, etthundranio, fem, femhundra, femti, femton, femtonhundra, fjorton, fyra, fyratusen, fyrtio, fyrtioåtta, hundra, n, nio, nitton, nittonhundra, nn, noll, sex, sextio, sexton, sju, sjutton, tio, tjugo, tjugofem, tjugofyra, tjugosex, tjugotusen, tjugotvå, tjugu, tolv, tre, tre-fyra, trehundrafemti, trettio, trettiofem, trettioåtta, tretton, tusen, två, två-ett, tvåhundra, åtta.
NUM does not occur with any features.
Relations
NUM nodes are attached to their parents using 12 different relations: sv-dep/nummod (225; 66% instances), sv-dep/obl (41; 12% instances), sv-dep/conj (20; 6% instances), sv-dep/discourse (14; 4% instances), sv-dep/appos (11; 3% instances), sv-dep/obj (7; 2% instances), sv-dep/flat (6; 2% instances), sv-dep/nsubj (6; 2% instances), sv-dep/root (4; 1% instances), sv-dep/nsubj:pass (2; 1% instances), sv-dep/xcomp (2; 1% instances), sv-dep/dislocated (1; 0% instances)
Parents of NUM nodes belong to 10 different parts of speech: NOUN (223; 66% instances), VERB (62; 18% instances), NUM (23; 7% instances), PROPN (18; 5% instances), ROOT (4; 1% instances), ADJ (3; 1% instances), ADV (2; 1% instances), PRON (2; 1% instances), AUX (1; 0% instances), X (1; 0% instances)
203 (60%) NUM nodes are leaves.
82 (24%) NUM nodes have one child.
33 (10%) NUM nodes have two children.
21 (6%) NUM nodes have three or more children.
The highest child degree of a NUM node is 7.
Children of NUM nodes are attached using 15 different relations: sv-dep/nmod (64; 28% instances), sv-dep/punct (35; 16% instances), sv-dep/advmod (30; 13% instances), sv-dep/case (29; 13% instances), sv-dep/conj (28; 12% instances), sv-dep/cc (14; 6% instances), sv-dep/det (6; 3% instances), sv-dep/cop (4; 2% instances), sv-dep/amod (3; 1% instances), sv-dep/nsubj (3; 1% instances), sv-dep/nummod (3; 1% instances), sv-dep/appos (2; 1% instances), sv-dep/mark (2; 1% instances), sv-dep/acl:relcl (1; 0% instances), sv-dep/parataxis (1; 0% instances)
Children of NUM nodes belong to 15 different parts of speech: NOUN (66; 29% instances), PUNCT (35; 16% instances), ADV (33; 15% instances), ADP (29; 13% instances), NUM (23; 10% instances), CCONJ (14; 6% instances), DET (6; 3% instances), ADJ (4; 2% instances), AUX (4; 2% instances), VERB (3; 1% instances), PRON (2; 1% instances), PROPN (2; 1% instances), SCONJ (2; 1% instances), PART (1; 0% instances), X (1; 0% instances)
Treebank Statistics (UD_Swedish_Sign_Language)
There is 1 NUM lemma (10%), 4 NUM types (2%) and 4 NUM tokens (1%).
Out of 10 observed tags, the rank of NUM is: 7 in number of lemmas, 7 in number of types and 9 in number of tokens.
The 10 most frequent NUM lemmas: _
The 10 most frequent NUM types: EN, EN-ENDA, EN-TILL, SJU_ÅTTA
The 10 most frequent ambiguous lemmas: _ (VERB 187, NOUN 73, X 45, PRON 29, ADV 19, DET 12, INTJ 7, ADJ 5, NUM 4, ADP 1)
The 10 most frequent ambiguous types:
Morphology
The form / lemma ratio of NUM is 4.000000 (the average of all parts of speech is 23.100000).
The 1st highest number of forms (4) was observed with the lemma “_”: EN, EN-ENDA, EN-TILL, SJU_ÅTTA.
NUM does not occur with any features.
Relations
NUM nodes are attached to their parents using 1 relation: sv-dep/nummod (4; 100% instances)
Parents of NUM nodes belong to 1 part of speech: NOUN (4; 100% instances)
4 (100%) NUM nodes are leaves.
The highest child degree of a NUM node is 0.
NUM in other languages: [am] [ar] [bg] [bxr] [ca] [ckb] [cop] [cs] [cu] [da] [de] [el] [en] [es] [et] [eu] [fa] [fi] [fo] [fr] [ga] [gl] [got] [grc] [he] [hi] [hr] [hu] [id] [it] [ja] [kk] [kmr] [ko] [la] [lv] [mr] [nl] [no] [pl] [pt] [ro] [ru] [sa] [sk] [sla] [sl] [so] [sr] [sv] [swl] [ta] [tr] [ug] [uk] [u] [urj] [ur] [vi] [yue] [zh]