home en/pos edit page issue tracker

This page still pertains to UD version 1.

DET: determiner

The English DET covers most cases of Penn Treebank DT, PDT, WDT. However, when a Penn Treebank word with one of these tags stands alone as a noun phrase rather than modifying another word, then it becomes PRON.


Treebank Statistics (UD_English)

There are 36 DET lemmas (0%), 39 DET types (0%) and 18180 DET tokens (8%). Out of 17 observed tags, the rank of DET is: 14 in number of lemmas, 15 in number of types and 6 in number of tokens.

The 10 most frequent DET lemmas: the, a, this, all, some, any, no, that, these, another

The 10 most frequent DET types: the, a, this, an, all, some, any, no, that, these

The 10 most frequent ambiguous lemmas: the (DET 10044, PRON 9, ADV 2, ADP 1), a (DET 4781, NOUN 19, X 13, ADV 5, ADP 4, PART 1, CCONJ 1, AUX 1), this (DET 814, PRON 468, ADV 5, NOUN 1), all (DET 483, ADV 110, NOUN 3, X 2, PUNCT 1), some (DET 388, ADV 2, X 1), any (DET 358, ADV 8, X 3), no (DET 281, INTJ 48, ADV 31, NOUN 1, X 1), that (SCONJ 1079, PRON 912, DET 201, ADV 41, ADP 4), these (DET 182, PRON 34), another (DET 139, NOUN 1, ADJ 1)

The 10 most frequent ambiguous types: the (DET 9000, PRON 7, ADV 2, PART 1, ADP 1), a (DET 4063, X 7, ADV 5, NOUN 5, ADP 4, CCONJ 1, AUX 1, PART 1), this (DET 687, PRON 339, ADV 5, NOUN 1), an (DET 524, CCONJ 3, NOUN 2), all (DET 431, ADV 107, NOUN 3, X 2), some (DET 358, ADV 2, X 1), any (DET 323, ADV 8, X 3), no (DET 229, INTJ 33, ADV 25, VERB 2, NOUN 1, X 1), that (SCONJ 1072, PRON 841, DET 183, ADV 40, ADP 4), these (DET 157, PRON 17)

Morphology

The form / lemma ratio of DET is 1.083333 (the average of all parts of speech is 1.181137).

The 1st highest number of forms (2) was observed with the lemma “a”: a, an.

The 2nd highest number of forms (2) was observed with the lemma “some”: $ome, some.

The 3rd highest number of forms (2) was observed with the lemma “this”: his, this.

DET occurs with 3 features: en-feat/PronType (16210; 89% instances), en-feat/Definite (14824; 82% instances), en-feat/Number (1278; 7% instances)

DET occurs with 8 feature-value pairs: Definite=Def, Definite=Ind, Number=Plur, Number=Sing, PronType=Art, PronType=Dem, PronType=Int, PronType=Rel

DET occurs with 9 feature combinations. The most frequent feature combination is Definite=Def|PronType=Art (10043 tokens). Examples: the

Relations

DET nodes are attached to their parents using 26 different relations: en-dep/det (17558; 97% instances), en-dep/det:predet (183; 1% instances), en-dep/nsubj (111; 1% instances), en-dep/obj (87; 0% instances), en-dep/obl (59; 0% instances), en-dep/conj (35; 0% instances), en-dep/root (26; 0% instances), en-dep/nmod (25; 0% instances), en-dep/mark (22; 0% instances), en-dep/nsubj:pass (12; 0% instances), en-dep/advmod (11; 0% instances), en-dep/reparandum (11; 0% instances), en-dep/nummod (7; 0% instances), en-dep/compound (6; 0% instances), en-dep/appos (4; 0% instances), en-dep/nmod:npmod (4; 0% instances), en-dep/obl:npmod (3; 0% instances), en-dep/xcomp (3; 0% instances), en-dep/advcl (2; 0% instances), en-dep/cc:preconj (2; 0% instances), en-dep/ccomp (2; 0% instances), en-dep/parataxis (2; 0% instances), en-dep/vocative (2; 0% instances), en-dep/case (1; 0% instances), en-dep/discourse (1; 0% instances), en-dep/iobj (1; 0% instances)

Parents of DET nodes belong to 14 different parts of speech: NOUN (16029; 88% instances), PROPN (1332; 7% instances), ADJ (342; 2% instances), VERB (246; 1% instances), NUM (60; 0% instances), PRON (57; 0% instances), DET (35; 0% instances), ROOT (26; 0% instances), ADV (24; 0% instances), SYM (22; 0% instances), INTJ (3; 0% instances), X (2; 0% instances), ADP (1; 0% instances), AUX (1; 0% instances)

17833 (98%) DET nodes are leaves.

210 (1%) DET nodes have one child.

91 (1%) DET nodes have two children.

46 (0%) DET nodes have three or more children.

The highest child degree of a DET node is 10.

Children of DET nodes are attached using 25 different relations: en-dep/nmod (191; 32% instances), en-dep/case (97; 16% instances), en-dep/punct (53; 9% instances), en-dep/acl:relcl (48; 8% instances), en-dep/cc (36; 6% instances), en-dep/advmod (34; 6% instances), en-dep/conj (28; 5% instances), en-dep/cop (21; 4% instances), en-dep/nsubj (21; 4% instances), en-dep/amod (15; 3% instances), en-dep/det:predet (10; 2% instances), en-dep/advcl (7; 1% instances), en-dep/det (7; 1% instances), en-dep/mark (7; 1% instances), en-dep/aux (3; 1% instances), en-dep/obj (3; 1% instances), en-dep/_ (2; 0% instances), en-dep/appos (2; 0% instances), en-dep/obl (2; 0% instances), en-dep/orphan (2; 0% instances), en-dep/parataxis (2; 0% instances), en-dep/aux:pass (1; 0% instances), en-dep/compound (1; 0% instances), en-dep/discourse (1; 0% instances), en-dep/nmod:poss (1; 0% instances)

Children of DET nodes belong to 16 different parts of speech: NOUN (151; 25% instances), ADP (97; 16% instances), VERB (71; 12% instances), PRON (58; 10% instances), PUNCT (52; 9% instances), CCONJ (35; 6% instances), ADV (32; 5% instances), DET (27; 5% instances), AUX (25; 4% instances), ADJ (21; 4% instances), NUM (6; 1% instances), PART (6; 1% instances), PROPN (6; 1% instances), SCONJ (5; 1% instances), SYM (2; 0% instances), INTJ (1; 0% instances)


Treebank Statistics (UD_English-ESL)

There are 1 DET lemmas (6%), 1 DET types (6%) and 9068 DET tokens (10%). Out of 17 observed tags, the rank of DET is: 6 in number of lemmas, 6 in number of types and 4 in number of tokens.

The 10 most frequent DET lemmas: _

The 10 most frequent DET types: _

The 10 most frequent ambiguous lemmas: _ (NOUN 14135, VERB 13583, PRON 9575, DET 9068, PUNCT 8624, ADP 7769, ADJ 5278, ADV 5121, AUX 4111, PART 3169, CONJ 2865, SCONJ 2278, PROPN 1574, NUM 776, INTJ 67, X 60, SYM 37)

The 10 most frequent ambiguous types: _ (NOUN 14135, VERB 13583, PRON 9575, DET 9068, PUNCT 8624, ADP 7769, ADJ 5278, ADV 5121, AUX 4111, PART 3169, CONJ 2865, SCONJ 2278, PROPN 1574, NUM 776, INTJ 67, X 60, SYM 37)

Morphology

The form / lemma ratio of DET is 1.000000 (the average of all parts of speech is 1.000000).

The 1st highest number of forms (1) was observed with the lemma “_”: _.

DET does not occur with any features.

Relations

DET nodes are attached to their parents using 19 different relations: en-dep/det (6591; 73% instances), en-dep/nmod:poss (1720; 19% instances), en-dep/nsubj (200; 2% instances), en-dep/det:predet (178; 2% instances), en-dep/nmod (143; 2% instances), en-dep/dobj (98; 1% instances), en-dep/neg (79; 1% instances), en-dep/nsubjpass (29; 0% instances), en-dep/root (7; 0% instances), en-dep/advmod (4; 0% instances), en-dep/case (4; 0% instances), en-dep/mark (4; 0% instances), en-dep/compound (3; 0% instances), en-dep/amod (2; 0% instances), en-dep/conj (2; 0% instances), en-dep/appos (1; 0% instances), en-dep/cc (1; 0% instances), en-dep/ccomp (1; 0% instances), en-dep/mwe (1; 0% instances)

Parents of DET nodes belong to 14 different parts of speech: NOUN (8222; 91% instances), VERB (350; 4% instances), PROPN (180; 2% instances), ADJ (146; 2% instances), ADV (85; 1% instances), PRON (44; 0% instances), NUM (21; 0% instances), ROOT (7; 0% instances), DET (4; 0% instances), SYM (4; 0% instances), ADP (2; 0% instances), AUX (1; 0% instances), INTJ (1; 0% instances), PART (1; 0% instances)

8866 (98%) DET nodes are leaves.

181 (2%) DET nodes have one child.

12 (0%) DET nodes have two children.

9 (0%) DET nodes have three or more children.

The highest child degree of a DET node is 8.

Children of DET nodes are attached using 20 different relations: en-dep/case (139; 57% instances), en-dep/nmod (47; 19% instances), en-dep/acl:relcl (10; 4% instances), en-dep/advmod (9; 4% instances), en-dep/punct (9; 4% instances), en-dep/cop (6; 2% instances), en-dep/nsubj (6; 2% instances), en-dep/cc (3; 1% instances), en-dep/amod (2; 1% instances), en-dep/aux (2; 1% instances), en-dep/conj (2; 1% instances), en-dep/det:predet (2; 1% instances), en-dep/neg (2; 1% instances), en-dep/advcl (1; 0% instances), en-dep/appos (1; 0% instances), en-dep/det (1; 0% instances), en-dep/goeswith (1; 0% instances), en-dep/mark (1; 0% instances), en-dep/mwe (1; 0% instances), en-dep/nmod:tmod (1; 0% instances)

Children of DET nodes belong to 14 different parts of speech: ADP (139; 57% instances), PRON (28; 11% instances), NOUN (27; 11% instances), VERB (19; 8% instances), PUNCT (9; 4% instances), ADV (7; 3% instances), DET (4; 2% instances), ADJ (3; 1% instances), CONJ (3; 1% instances), AUX (2; 1% instances), PART (2; 1% instances), PROPN (1; 0% instances), SCONJ (1; 0% instances), X (1; 0% instances)


Treebank Statistics (UD_English-LinES)

There are 1 DET lemmas (6%), 23 DET types (0%) and 6429 DET tokens (10%). Out of 17 observed tags, the rank of DET is: 6 in number of lemmas, 11 in number of types and 5 in number of tokens.

The 10 most frequent DET lemmas: _

The 10 most frequent DET types: the, a, an, this, that, no, all, some, any, these

The 10 most frequent ambiguous lemmas: _ (NOUN 12161, PUNCT 8085, VERB 8020, ADP 6788, DET 6429, PRON 6303, ADJ 4270, ADV 3700, AUX 3539, PROPN 2257, CCONJ 2081, PART 1703, SCONJ 1231, NUM 462, INTJ 122, X 41, SYM 5)

The 10 most frequent ambiguous types: the (DET 3471, ADV 1), a (DET 1562, ADV 4, PRON 1), this (DET 130, PRON 65), that (SCONJ 505, DET 99, PRON 88), no (DET 82, ADV 19, INTJ 7, PRON 7), all (DET 79, PRON 75, ADV 35, ADP 1), some (DET 62, PRON 13, ADV 3), any (DET 59, PRON 5, ADV 4), these (DET 45, PRON 4), each (DET 26, PRON 19)

Morphology

The form / lemma ratio of DET is 23.000000 (the average of all parts of speech is 527.705882).

The 1st highest number of forms (23) was observed with the lemma “_”: La, Le, a, all, an, any, both, du, each, either, every, no, one, some, that, the, these, this, those, what, whatever, which, whose.

DET does not occur with any features.

Relations

DET nodes are attached to their parents using 5 different relations: en-dep/det (6344; 99% instances), en-dep/advmod (75; 1% instances), en-dep/amod (6; 0% instances), en-dep/obj (3; 0% instances), en-dep/appos (1; 0% instances)

Parents of DET nodes belong to 13 different parts of speech: NOUN (6005; 93% instances), ADJ (131; 2% instances), PROPN (123; 2% instances), VERB (62; 1% instances), PRON (57; 1% instances), ADV (15; 0% instances), NUM (14; 0% instances), DET (9; 0% instances), ADP (4; 0% instances), PUNCT (4; 0% instances), X (3; 0% instances), INTJ (1; 0% instances), SYM (1; 0% instances)

6382 (99%) DET nodes are leaves.

40 (1%) DET nodes have one child.

4 (0%) DET nodes have two children.

3 (0%) DET nodes have three or more children.

The highest child degree of a DET node is 3.

Children of DET nodes are attached using 13 different relations: en-dep/advmod (13; 23% instances), en-dep/fixed (12; 21% instances), en-dep/det (8; 14% instances), en-dep/mark (6; 11% instances), en-dep/nsubj (4; 7% instances), en-dep/punct (4; 7% instances), en-dep/amod (3; 5% instances), en-dep/aux (2; 4% instances), en-dep/acl (1; 2% instances), en-dep/advcl (1; 2% instances), en-dep/appos (1; 2% instances), en-dep/cc (1; 2% instances), en-dep/obj (1; 2% instances)

Children of DET nodes belong to 13 different parts of speech: ADJ (11; 19% instances), ADV (11; 19% instances), DET (9; 16% instances), ADP (5; 9% instances), PUNCT (4; 7% instances), SCONJ (4; 7% instances), NOUN (3; 5% instances), PRON (3; 5% instances), AUX (2; 4% instances), PART (2; 4% instances), CCONJ (1; 2% instances), PROPN (1; 2% instances), VERB (1; 2% instances)


Treebank Statistics (UD_English-ParTUT)

There are 37 DET lemmas (1%), 39 DET types (1%) and 4015 DET tokens (11%). Out of 17 observed tags, the rank of DET is: 10 in number of lemmas, 10 in number of types and 4 in number of tokens.

The 10 most frequent DET lemmas: the, a, this, his, their, its, any, us, that, all

The 10 most frequent DET types: the, a, his, this, an, their, its, any, these, our

The 10 most frequent ambiguous lemmas: a (DET 758, X 1), this (DET 228, PRON 67), his (DET 215, PRON 4), any (DET 58, ADV 1), us (DET 47, PRON 26), that (SCONJ 246, PRON 169, DET 43, ADJ 2), all (DET 38, PRON 36, ADV 1), no (DET 38, ADV 7), you (PRON 82, DET 31), some (DET 27, PRON 17)

The 10 most frequent ambiguous types: a (DET 626, X 1), his (DET 190, PRON 4), this (DET 150, PRON 35), any (DET 56, ADV 1), these (DET 43, PRON 5), our (DET 42, PRON 1), all (DET 36, PRON 28, ADV 1), no (DET 27, ADV 7), some (DET 20, PRON 15), such (DET 20, ADJ 18, ADP 4)

Morphology

The form / lemma ratio of DET is 1.054054 (the average of all parts of speech is 1.187751).

The 1st highest number of forms (2) was observed with the lemma “a”: a, an.

The 2nd highest number of forms (2) was observed with the lemma “le”: Le, les.

The 3rd highest number of forms (2) was observed with the lemma “that”: that, those.

DET occurs with 5 features: en-feat/PronType (4015; 100% instances), en-feat/Definite (2989; 74% instances), en-feat/Number (1178; 29% instances), en-feat/Poss (514; 13% instances), en-feat/Gender (5; 0% instances)

DET occurs with 13 feature-value pairs: Definite=Def, Definite=Ind, Gender=Fem, Number=Plur, Number=Sing, Poss=Yes, PronType=Art, PronType=Dem, PronType=Ind, PronType=Int, PronType=Prs, PronType=Rel, PronType=Tot

DET occurs with 20 feature combinations. The most frequent feature combination is Definite=Def|PronType=Art (2206 tokens). Examples: the, ’s

Relations

DET nodes are attached to their parents using 12 different relations: en-dep/det (3469; 86% instances), en-dep/nmod:poss (513; 13% instances), en-dep/det:predet (11; 0% instances), en-dep/fixed (6; 0% instances), en-dep/nmod (6; 0% instances), en-dep/obl (3; 0% instances), en-dep/nsubj (2; 0% instances), en-dep/advmod (1; 0% instances), en-dep/iobj (1; 0% instances), en-dep/obj (1; 0% instances), en-dep/parataxis (1; 0% instances), en-dep/root (1; 0% instances)

Parents of DET nodes belong to 13 different parts of speech: NOUN (3800; 95% instances), PROPN (124; 3% instances), ADJ (26; 1% instances), PRON (21; 1% instances), NUM (14; 0% instances), VERB (9; 0% instances), X (7; 0% instances), ADP (5; 0% instances), ADV (4; 0% instances), SYM (2; 0% instances), DET (1; 0% instances), INTJ (1; 0% instances), ROOT (1; 0% instances)

3995 (100%) DET nodes are leaves.

15 (0%) DET nodes have one child.

4 (0%) DET nodes have two children.

1 (0%) DET nodes have three or more children.

The highest child degree of a DET node is 8.

Children of DET nodes are attached using 12 different relations: en-dep/case (8; 26% instances), en-dep/nmod (4; 13% instances), en-dep/punct (4; 13% instances), en-dep/advmod (3; 10% instances), en-dep/fixed (3; 10% instances), en-dep/aux (2; 6% instances), en-dep/conj (2; 6% instances), en-dep/advcl (1; 3% instances), en-dep/cop (1; 3% instances), en-dep/goeswith (1; 3% instances), en-dep/mark (1; 3% instances), en-dep/nsubj (1; 3% instances)

Children of DET nodes belong to 11 different parts of speech: ADP (8; 26% instances), ADV (5; 16% instances), PUNCT (4; 13% instances), AUX (3; 10% instances), NOUN (3; 10% instances), ADJ (2; 6% instances), PRON (2; 6% instances), DET (1; 3% instances), SCONJ (1; 3% instances), VERB (1; 3% instances), X (1; 3% instances)


DET in other languages: [am] [ar] [bg] [bxr] [ca] [ckb] [cop] [cs] [cu] [da] [de] [el] [en] [es] [et] [eu] [fa] [fi] [fo] [fr] [ga] [gl] [got] [grc] [he] [hi] [hr] [hu] [id] [it] [ja] [kk] [kmr] [ko] [la] [lv] [mr] [nl] [no] [pl] [pt] [ro] [ru] [sa] [sk] [sla] [sl] [so] [sr] [sv] [swl] [ta] [tr] [ug] [uk] [u] [urj] [ur] [vi] [yue] [zh]