DET
: determiner
The English DET
covers most cases of Penn Treebank DT, PDT, WDT. However, when a Penn Treebank word with one of these tags stands alone as a noun phrase rather than modifying another word, then it becomes PRON
.
Treebank Statistics (UD_English)
There are 36 DET
lemmas (0%), 39 DET
types (0%) and 18180 DET
tokens (8%).
Out of 17 observed tags, the rank of DET
is: 14 in number of lemmas, 15 in number of types and 6 in number of tokens.
The 10 most frequent DET
lemmas: the, a, this, all, some, any, no, that, these, another
The 10 most frequent DET
types: the, a, this, an, all, some, any, no, that, these
The 10 most frequent ambiguous lemmas: the (DET 10044, PRON 9, ADV 2, ADP 1), a (DET 4781, NOUN 19, X 13, ADV 5, ADP 4, PART 1, CCONJ 1, AUX 1), this (DET 814, PRON 468, ADV 5, NOUN 1), all (DET 483, ADV 110, NOUN 3, X 2, PUNCT 1), some (DET 388, ADV 2, X 1), any (DET 358, ADV 8, X 3), no (DET 281, INTJ 48, ADV 31, NOUN 1, X 1), that (SCONJ 1079, PRON 912, DET 201, ADV 41, ADP 4), these (DET 182, PRON 34), another (DET 139, NOUN 1, ADJ 1)
The 10 most frequent ambiguous types: the (DET 9000, PRON 7, ADV 2, PART 1, ADP 1), a (DET 4063, X 7, ADV 5, NOUN 5, ADP 4, CCONJ 1, AUX 1, PART 1), this (DET 687, PRON 339, ADV 5, NOUN 1), an (DET 524, CCONJ 3, NOUN 2), all (DET 431, ADV 107, NOUN 3, X 2), some (DET 358, ADV 2, X 1), any (DET 323, ADV 8, X 3), no (DET 229, INTJ 33, ADV 25, VERB 2, NOUN 1, X 1), that (SCONJ 1072, PRON 841, DET 183, ADV 40, ADP 4), these (DET 157, PRON 17)
- the
- DET 9000: From the AP comes this story :
- PRON 7: I think that the re pretty good .
- ADV 2: Got the tile ripped out , call today , now all the sudden this grinder wo n’t leave a finished look AND it ‘s $ 125 PLUS around $ 75 for the inserts .
- PART 1: Ok so i spoke to the vet and he said the give him white rice and boiled chicken but only a little at a time .
- ADP 1: Hens make excellent mothers as they are bigger , can see better , and are smarter the Silkies .
- a
- DET 4063: Read the entire article ; there ‘s a punchline , too .
- X 7: A la guerre c’est comme a la guerre !
- ADV 5: Also , any tour recommendations would be very helpful a well .
- NOUN 5: Top range of bike , cheap prices , excellent a +++
- ADP 4: Big deal kind a stuff .
- CCONJ 1: But word of advice if you ‘re get your girlfriend a laptop make sure it s a good brand a not something like DELL , Acer , Asus , eMachines etc .
- AUX 1: yea i guess but rabbits a easily escape a pen or another rabbit could get in there and that rabbit could be the opposite gender .
- PART 1: I feel X - BOX is a very smooth system i own it like 3 years , it s very compatible to previous versions and mostly important i was very comfortable with the User Interface and the JOYSTICK …. coz you do nt wan a hold a joystick that gives you discomfort .
- this
- an
- all
- some
- any
- no
- DET 229: i think they are all bark and no bite .
- INTJ 33: Er , no ?
- ADV 25: Stylish and contemporary , no matter your size or personality type .
- VERB 2: I du n no how they did it , but Scottish friends — this is THE REAL DEAL .
- NOUN 1: It was a no brainer really .
- X 1: she knows she is invading someone else ‘s territory , but ca n’t help it , and has no where to go .
- that
- SCONJ 1072: It is rumored that North Korea has at least a couple nuclear weapons .
- PRON 841: Right now that seems to be the US , EU , and IAEA .
- DET 183: I have sent your question re on line trading to that area .
- ADV 40: it ‘s passable as a pub , but the pizza is not that great .
- ADP 4: Is that reasonable ?
- these
Morphology
The form / lemma ratio of DET
is 1.083333 (the average of all parts of speech is 1.181137).
The 1st highest number of forms (2) was observed with the lemma “a”: a, an.
The 2nd highest number of forms (2) was observed with the lemma “some”: $ome, some.
The 3rd highest number of forms (2) was observed with the lemma “this”: his, this.
DET
occurs with 3 features: en-feat/PronType (16210; 89% instances), en-feat/Definite (14824; 82% instances), en-feat/Number (1278; 7% instances)
DET
occurs with 8 feature-value pairs: Definite=Def
, Definite=Ind
, Number=Plur
, Number=Sing
, PronType=Art
, PronType=Dem
, PronType=Int
, PronType=Rel
DET
occurs with 9 feature combinations.
The most frequent feature combination is Definite=Def|PronType=Art
(10043 tokens).
Examples: the
Relations
DET
nodes are attached to their parents using 26 different relations: en-dep/det (17558; 97% instances), en-dep/det:predet (183; 1% instances), en-dep/nsubj (111; 1% instances), en-dep/obj (87; 0% instances), en-dep/obl (59; 0% instances), en-dep/conj (35; 0% instances), en-dep/root (26; 0% instances), en-dep/nmod (25; 0% instances), en-dep/mark (22; 0% instances), en-dep/nsubj:pass (12; 0% instances), en-dep/advmod (11; 0% instances), en-dep/reparandum (11; 0% instances), en-dep/nummod (7; 0% instances), en-dep/compound (6; 0% instances), en-dep/appos (4; 0% instances), en-dep/nmod:npmod (4; 0% instances), en-dep/obl:npmod (3; 0% instances), en-dep/xcomp (3; 0% instances), en-dep/advcl (2; 0% instances), en-dep/cc:preconj (2; 0% instances), en-dep/ccomp (2; 0% instances), en-dep/parataxis (2; 0% instances), en-dep/vocative (2; 0% instances), en-dep/case (1; 0% instances), en-dep/discourse (1; 0% instances), en-dep/iobj (1; 0% instances)
Parents of DET
nodes belong to 14 different parts of speech: NOUN (16029; 88% instances), PROPN (1332; 7% instances), ADJ (342; 2% instances), VERB (246; 1% instances), NUM (60; 0% instances), PRON (57; 0% instances), DET (35; 0% instances), ROOT (26; 0% instances), ADV (24; 0% instances), SYM (22; 0% instances), INTJ (3; 0% instances), X (2; 0% instances), ADP (1; 0% instances), AUX (1; 0% instances)
17833 (98%) DET
nodes are leaves.
210 (1%) DET
nodes have one child.
91 (1%) DET
nodes have two children.
46 (0%) DET
nodes have three or more children.
The highest child degree of a DET
node is 10.
Children of DET
nodes are attached using 25 different relations: en-dep/nmod (191; 32% instances), en-dep/case (97; 16% instances), en-dep/punct (53; 9% instances), en-dep/acl:relcl (48; 8% instances), en-dep/cc (36; 6% instances), en-dep/advmod (34; 6% instances), en-dep/conj (28; 5% instances), en-dep/cop (21; 4% instances), en-dep/nsubj (21; 4% instances), en-dep/amod (15; 3% instances), en-dep/det:predet (10; 2% instances), en-dep/advcl (7; 1% instances), en-dep/det (7; 1% instances), en-dep/mark (7; 1% instances), en-dep/aux (3; 1% instances), en-dep/obj (3; 1% instances), en-dep/_ (2; 0% instances), en-dep/appos (2; 0% instances), en-dep/obl (2; 0% instances), en-dep/orphan (2; 0% instances), en-dep/parataxis (2; 0% instances), en-dep/aux:pass (1; 0% instances), en-dep/compound (1; 0% instances), en-dep/discourse (1; 0% instances), en-dep/nmod:poss (1; 0% instances)
Children of DET
nodes belong to 16 different parts of speech: NOUN (151; 25% instances), ADP (97; 16% instances), VERB (71; 12% instances), PRON (58; 10% instances), PUNCT (52; 9% instances), CCONJ (35; 6% instances), ADV (32; 5% instances), DET (27; 5% instances), AUX (25; 4% instances), ADJ (21; 4% instances), NUM (6; 1% instances), PART (6; 1% instances), PROPN (6; 1% instances), SCONJ (5; 1% instances), SYM (2; 0% instances), INTJ (1; 0% instances)
Treebank Statistics (UD_English-ESL)
There are 1 DET
lemmas (6%), 1 DET
types (6%) and 9068 DET
tokens (10%).
Out of 17 observed tags, the rank of DET
is: 6 in number of lemmas, 6 in number of types and 4 in number of tokens.
The 10 most frequent DET
lemmas: _
The 10 most frequent DET
types: _
The 10 most frequent ambiguous lemmas: _ (NOUN 14135, VERB 13583, PRON 9575, DET 9068, PUNCT 8624, ADP 7769, ADJ 5278, ADV 5121, AUX 4111, PART 3169, CONJ 2865, SCONJ 2278, PROPN 1574, NUM 776, INTJ 67, X 60, SYM 37)
The 10 most frequent ambiguous types: _ (NOUN 14135, VERB 13583, PRON 9575, DET 9068, PUNCT 8624, ADP 7769, ADJ 5278, ADV 5121, AUX 4111, PART 3169, CONJ 2865, SCONJ 2278, PROPN 1574, NUM 776, INTJ 67, X 60, SYM 37)
- _
- NOUN 14135: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- VERB 13583: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- PRON 9575: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- DET 9068: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- PUNCT 8624: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- ADP 7769: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- ADJ 5278: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- ADV 5121: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- AUX 4111: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- PART 3169: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- CONJ 2865: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- SCONJ 2278: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- PROPN 1574: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- NUM 776: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- INTJ 67: _ _ _ _ _ _ _ _ _ _ _ _
- X 60: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
- SYM 37: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
Morphology
The form / lemma ratio of DET
is 1.000000 (the average of all parts of speech is 1.000000).
The 1st highest number of forms (1) was observed with the lemma “_”: _.
DET
does not occur with any features.
Relations
DET
nodes are attached to their parents using 19 different relations: en-dep/det (6591; 73% instances), en-dep/nmod:poss (1720; 19% instances), en-dep/nsubj (200; 2% instances), en-dep/det:predet (178; 2% instances), en-dep/nmod (143; 2% instances), en-dep/dobj (98; 1% instances), en-dep/neg (79; 1% instances), en-dep/nsubjpass (29; 0% instances), en-dep/root (7; 0% instances), en-dep/advmod (4; 0% instances), en-dep/case (4; 0% instances), en-dep/mark (4; 0% instances), en-dep/compound (3; 0% instances), en-dep/amod (2; 0% instances), en-dep/conj (2; 0% instances), en-dep/appos (1; 0% instances), en-dep/cc (1; 0% instances), en-dep/ccomp (1; 0% instances), en-dep/mwe (1; 0% instances)
Parents of DET
nodes belong to 14 different parts of speech: NOUN (8222; 91% instances), VERB (350; 4% instances), PROPN (180; 2% instances), ADJ (146; 2% instances), ADV (85; 1% instances), PRON (44; 0% instances), NUM (21; 0% instances), ROOT (7; 0% instances), DET (4; 0% instances), SYM (4; 0% instances), ADP (2; 0% instances), AUX (1; 0% instances), INTJ (1; 0% instances), PART (1; 0% instances)
8866 (98%) DET
nodes are leaves.
181 (2%) DET
nodes have one child.
12 (0%) DET
nodes have two children.
9 (0%) DET
nodes have three or more children.
The highest child degree of a DET
node is 8.
Children of DET
nodes are attached using 20 different relations: en-dep/case (139; 57% instances), en-dep/nmod (47; 19% instances), en-dep/acl:relcl (10; 4% instances), en-dep/advmod (9; 4% instances), en-dep/punct (9; 4% instances), en-dep/cop (6; 2% instances), en-dep/nsubj (6; 2% instances), en-dep/cc (3; 1% instances), en-dep/amod (2; 1% instances), en-dep/aux (2; 1% instances), en-dep/conj (2; 1% instances), en-dep/det:predet (2; 1% instances), en-dep/neg (2; 1% instances), en-dep/advcl (1; 0% instances), en-dep/appos (1; 0% instances), en-dep/det (1; 0% instances), en-dep/goeswith (1; 0% instances), en-dep/mark (1; 0% instances), en-dep/mwe (1; 0% instances), en-dep/nmod:tmod (1; 0% instances)
Children of DET
nodes belong to 14 different parts of speech: ADP (139; 57% instances), PRON (28; 11% instances), NOUN (27; 11% instances), VERB (19; 8% instances), PUNCT (9; 4% instances), ADV (7; 3% instances), DET (4; 2% instances), ADJ (3; 1% instances), CONJ (3; 1% instances), AUX (2; 1% instances), PART (2; 1% instances), PROPN (1; 0% instances), SCONJ (1; 0% instances), X (1; 0% instances)
Treebank Statistics (UD_English-LinES)
There are 1 DET
lemmas (6%), 23 DET
types (0%) and 6429 DET
tokens (10%).
Out of 17 observed tags, the rank of DET
is: 6 in number of lemmas, 11 in number of types and 5 in number of tokens.
The 10 most frequent DET
lemmas: _
The 10 most frequent DET
types: the, a, an, this, that, no, all, some, any, these
The 10 most frequent ambiguous lemmas: _ (NOUN 12161, PUNCT 8085, VERB 8020, ADP 6788, DET 6429, PRON 6303, ADJ 4270, ADV 3700, AUX 3539, PROPN 2257, CCONJ 2081, PART 1703, SCONJ 1231, NUM 462, INTJ 122, X 41, SYM 5)
The 10 most frequent ambiguous types: the (DET 3471, ADV 1), a (DET 1562, ADV 4, PRON 1), this (DET 130, PRON 65), that (SCONJ 505, DET 99, PRON 88), no (DET 82, ADV 19, INTJ 7, PRON 7), all (DET 79, PRON 75, ADV 35, ADP 1), some (DET 62, PRON 13, ADV 3), any (DET 59, PRON 5, ADV 4), these (DET 45, PRON 4), each (DET 26, PRON 19)
- the
- a
- this
- that
- no
- all
- some
- any
- these
- each
Morphology
The form / lemma ratio of DET
is 23.000000 (the average of all parts of speech is 527.705882).
The 1st highest number of forms (23) was observed with the lemma “_”: La, Le, a, all, an, any, both, du, each, either, every, no, one, some, that, the, these, this, those, what, whatever, which, whose.
DET
does not occur with any features.
Relations
DET
nodes are attached to their parents using 5 different relations: en-dep/det (6344; 99% instances), en-dep/advmod (75; 1% instances), en-dep/amod (6; 0% instances), en-dep/obj (3; 0% instances), en-dep/appos (1; 0% instances)
Parents of DET
nodes belong to 13 different parts of speech: NOUN (6005; 93% instances), ADJ (131; 2% instances), PROPN (123; 2% instances), VERB (62; 1% instances), PRON (57; 1% instances), ADV (15; 0% instances), NUM (14; 0% instances), DET (9; 0% instances), ADP (4; 0% instances), PUNCT (4; 0% instances), X (3; 0% instances), INTJ (1; 0% instances), SYM (1; 0% instances)
6382 (99%) DET
nodes are leaves.
40 (1%) DET
nodes have one child.
4 (0%) DET
nodes have two children.
3 (0%) DET
nodes have three or more children.
The highest child degree of a DET
node is 3.
Children of DET
nodes are attached using 13 different relations: en-dep/advmod (13; 23% instances), en-dep/fixed (12; 21% instances), en-dep/det (8; 14% instances), en-dep/mark (6; 11% instances), en-dep/nsubj (4; 7% instances), en-dep/punct (4; 7% instances), en-dep/amod (3; 5% instances), en-dep/aux (2; 4% instances), en-dep/acl (1; 2% instances), en-dep/advcl (1; 2% instances), en-dep/appos (1; 2% instances), en-dep/cc (1; 2% instances), en-dep/obj (1; 2% instances)
Children of DET
nodes belong to 13 different parts of speech: ADJ (11; 19% instances), ADV (11; 19% instances), DET (9; 16% instances), ADP (5; 9% instances), PUNCT (4; 7% instances), SCONJ (4; 7% instances), NOUN (3; 5% instances), PRON (3; 5% instances), AUX (2; 4% instances), PART (2; 4% instances), CCONJ (1; 2% instances), PROPN (1; 2% instances), VERB (1; 2% instances)
Treebank Statistics (UD_English-ParTUT)
There are 37 DET
lemmas (1%), 39 DET
types (1%) and 4015 DET
tokens (11%).
Out of 17 observed tags, the rank of DET
is: 10 in number of lemmas, 10 in number of types and 4 in number of tokens.
The 10 most frequent DET
lemmas: the, a, this, his, their, its, any, us, that, all
The 10 most frequent DET
types: the, a, his, this, an, their, its, any, these, our
The 10 most frequent ambiguous lemmas: a (DET 758, X 1), this (DET 228, PRON 67), his (DET 215, PRON 4), any (DET 58, ADV 1), us (DET 47, PRON 26), that (SCONJ 246, PRON 169, DET 43, ADJ 2), all (DET 38, PRON 36, ADV 1), no (DET 38, ADV 7), you (PRON 82, DET 31), some (DET 27, PRON 17)
The 10 most frequent ambiguous types: a (DET 626, X 1), his (DET 190, PRON 4), this (DET 150, PRON 35), any (DET 56, ADV 1), these (DET 43, PRON 5), our (DET 42, PRON 1), all (DET 36, PRON 28, ADV 1), no (DET 27, ADV 7), some (DET 20, PRON 15), such (DET 20, ADJ 18, ADP 4)
- a
- DET 626: ( The House rose and observed a minute ‘s silence ) .
- X 1: It merely prolongs transitional rules by postponing deadlines , deletes provisions which are no longer applicable , and lays down the procedures for a ) carrying out the ad hoc transportation of dangerous goods and b ) enacting less stringent national regulations , in particular for the transport of very small amounts of dangerous goods within strictly defined local areas .
- his
- this
- any
- DET 56: Adjust your daily budget at any time .
- ADV 1: In this context , I should like to make a request and ask the Commissioner responsible , who is with us here today , to table an appropriate text as soon as possible with a view to continuing to make it safer for traffic to transit tunnels in the future , so that we in Europe do not have to experience any more such disasters on this scale .
- these
- our
- all
- DET 36: In fact , all hell broke loose in some municipalities in my province .
- PRON 28: Mr Berenguer Fuster , we shall check all this .
- ADV 1: In 1623 , John Heminges and Henry Condell , two friends and fellow actors of Shakespeare , published the First Folio , a collected edition of his dramatic works that included all but two of the plays now recognised as Shakespeare ‘s .
- no
- some
- such
Morphology
The form / lemma ratio of DET
is 1.054054 (the average of all parts of speech is 1.187751).
The 1st highest number of forms (2) was observed with the lemma “a”: a, an.
The 2nd highest number of forms (2) was observed with the lemma “le”: Le, les.
The 3rd highest number of forms (2) was observed with the lemma “that”: that, those.
DET
occurs with 5 features: en-feat/PronType (4015; 100% instances), en-feat/Definite (2989; 74% instances), en-feat/Number (1178; 29% instances), en-feat/Poss (514; 13% instances), en-feat/Gender (5; 0% instances)
DET
occurs with 13 feature-value pairs: Definite=Def
, Definite=Ind
, Gender=Fem
, Number=Plur
, Number=Sing
, Poss=Yes
, PronType=Art
, PronType=Dem
, PronType=Ind
, PronType=Int
, PronType=Prs
, PronType=Rel
, PronType=Tot
DET
occurs with 20 feature combinations.
The most frequent feature combination is Definite=Def|PronType=Art
(2206 tokens).
Examples: the, ’s
Relations
DET
nodes are attached to their parents using 12 different relations: en-dep/det (3469; 86% instances), en-dep/nmod:poss (513; 13% instances), en-dep/det:predet (11; 0% instances), en-dep/fixed (6; 0% instances), en-dep/nmod (6; 0% instances), en-dep/obl (3; 0% instances), en-dep/nsubj (2; 0% instances), en-dep/advmod (1; 0% instances), en-dep/iobj (1; 0% instances), en-dep/obj (1; 0% instances), en-dep/parataxis (1; 0% instances), en-dep/root (1; 0% instances)
Parents of DET
nodes belong to 13 different parts of speech: NOUN (3800; 95% instances), PROPN (124; 3% instances), ADJ (26; 1% instances), PRON (21; 1% instances), NUM (14; 0% instances), VERB (9; 0% instances), X (7; 0% instances), ADP (5; 0% instances), ADV (4; 0% instances), SYM (2; 0% instances), DET (1; 0% instances), INTJ (1; 0% instances), ROOT (1; 0% instances)
3995 (100%) DET
nodes are leaves.
15 (0%) DET
nodes have one child.
4 (0%) DET
nodes have two children.
1 (0%) DET
nodes have three or more children.
The highest child degree of a DET
node is 8.
Children of DET
nodes are attached using 12 different relations: en-dep/case (8; 26% instances), en-dep/nmod (4; 13% instances), en-dep/punct (4; 13% instances), en-dep/advmod (3; 10% instances), en-dep/fixed (3; 10% instances), en-dep/aux (2; 6% instances), en-dep/conj (2; 6% instances), en-dep/advcl (1; 3% instances), en-dep/cop (1; 3% instances), en-dep/goeswith (1; 3% instances), en-dep/mark (1; 3% instances), en-dep/nsubj (1; 3% instances)
Children of DET
nodes belong to 11 different parts of speech: ADP (8; 26% instances), ADV (5; 16% instances), PUNCT (4; 13% instances), AUX (3; 10% instances), NOUN (3; 10% instances), ADJ (2; 6% instances), PRON (2; 6% instances), DET (1; 3% instances), SCONJ (1; 3% instances), VERB (1; 3% instances), X (1; 3% instances)
DET in other languages: [am] [ar] [bg] [bxr] [ca] [ckb] [cop] [cs] [cu] [da] [de] [el] [en] [es] [et] [eu] [fa] [fi] [fo] [fr] [ga] [gl] [got] [grc] [he] [hi] [hr] [hu] [id] [it] [ja] [kk] [kmr] [ko] [la] [lv] [mr] [nl] [no] [pl] [pt] [ro] [ru] [sa] [sk] [sla] [sl] [so] [sr] [sv] [swl] [ta] [tr] [ug] [uk] [u] [urj] [ur] [vi] [yue] [zh]