Treebank Statistics: UD_Czech: Features: NameType
This feature is language-specific.
It occurs with 7 different values: Com, Geo, Giv, Nat, Oth, Pro, Sur.
Some words have combined values of the feature; 21 combinations have been observed: Com|Geo, Com|Giv, Com|Giv|Sur, Com|Nat, Com|Oth, Com|Pro, Com|Pro|Sur, Com|Sur, Geo|Giv, Geo|Giv|Sur, Geo|Oth, Geo|Pro, Geo|Sur, Giv|Nat, Giv|Oth, Giv|Pro, Giv|Pro|Sur, Giv|Sur, Nat|Sur, Oth|Sur, Pro|Sur.
88937 tokens (6%) have a non-empty value of NameType.
24371 types (19%) occur at least once with a non-empty value of NameType.
17008 lemmas (29%) occur at least once with a non-empty value of NameType.
The feature is used with 11 part-of-speech tags: PROPN (84031; 6% instances), ADJ (4756; 0% instances), ADP (71; 0% instances), NUM (20; 0% instances), ADV (17; 0% instances), VERB (13; 0% instances), PRON (12; 0% instances), PART (8; 0% instances), DET (4; 0% instances), INTJ (4; 0% instances), CCONJ (1; 0% instances).
PROPN
84031 PROPN tokens (100% of all PROPN tokens) have a non-empty value of NameType.
The most frequent other feature values with which PROPN and NameType co-occurred: Polarity=Pos (84031; 100%), Abbr=EMPTY (70989; 84%), Number=Sing (63182; 75%), Gender=Masc (48949; 58%).
PROPN tokens may have the following values of NameType:
Com(12393; 15% of non-emptyNameType): ODS, OSN, ODA, ČSSD, NATO, Sparta, ČT, HZDS, EU, FSCom,Geo(46; 0% of non-emptyNameType): Chelsea, Bergen, Europe, Kladno, Prague, Aral, Bay, California, Canada, DeutschlandCom,Giv(34; 0% of non-emptyNameType): KOVO, Kovo, Konstruktiva, Poldi, Fiorentina, Michael, Ringo, Světozor, Kovohutě, NšočiCom,Giv,Sur(1; 0% of non-emptyNameType): WinstonCom,Nat(5; 0% of non-emptyNameType): Jihlavanu, JihlavanCom,Pro(34; 0% of non-emptyNameType): Bild, Canon, Fiat, Honda, Fiatu, Canonu, Hondy, CANON, Fiaty, HONDACom,Sur(44; 0% of non-emptyNameType): Benetton, Benettonu, Mates, Winston, Maxwell, Biederstein, Bradstreet, Daimler, Dohme, DunGeo(26520; 32% of non-emptyNameType): Praha, ČR, Praze, USA, Evropy, Brno, Prahy, ČSFR, Evropě, NěmeckuGeo,Giv(31; 0% of non-emptyNameType): Amos, Gyula, Gyuly, Karin, Alma, AMOS, Amosem, Gyulu, José, JosémuGeo,Giv,Sur(18; 0% of non-emptyNameType): Butrus, Butruse, Keith, KozákGeo,Oth(1; 0% of non-emptyNameType): SaturnGeo,Pro(2; 0% of non-emptyNameType): Mountain, RENOVAGeo,Sur(241; 0% of non-emptyNameType): Breda, Paisley, Petrov, Wallis, Powell, Bihače, Wallise, Warren, Lichtenbergu, LomGiv(15099; 18% of non-emptyNameType): J, Jiří, Jan, Václav, Jana, Petr, M, Josef, Pavel, VladimírGiv,Nat(3; 0% of non-emptyNameType): HunGiv,Oth(5; 0% of non-emptyNameType): Miranda, David, John, MIRANDYGiv,Pro(1; 0% of non-emptyNameType): PascalGiv,Pro,Sur(1; 0% of non-emptyNameType): FigaroGiv,Sur(139; 0% of non-emptyNameType): Perry, Perryho, Charlie, Diega, Othello, Diego, Ricardo, Rút, Heřman, JohanNat(2286; 3% of non-emptyNameType): Němci, Češi, Němců, Američané, američan, Slováci, Srbové, Rusové, Srby, ČechůNat,Sur(7; 0% of non-emptyNameType): Uher, Maye, UHEROth(555; 1% of non-emptyNameType): PVP, Prix, Tour, ECU, Garden, München, line, Rapaportu, VC, AgePro(2054; 2% of non-emptyNameType): LN, MF, PC, Škoda, mercedes, favorit, Mir, ford, polo, WeltPro,Sur(25; 0% of non-emptyNameType): Kozel, Stock, Burda, Johnnie, Hornet, Walker, WalkeremSur(24486; 29% of non-emptyNameType): Klaus, Havel, Klause, Svoboda, Mečiar, Havla, Jelcin, John, Zeman, Němec
| Paradigm Paris | Com | Geo | Giv | Oth | Pro | Sur |
|---|---|---|---|---|---|---|
| Animacy=Anim|Case=Acc|Gender=Masc|Number=Sing | Parise | |||||
| Animacy=Anim|Case=Nom|Gender=Masc|Number=Sing | Paris | |||||
| Foreign=Yes|Gender=Fem|Number=Sing | Paris | |||||
| Foreign=Yes|Gender=Fem | Paris | |||||
| Gender=Fem | Paris | Paris | ||||
| Paris |
NameType seems to be lexical feature of PROPN. 96% lemmas (14763) occur only with one value of NameType.
ADJ
4756 ADJ tokens (3% of all ADJ tokens) have a non-empty value of NameType.
The most frequent other feature values with which ADJ and NameType co-occurred: Animacy=EMPTY (3630; 76%), Polarity=Pos (2450; 52%), Degree=Pos (2437; 51%).
ADJ tokens may have the following values of NameType:
Com(1199; 25% of non-emptyNameType): RM, Pink, K, J, Deutsche, United, Die, I, U, BritishCom,Geo(17; 0% of non-emptyNameType): York, Covent, Abbey, Amsterdam, Bradford, Brooklyn, Louis, New, Oak, RidgeCom,Giv(1; 0% of non-emptyNameType): KonrádCom,Oth(8; 0% of non-emptyNameType): Al, Black, Box, MuteCom,Pro(2; 0% of non-emptyNameType): Apple, MicrosoftCom,Pro,Sur(1; 0% of non-emptyNameType): SunCom,Sur(10; 0% of non-emptyNameType): Gordon, Binder, Cocteau, Goethe, Mandel, Rambert, Random, Warner, WellesleyGeo(733; 15% of non-emptyNameType): New, Č, Flushing, Los, San, Tchaj, Horní, Devils, Twin, BuenosGeo,Giv(3; 0% of non-emptyNameType): Karl, KarlovyGeo,Oth(1; 0% of non-emptyNameType): SalemGeo,Pro(4; 0% of non-emptyNameType): York, Denver, WashingtonGeo,Sur(11; 0% of non-emptyNameType): Marx, Špindlerově, Lounských, Powellovo, Powellovy, Powellových, Santa, Spenglerův, WallisověGiv(290; 6% of non-emptyNameType): Karlovy, Karlových, Karlova, Karlově, Heinrichovy, Janova, Jindřichově, Heinrichových, Jindřichův, JežíšovaGiv,Sur(28; 1% of non-emptyNameType): Eukleidových, Eukleidovy, Damoklův, Heřmanův, Alláhovým, Berijova, Eukleidova, Eukleidově, Franckův, HésiodovyNat(10; 0% of non-emptyNameType): Američanovy, Američanův, Australanovo, Brazilcovy, Florenťanův, Indian, Irův, Němcův, Pražákovo, TaliánůvOth(222; 5% of non-emptyNameType): US, New, Made, Sex, al, Australian, French, Miranda, Inspiral, MuteOth,Sur(1; 0% of non-emptyNameType): SheaPro(167; 4% of non-emptyNameType): Financial, coca, Super, Chem, Eng, Prágai, Wyborcza, der, pepsi, MagyarSur(2048; 43% of non-emptyNameType): Milíčova, Masarykově, Benešových, Schrödingerova, Casimirův, Klausův, Masarykova, Mečiarova, Benešovy, Janáčkovy
| Paradigm New | Com,Geo | Geo | Oth |
|---|---|---|---|
| New | New, NEW | New |
NameType seems to be lexical feature of ADJ. 95% lemmas (1730) occur only with one value of NameType.
ADP
71 ADP tokens (0% of all ADP tokens) have a non-empty value of NameType.
The most frequent other feature values with which ADP and NameType co-occurred: AdpType=Prep (71; 100%), Case=EMPTY (70; 99%).
ADP tokens may have the following values of NameType:
Com(16; 23% of non-emptyNameType): Pro, PRO, dei, des, poGeo(3; 4% of non-emptyNameType): Unter, del, ÁthaGeo,Giv,Sur(35; 49% of non-emptyNameType): diOth(6; 8% of non-emptyNameType): for, Into, Pour, Pro, ToPro(10; 14% of non-emptyNameType): ex, della, QuantumSur(1; 1% of non-emptyNameType): zum
| Paradigm Pro | Com | Oth |
|---|---|---|
| Pro, PRO | Pro |
NameType seems to be lexical feature of ADP. 94% lemmas (15) occur only with one value of NameType.
NUM
20 NUM tokens (0% of all NUM tokens) have a non-empty value of NameType.
The most frequent other feature values with which NUM and NameType co-occurred: NumForm=Word (20; 100%), Gender=EMPTY (20; 100%), NumType=Card (20; 100%), NumValue=1,2,3 (19; 95%), Case=EMPTY (19; 95%), Number=Plur (19; 95%).
NUM tokens may have the following values of NameType:
Com(20; 100% of non-emptyNameType): Four, Seven, Twenty, Six, Tre
ADV
17 ADV tokens (0% of all ADV tokens) have a non-empty value of NameType.
The most frequent other feature values with which ADV and NameType co-occurred: PronType=EMPTY (17; 100%), Degree=EMPTY (14; 82%), Polarity=EMPTY (14; 82%).
ADV tokens may have the following values of NameType:
Com(5; 29% of non-emptyNameType): More, Nahoru, dolů, achšavOth(7; 41% of non-emptyNameType): COSI, Down, How, Live, So, Up, WhyPro(5; 29% of non-emptyNameType): Ahead, Inside, Live, Today, Weekly
| Paradigm Live | Oth | Pro |
|---|---|---|
| Degree=Pos|Polarity=Pos | Live | |
| Live |
NameType seems to be lexical feature of ADV. 93% lemmas (14) occur only with one value of NameType.
VERB
13 VERB tokens (0% of all VERB tokens) have a non-empty value of NameType.
The most frequent other feature values with which VERB and NameType co-occurred: Polarity=Pos (13; 100%), Aspect=EMPTY (13; 100%), Gender=EMPTY (13; 100%), Person=EMPTY (8; 62%), Number=EMPTY (8; 62%), Voice=Act (7; 54%), Mood=EMPTY (7; 54%).
VERB tokens may have the following values of NameType:
Com(2; 15% of non-emptyNameType): Can, DanceOth(9; 69% of non-emptyNameType): Porter, Can, Comes, FAN, Feels, Said, Takes, WantPro(2; 15% of non-emptyNameType): Check, Lean
| Paradigm Can | Com | Oth |
|---|---|---|
| Mood=Ind|Tense=Pres|VerbForm=Fin|Voice=Act | Can | |
| VerbForm=Inf | Can |
NameType seems to be lexical feature of VERB. 91% lemmas (10) occur only with one value of NameType.
PRON
12 PRON tokens (0% of all PRON tokens) have a non-empty value of NameType.
The most frequent other feature values with which PRON and NameType co-occurred: Reflex=EMPTY (12; 100%), Variant=EMPTY (12; 100%), PrepCase=EMPTY (12; 100%), Gender=EMPTY (8; 67%), Person=EMPTY (7; 58%), Number=Sing (7; 58%), PronType=Tot (7; 58%).
PRON tokens may have the following values of NameType:
Com(4; 33% of non-emptyNameType): AllOth(4; 33% of non-emptyNameType): All, Everything, YouPro(4; 33% of non-emptyNameType): Ty, It, man
| Paradigm All | Com | Oth |
|---|---|---|
| Case=Acc|Gender=Neut|Number=Sing | All | |
| All |
PART
8 PART tokens (0% of all PART tokens) have a non-empty value of NameType.
PART tokens may have the following values of NameType:
Com(2; 25% of non-emptyNameType): Non, weOth(5; 63% of non-emptyNameType): L, Not, at, el, tSur(1; 13% of non-emptyNameType): ka
DET
4 DET tokens (0% of all DET tokens) have a non-empty value of NameType.
The most frequent other feature values with which DET and NameType co-occurred: Animacy=EMPTY (4; 100%), Case=EMPTY (3; 75%), Poss=Yes (3; 75%), Number[psor]=EMPTY (3; 75%), PronType=Prs (3; 75%).
DET tokens may have the following values of NameType:
Oth(2; 50% of non-emptyNameType): Notre, ThisPro(2; 50% of non-emptyNameType): Your
INTJ
4 INTJ tokens (4% of all INTJ tokens) have a non-empty value of NameType.
INTJ tokens may have the following values of NameType:
Com(1; 25% of non-emptyNameType): HaloOth(3; 75% of non-emptyNameType): Bang, Boom, Crash
CCONJ
1 CCONJ tokens (0% of all CCONJ tokens) have a non-empty value of NameType.
CCONJ tokens may have the following values of NameType:
Com(1; 100% of non-emptyNameType): und
Relations with Agreement in NameType
The 10 most frequent relations where parent and child node agree in NameType:
PROPN –[conj]–> PROPN (5712; 87%),
PROPN –[flat:foreign]–> ADJ (661; 72%),
PROPN –[flat:foreign]–> PROPN (246; 85%),
PROPN –[orphan]–> PROPN (137; 79%),
ADJ –[flat:foreign]–> PROPN (84; 88%),
ADJ –[amod]–> ADJ (51; 61%),
ADJ –[conj]–> ADJ (50; 76%),
PROPN –[nsubj]–> PROPN (12; 55%),
PROPN –[xcomp]–> PROPN (10; 67%),
PROPN –[cc]–> PROPN (5; 56%).