Treebank Statistics: UD_Czech: Features: Foreign
This feature is universal.
It occurs with 1 different values: Yes.
9317 tokens (1%) have a non-empty value of Foreign.
3670 types (3%) occur at least once with a non-empty value of Foreign.
3487 lemmas (6%) occur at least once with a non-empty value of Foreign.
The feature is used with 13 part-of-speech tags: PROPN (3684; 0% instances), ADJ (2670; 0% instances), NOUN (1813; 0% instances), ADP (592; 0% instances), PART (120; 0% instances), VERB (119; 0% instances), ADV (116; 0% instances), CCONJ (80; 0% instances), PRON (62; 0% instances), NUM (29; 0% instances), DET (18; 0% instances), SCONJ (8; 0% instances), INTJ (6; 0% instances).
PROPN
3684 PROPN tokens (4% of all PROPN tokens) have a non-empty value of Foreign.
The most frequent other feature values with which PROPN and Foreign co-occurred: Polarity=Pos (3684; 100%), Case=EMPTY (2905; 79%), Abbr=EMPTY (2670; 72%), NameType=Com (2512; 68%), Animacy=EMPTY (2259; 61%), Number=EMPTY (2177; 59%).
PROPN tokens may have the following values of Foreign:
Yes(3684; 100% of non-emptyForeign): HZDS, IRA, Floyd, Nature, International, Science, Sinn, Fein, Times, CupEMPTY(80347): Praha, ČR, Praze, LN, ODS, USA, J, Jiří, Jan, OSN
Foreign seems to be lexical feature of PROPN. 100% lemmas (1422) occur only with one value of Foreign.
ADJ
2670 ADJ tokens (1% of all ADJ tokens) have a non-empty value of Foreign.
The most frequent other feature values with which ADJ and Foreign co-occurred: Polarity=Pos (2666; 100%), Degree=Pos (2655; 99%), Animacy=EMPTY (2571; 96%), Case=EMPTY (2546; 95%), Number=EMPTY (2447; 92%), Gender=EMPTY (2439; 91%).
ADJ tokens may have the following values of Foreign:
Yes(2670; 100% of non-emptyForeign): New, the, open, US, Pink, la, Le, Deutsche, die, UnitedEMPTY(186515): první, další, české, nové, druhé, poslední, státní, dalších, možné, vlastní
Foreign seems to be lexical feature of ADJ. 100% lemmas (1003) occur only with one value of Foreign.
NOUN
1813 NOUN tokens (0% of all NOUN tokens) have a non-empty value of Foreign.
The most frequent other feature values with which NOUN and Foreign co-occurred: Polarity=Pos (1812; 100%), Case=EMPTY (1250; 69%), Animacy=EMPTY (1015; 56%), Number=EMPTY (975; 54%).
NOUN tokens may have the following values of Foreign:
Yes(1813; 100% of non-emptyForeign): play, managementu, management, CD, s, facto, st, o, homo, neemEMPTY(370553): roku, korun, let, roce, strany, procent, společnosti, době, případě, firmy
Foreign seems to be lexical feature of NOUN. 100% lemmas (945) occur only with one value of Foreign.
ADP
592 ADP tokens (0% of all ADP tokens) have a non-empty value of Foreign.
The most frequent other feature values with which ADP and Foreign co-occurred: AdpType=Prep (592; 100%), Case=EMPTY (353; 60%).
ADP tokens may have the following values of Foreign:
Yes(592; 100% of non-emptyForeign): de, of, di, van, in, von, versus, ad, Pro, toEMPTY(145351): v, na, o, z, s, do, ve, k, pro, za
Foreign seems to be lexical feature of ADP. 100% lemmas (55) occur only with one value of Foreign.
PART
120 PART tokens (1% of all PART tokens) have a non-empty value of Foreign.
PART tokens may have the following values of Foreign:
Yes(120; 100% of non-emptyForeign): off, džambo, not, t, oui, Bienvenue, So, ne, sorry, vivaEMPTY(8045): jen, až, asi, li, ne, nejen, prý, jenom, ano, bohužel
Foreign seems to be lexical feature of PART. 100% lemmas (28) occur only with one value of Foreign.
VERB
119 VERB tokens (0% of all VERB tokens) have a non-empty value of Foreign.
The most frequent other feature values with which VERB and Foreign co-occurred: Aspect=EMPTY (119; 100%), Polarity=Pos (113; 95%), Gender=EMPTY (112; 94%), Person=EMPTY (65; 55%), Tense=EMPTY (62; 52%), Voice=EMPTY (62; 52%), Mood=EMPTY (60; 50%).
VERB tokens may have the following values of Foreign:
Yes(119; 100% of non-emptyForeign): is, Be, can, est, transit, Check, Come, Habent, Keep, LoveEMPTY(135391): má, je, může, řekl, měl, mají, musí, jde, měla, jsou
Foreign seems to be lexical feature of VERB. 100% lemmas (85) occur only with one value of Foreign.
ADV
116 ADV tokens (0% of all ADV tokens) have a non-empty value of Foreign.
The most frequent other feature values with which ADV and Foreign co-occurred: PronType=EMPTY (114; 98%), Polarity=EMPTY (107; 92%), Degree=EMPTY (107; 92%).
ADV tokens may have the following values of Foreign:
Yes(116; 100% of non-emptyForeign): cca, priori, Today, live, Here, Only, Sic, Very, dove, echtEMPTY(79881): tak, už, také, jak, včera, ještě, již, tedy, dnes, pak
Foreign seems to be lexical feature of ADV. 100% lemmas (71) occur only with one value of Foreign.
CCONJ
80 CCONJ tokens (0% of all CCONJ tokens) have a non-empty value of Foreign.
CCONJ tokens may have the following values of Foreign:
Yes(80; 100% of non-emptyForeign): and, et, und, As, or, ma, So, e, nEMPTY(56777): a, i, ale, však, nebo, ani, či, proto, až, ovšem
PRON
62 PRON tokens (0% of all PRON tokens) have a non-empty value of Foreign.
The most frequent other feature values with which PRON and Foreign co-occurred: PrepCase=EMPTY (62; 100%), Variant=EMPTY (61; 98%), Reflex=EMPTY (61; 98%), Gender=EMPTY (45; 73%), PronType=Prs (42; 68%), Number=Sing (32; 52%).
PRON tokens may have the following values of Foreign:
Yes(62; 100% of non-emptyForeign): it, All, you, I, Me, We, Us, She, Some, WASEMPTY(44863): se, si, co, nás, je, nám, nich, kdo, což, mu
Foreign seems to be lexical feature of PRON. 100% lemmas (23) occur only with one value of Foreign.
NUM
29 NUM tokens (0% of all NUM tokens) have a non-empty value of Foreign.
The most frequent other feature values with which NUM and Foreign co-occurred: NumForm=Word (29; 100%), Gender=EMPTY (29; 100%), NumType=Card (29; 100%), Case=EMPTY (26; 90%), NumValue=1,2,3 (24; 83%), Number=Plur (22; 76%).
NUM tokens may have the following values of Foreign:
Yes(29; 100% of non-emptyForeign): Four, Twenty, Seven, Six, one, Five, Three, Tre, Tri, seděmEMPTY(41478): 1, 2, 3, dva, tři, 4, jeden, 6, dvě, tisíc
Foreign seems to be lexical feature of NUM. 100% lemmas (12) occur only with one value of Foreign.
DET
18 DET tokens (0% of all DET tokens) have a non-empty value of Foreign.
The most frequent other feature values with which DET and Foreign co-occurred: Animacy=EMPTY (17; 94%), Number[psor]=EMPTY (14; 78%), Gender=EMPTY (13; 72%), Case=EMPTY (13; 72%), Person=EMPTY (12; 67%), Poss=EMPTY (10; 56%).
DET tokens may have the following values of Foreign:
Yes(18; 100% of non-emptyForeign): My, That, This, Your, sua, C, Notre, Some, These, ceEMPTY(56447): to, které, který, jeho, která, jejich, své, tím, kteří, tom
Foreign seems to be lexical feature of DET. 100% lemmas (13) occur only with one value of Foreign.
SCONJ
8 SCONJ tokens (0% of all SCONJ tokens) have a non-empty value of Foreign.
SCONJ tokens may have the following values of Foreign:
Yes(8; 100% of non-emptyForeign): as, If, When, ak, ako, gdyž, kakEMPTY(27710): že, jako, aby, než, když, pokud, protože, zda, jak, zatímco
INTJ
6 INTJ tokens (5% of all INTJ tokens) have a non-empty value of Foreign.
INTJ tokens may have the following values of Foreign:
Yes(6; 100% of non-emptyForeign): O, propos, Bang, Boom, CrashEMPTY(107): PA, Pink, ach, Inu, hle, proboha, Haló, což, fajn, Ó
Relations with Agreement in Foreign
The 10 most frequent relations where parent and child node agree in Foreign:
PROPN –[flat:foreign]–> ADJ (920; 100%),
NOUN –[flat:foreign]–> ADJ (594; 100%),
PROPN –[flat:foreign]–> PROPN (286; 99%),
NOUN –[flat:foreign]–> NOUN (163; 99%),
ADJ –[flat:foreign]–> ADJ (139; 100%),
NOUN –[flat:foreign]–> ADP (127; 100%),
ADJ –[flat:foreign]–> PROPN (96; 100%),
NOUN –[flat:foreign]–> PART (51; 100%),
ADJ –[flat:foreign]–> NOUN (40; 100%),
NOUN –[flat:foreign]–> PROPN (27; 87%).