Treebank Statistics: UD_Czech: Features: Abbr
This feature is universal.
It occurs with 1 different values: Yes
.
21743 tokens (1%) have a non-empty value of Abbr
.
1755 types (1%) occur at least once with a non-empty value of Abbr
.
1806 lemmas (3%) occur at least once with a non-empty value of Abbr
.
The feature is used with 10 part-of-speech tags: PROPN (13042; 1% instances), NOUN (5768; 0% instances), ADJ (1714; 0% instances), ADV (956; 0% instances), CCONJ (182; 0% instances), ADP (23; 0% instances), VERB (22; 0% instances), DET (21; 0% instances), X (12; 0% instances), PART (3; 0% instances).
PROPN
13042 PROPN tokens (16% of all PROPN
tokens) have a non-empty value of Abbr
.
The most frequent other feature values with which PROPN
and Abbr
co-occurred: Polarity=Pos (13042; 100%), Case=EMPTY (13010; 100%), Number=EMPTY (12219; 94%), Animacy=EMPTY (9687; 74%), Gender=Fem (6911; 53%), NameType=Com (6803; 52%).
PROPN
tokens may have the following values of Abbr
:
Yes
(13042; 100% of non-emptyAbbr
): ČR, LN, ODS, J, OSN, ODA, M, ČSFR, V, AEMPTY
(70989): Praha, Praze, USA, Jiří, Jan, Evropy, Brno, Prahy, Václav, Jana
Abbr
seems to be lexical feature of PROPN
. 100% lemmas (1236) occur only with one value of Abbr
.
NOUN
5768 NOUN tokens (2% of all NOUN
tokens) have a non-empty value of Abbr
.
The most frequent other feature values with which NOUN
and Abbr
co-occurred: Polarity=Pos (5768; 100%), Case=EMPTY (5608; 97%), Number=EMPTY (5538; 96%), Gender=Masc (3042; 53%).
NOUN
tokens may have the following values of Abbr
:
Yes
(5768; 100% of non-emptyAbbr
): r, s, tel, m, č, km, MS, mil, Kčs, cmEMPTY
(366598): roku, korun, let, roce, strany, procent, společnosti, době, případě, firmy
Abbr
seems to be lexical feature of NOUN
. 100% lemmas (485) occur only with one value of Abbr
.
ADJ
1714 ADJ tokens (1% of all ADJ
tokens) have a non-empty value of Abbr
.
The most frequent other feature values with which ADJ
and Abbr
co-occurred: Polarity=Pos (1714; 100%), Animacy=EMPTY (1713; 100%), Degree=Pos (1705; 99%), Case=EMPTY (1601; 93%), Number=EMPTY (1601; 93%), Gender=EMPTY (1598; 93%).
ADJ
tokens may have the following values of Abbr
:
Yes
(1714; 100% of non-emptyAbbr
): tzv, a, čs, o, sv, RM, US, Č, n, kEMPTY
(187471): první, další, české, nové, druhé, poslední, státní, dalších, možné, vlastní
Abbr
seems to be lexical feature of ADJ
. 100% lemmas (185) occur only with one value of Abbr
.
ADV
956 ADV tokens (1% of all ADV
tokens) have a non-empty value of Abbr
.
The most frequent other feature values with which ADV
and Abbr
co-occurred: Polarity=EMPTY (956; 100%), Degree=EMPTY (956; 100%), PronType=EMPTY (955; 100%).
ADV
tokens may have the following values of Abbr
:
Yes
(956; 100% of non-emptyAbbr
): např, mj, apod, atd, resp, atp, popř, cca, ap, kupřEMPTY
(79041): tak, už, také, jak, včera, ještě, již, tedy, dnes, pak
Abbr
seems to be lexical feature of ADV
. 100% lemmas (22) occur only with one value of Abbr
.
CCONJ
182 CCONJ tokens (0% of all CCONJ
tokens) have a non-empty value of Abbr
.
CCONJ
tokens may have the following values of Abbr
:
Yes
(182; 100% of non-emptyAbbr
): tj, nEMPTY
(56675): a, i, ale, však, nebo, ani, či, proto, až, ovšem
ADP
23 ADP tokens (0% of all ADP
tokens) have a non-empty value of Abbr
.
The most frequent other feature values with which ADP
and Abbr
co-occurred: AdpType=Prep (23; 100%), Case=Ins (16; 70%).
ADP
tokens may have the following values of Abbr
:
Yes
(23; 100% of non-emptyAbbr
): n, v, př, P, m, včEMPTY
(145920): v, na, o, z, s, do, ve, k, pro, za
VERB
22 VERB tokens (0% of all VERB
tokens) have a non-empty value of Abbr
.
The most frequent other feature values with which VERB
and Abbr
co-occurred: Gender=EMPTY (22; 100%), VerbForm=Fin (22; 100%), Number=Sing (22; 100%), Polarity=Pos (22; 100%), Voice=Act (17; 77%), Person=3 (17; 77%), Mood=Ind (17; 77%), Tense=Pres (17; 77%).
VERB
tokens may have the following values of Abbr
:
Yes
(22; 100% of non-emptyAbbr
): tzn, j, srovEMPTY
(135488): má, je, může, řekl, měl, mají, musí, jde, měla, jsou
DET
21 DET tokens (0% of all DET
tokens) have a non-empty value of Abbr
.
The most frequent other feature values with which DET
and Abbr
co-occurred: Animacy=EMPTY (21; 100%), Number[psor]=EMPTY (15; 71%), Poss=EMPTY (15; 71%), Person=EMPTY (15; 71%), PronType=Dem (13; 62%), Gender=EMPTY (11; 52%), Number=EMPTY (11; 52%), Case=EMPTY (11; 52%).
DET
tokens may have the following values of Abbr
:
Yes
(21; 100% of non-emptyAbbr
): t, n, mn, všEMPTY
(56444): to, které, který, jeho, která, jejich, své, tím, kteří, tom
X
12 X tokens (92% of all X
tokens) have a non-empty value of Abbr
.
X
tokens may have the following values of Abbr
:
Yes
(12; 100% of non-emptyAbbr
): A, H, M, SEMPTY
(1): A
PART
3 PART tokens (0% of all PART
tokens) have a non-empty value of Abbr
.
PART
tokens may have the following values of Abbr
:
Yes
(3; 100% of non-emptyAbbr
): CAEMPTY
(8162): jen, až, asi, li, ne, nejen, prý, jenom, ano, bohužel
Relations with Agreement in Abbr
The 10 most frequent relations where parent and child node agree in Abbr
:
PROPN –[conj]–> PROPN (723; 66%),
ADJ –[amod]–> ADJ (48; 77%),
PROPN –[orphan]–> PROPN (15; 58%),
NOUN –[det]–> DET (15; 52%),
X –[nmod]–> X (9; 100%),
PROPN –[nsubj]–> PROPN (3; 100%),
NOUN –[flat:foreign]–> ADV (2; 100%),
PART –[conj]–> NOUN (2; 100%),
ADP –[dep]–> NOUN (1; 100%),
DET –[amod]–> ADJ (1; 100%).