This page pertains to UD version 2.

Treebank Statistics: UD_Czech-CAC: Features: Variant

This feature is language-specific. It occurs with 1 value: Short.

14259 tokens (3%) have a non-empty value of Variant. 2017 types (3%) occur at least once with a non-empty value of Variant. 1099 lemmas (4%) occur at least once with a non-empty value of Variant. The feature is used with 2 part-of-speech tags: PRON (9195; 2% of instances), ADJ (5064; 1% of instances).
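Counts of this kind can be reproduced from a treebank file in CoNLL-U format. The following is only a minimal sketch, not the script that generated this page: the function names (parse_feats, feature_stats, row) are hypothetical, the sample sentence and its annotation are invented for illustration, and it assumes that types are distinct case-sensitive word forms and that multiword-token ranges and empty nodes are left out of the counts.

```python
from collections import Counter


def parse_feats(feats):
    """Turn a CoNLL-U FEATS string into a dict; "_" means no features."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def feature_stats(conllu_text, feature="Variant"):
    """Count tokens, word forms and lemmas that carry `feature`."""
    total = 0
    forms, lemmas = set(), set()
    by_upos = Counter()
    for line in conllu_text.splitlines():
        cols = line.split("\t")
        # Keep only basic word lines: 10 columns and a plain integer ID.
        # This skips comments, blank lines, ranges (1-2) and empty nodes (1.1).
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        total += 1
        if feature in parse_feats(cols[5]):
            forms.add(cols[1])
            lemmas.add(cols[2])
            by_upos[cols[3]] += 1
    tagged = sum(by_upos.values())
    return {
        "tokens": tagged,
        "token_share": tagged / total if total else 0.0,
        "types": len(forms),
        "lemmas": len(lemmas),
        "by_upos": dict(by_upos),
    }


def row(idx, form, lemma, upos, feats, head, deprel):
    """Build one 10-column CoNLL-U line (XPOS, DEPS and MISC left as "_")."""
    return "\t".join(
        [str(idx), form, lemma, upos, "_", feats, str(head), deprel, "_", "_"]
    )


# Invented toy sentence, not taken from UD_Czech-CAC.
SAMPLE = "\n".join([
    "# text = Je si jist .",
    row(1, "Je", "být", "AUX", "_", 3, "cop"),
    row(2, "si", "se", "PRON",
        "Case=Dat|PronType=Prs|Reflex=Yes|Variant=Short", 3, "expl:pv"),
    row(3, "jist", "jistý", "ADJ",
        "Gender=Masc|Number=Sing|Polarity=Pos|Variant=Short", 0, "root"),
    row(4, ".", ".", "PUNCT", "_", 3, "punct"),
])

stats = feature_stats(SAMPLE)
print(stats)
```

On a real treebank file, the same function could be called on the file's contents read as UTF-8 text. The percentages on this page may use different denominators than the simple token share computed here.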

PRON

9195 PRON tokens (57% of all PRON tokens) have a non-empty value of Variant.

The most frequent other feature values with which PRON and Variant co-occurred: PrepCase=EMPTY (9195; 100%), PronType=Prs (9195; 100%), Gender=EMPTY (8883; 97%), Reflex=Yes (8706; 95%), Number=EMPTY (8705; 95%), Person=EMPTY (8705; 95%), Case=Acc (7929; 86%).

PRON tokens may have the following value of Variant: Short (9195; 100% of non-empty Variant).

ADJ

5064 ADJ tokens (7% of all ADJ tokens) have a non-empty value of Variant.

The most frequent other feature values with which ADJ and Variant co-occurred: Degree=EMPTY (5064; 100%), Case=EMPTY (5056; 100%), Polarity=Pos (5034; 99%), Animacy=EMPTY (3654; 72%).

ADJ tokens may have the following value of Variant: Short (5064; 100% of non-empty Variant).

Variant seems to be a lexical feature of ADJ. 100% of lemmas (1095) occur with only one value of Variant.

Relations with Agreement in Variant

The most frequent relations where the parent and child nodes agree in Variant: ADJ –[conj]–> ADJ (328; 79%), ADJ –[orphan]–> ADJ (5; 71%), ADJ –[appos]–> ADJ (4; 80%), ADJ –[advmod]–> ADJ (1; 100%), ADJ –[ccomp]–> ADJ (1; 100%).
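Agreement counts like those above could be gathered by walking each dependency edge and comparing the feature on parent and child. The sketch below is an illustration under stated assumptions, not the method used to build this page: read_sentences and agreement_counts are hypothetical names, the sample sentence is invented, and "agreement" is taken to mean that both nodes carry the same non-empty value. The page does not state what the percentages are measured against, so only raw counts are computed.

```python
from collections import Counter


def read_sentences(conllu_text):
    """Yield each sentence as {token ID: (upos, feats, head, deprel)}."""
    sentence = {}
    # The extra "" makes sure the last sentence is flushed.
    for line in conllu_text.splitlines() + [""]:
        cols = line.split("\t")
        if len(cols) == 10 and cols[0].isdigit():
            feats = {}
            if cols[5] != "_":
                feats = dict(p.split("=", 1) for p in cols[5].split("|"))
            # A missing head ("_") is treated like the root (0).
            head = int(cols[6]) if cols[6].isdigit() else 0
            sentence[int(cols[0])] = (cols[3], feats, head, cols[7])
        elif not line.strip() and sentence:
            yield sentence
            sentence = {}


def agreement_counts(conllu_text, feature="Variant"):
    """Count edges whose parent and child share a non-empty `feature` value."""
    agree = Counter()
    for sentence in read_sentences(conllu_text):
        for upos, feats, head, deprel in sentence.values():
            parent = sentence.get(head)  # None when head is 0 (root)
            if parent is None or feature not in feats:
                continue
            if parent[1].get(feature) == feats[feature]:
                agree[(parent[0], deprel, upos)] += 1
    return agree


def row(idx, form, lemma, upos, feats, head, deprel):
    """Build one 10-column CoNLL-U line (XPOS, DEPS and MISC left as "_")."""
    return "\t".join(
        [str(idx), form, lemma, upos, "_", feats, str(head), deprel, "_", "_"]
    )


# Invented toy sentence with two coordinated short-form adjectives.
SAMPLE = "\n".join([
    "# text = Je zdráv a šťasten .",
    row(1, "Je", "být", "AUX", "_", 2, "cop"),
    row(2, "zdráv", "zdravý", "ADJ", "Polarity=Pos|Variant=Short", 0, "root"),
    row(3, "a", "a", "CCONJ", "_", 4, "cc"),
    row(4, "šťasten", "šťastný", "ADJ", "Polarity=Pos|Variant=Short", 2, "conj"),
    row(5, ".", ".", "PUNCT", "_", 2, "punct"),
])

counts = agreement_counts(SAMPLE)
for (parent_upos, deprel, child_upos), n in counts.most_common(10):
    print(f"{parent_upos} -[{deprel}]-> {child_upos} ({n})")
```

Keying the counter on (parent tag, relation, child tag) mirrors the way the relations are presented on this page; most_common(10) then gives at most ten entries, fewer when fewer relation types are attested.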