
This page pertains to UD version 2.

Hyph: hyphenated compound or part of it

Values: Yes

Boolean feature. Is this the first part of a hyphenated compound?

Hyphenated compound adjectives such as česko-slovenský “Czech-Slovak” are split during tokenization. The last part, slovenský, is an independent adjective with a full inflectional paradigm. The first part, česko, however, is a form that occurs only in compounds (the independent form would be český).

Yes: it is part of a hyphenated compound
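As a rough illustration of how the feature looks in annotated data, the sketch below parses a CoNLL-U FEATS string and checks for Hyph=Yes. The helper names `parse_feats` and `is_hyph` are hypothetical, and the two feature strings are illustrative assumptions rather than lines copied from a treebank; the first is modelled on the co-occurrence statistics on this page (Polarity=Pos, with Gender, Case, Degree, Number and Animacy empty).

```python
def parse_feats(feats):
    """Parse a CoNLL-U FEATS string such as 'Hyph=Yes|Polarity=Pos' into a dict."""
    if not feats or feats == "_":  # "_" marks an empty FEATS column
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def is_hyph(feats):
    """Return True if the FEATS string carries Hyph=Yes."""
    return parse_feats(feats).get("Hyph") == "Yes"


# Assumed annotation of the first part of a compound (e.g. česko).
first_part = "Hyph=Yes|Polarity=Pos"
# Assumed annotation of the fully inflected last part (e.g. slovenský).
last_part = "Case=Nom|Degree=Pos|Gender=Masc|Number=Sing|Polarity=Pos"

print(is_hyph(first_part), is_hyph(last_part))
```

Because the feature has a single value, a missing Hyph key is what distinguishes ordinary adjectives from compound parts; there is no Hyph=No in this scheme.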

Examples


Treebank Statistics (UD_Czech)

This feature is language-specific. It occurs with 1 value: Yes.

348 tokens (0%) have a non-empty value of Hyph. 130 types (0%) occur at least once with a non-empty value of Hyph. 122 lemmas (0%) occur at least once with a non-empty value of Hyph. The feature is used with 1 part-of-speech tag: cs-pos/ADJ (348; 0% of instances).
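Counts of this kind could be reproduced roughly as follows. This is a minimal sketch, not the script that generated the page: `hyph_stats` is a hypothetical helper, it treats a "type" as a distinct word form, and the three-token sample (including its lemmas, heads and relations) is toy data invented for illustration.

```python
from collections import Counter


def hyph_stats(conllu_text):
    """Count tokens, types (word forms), lemmas and UPOS tags with a non-empty Hyph."""
    tokens, types, lemmas, upos = 0, set(), set(), Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword-token ranges and empty nodes
        feats = dict(p.split("=", 1) for p in cols[5].split("|") if "=" in p)
        if "Hyph" in feats:
            tokens += 1
            types.add(cols[1])
            lemmas.add(cols[2])
            upos[cols[3]] += 1
    return {"tokens": tokens, "types": len(types),
            "lemmas": len(lemmas), "upos": dict(upos)}


# Toy sample; every column value here is an assumption.
ROWS = [
    ("1", "česko", "český", "ADJ", "_", "Hyph=Yes|Polarity=Pos", "3", "amod", "_", "_"),
    ("2", "-", "-", "PUNCT", "_", "_", "1", "punct", "_", "_"),
    ("3", "slovenský", "slovenský", "ADJ", "_", "Case=Nom|Polarity=Pos", "0", "root", "_", "_"),
]
SAMPLE = "\n".join("\t".join(row) for row in ROWS)

print(hyph_stats(SAMPLE))
```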

ADJ

348 cs-pos/ADJ tokens (0% of all ADJ tokens) have a non-empty value of Hyph.

The most frequent other feature values with which ADJ and Hyph co-occurred: Gender=EMPTY (348; 100%), Case=EMPTY (348; 100%), Degree=EMPTY (348; 100%), Number=EMPTY (348; 100%), Animacy=EMPTY (348; 100%), Polarity=Pos (347; 100%).

ADJ tokens may have the following values of Hyph:

Hyph seems to be a lexical feature of ADJ. 100% of lemmas (122) occur with only one value of Hyph.
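The page does not say how the "lexical feature" statistic is computed. One plausible reading, sketched below under that assumption, collects the non-empty Hyph values seen for each lemma and reports how many lemmas have exactly one. The function name `single_valued_lemmas` and the observations are hypothetical.

```python
from collections import defaultdict


def single_valued_lemmas(observations):
    """Return (lemmas with exactly one Hyph value, lemmas with any Hyph value).

    `observations` is an iterable of (lemma, value) pairs, where value is
    None when the token has no Hyph feature. Empty values are ignored here,
    which is an assumption about how the statistic is defined.
    """
    values_by_lemma = defaultdict(set)
    for lemma, value in observations:
        if value is not None:
            values_by_lemma[lemma].add(value)
    single = sum(1 for values in values_by_lemma.values() if len(values) == 1)
    return single, len(values_by_lemma)


# Toy observations, invented for illustration.
observations = [
    ("český", "Yes"),
    ("český", "Yes"),
    ("slovenský", None),
    ("vědecký", "Yes"),
]

single, total = single_valued_lemmas(observations)
print(f"{single} of {total} lemmas occur with only one value of Hyph")
```

Under this reading, a feature whose inventory contains only the value Yes will always score 100%, so the figure says little on its own about whether the feature is truly lexical.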


Treebank Statistics (UD_Czech-CAC)

This feature is language-specific. It occurs with 1 value: Yes.

130 tokens (0%) have a non-empty value of Hyph. 65 types (0%) occur at least once with a non-empty value of Hyph. 60 lemmas (0%) occur at least once with a non-empty value of Hyph. The feature is used with 1 part-of-speech tag: cs-pos/ADJ (130; 0% of instances).

ADJ

130 cs-pos/ADJ tokens (0% of all ADJ tokens) have a non-empty value of Hyph.

The most frequent other feature values with which ADJ and Hyph co-occurred: Polarity=Pos (130; 100%), Animacy=EMPTY (130; 100%), Number=EMPTY (130; 100%), Degree=EMPTY (130; 100%), Case=EMPTY (130; 100%), Gender=EMPTY (130; 100%).

ADJ tokens may have the following values of Hyph:

Hyph seems to be a lexical feature of ADJ. 100% of lemmas (60) occur with only one value of Hyph.

Relations with Agreement in Hyph

The 10 most frequent relations where parent and child nodes agree in Hyph: ADJ –[amod]–> ADJ (3; 100%).
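An agreement count like the one above could be gathered along these lines. This is a sketch under stated assumptions: "agree" is taken to mean that parent and child both carry the same non-empty Hyph value, `hyph_agreement` is a hypothetical helper, and the toy sentence is invented to show a chain of compound parts.

```python
from collections import Counter


def feats_to_dict(feats):
    """Turn a FEATS string into a dict; '_' yields an empty dict."""
    return dict(p.split("=", 1) for p in feats.split("|") if "=" in p)


def hyph_agreement(tokens):
    """Count parent -[deprel]-> child patterns where both nodes share a Hyph value."""
    by_id = {t["id"]: t for t in tokens}
    counts = Counter()
    for child in tokens:
        parent = by_id.get(child["head"])
        if parent is None:
            continue  # the root has no parent node
        child_value = feats_to_dict(child["feats"]).get("Hyph")
        parent_value = feats_to_dict(parent["feats"]).get("Hyph")
        if child_value is not None and child_value == parent_value:
            pattern = f'{parent["upos"]} -[{child["deprel"]}]-> {child["upos"]}'
            counts[pattern] += 1
    return counts


# Toy sentence: two compound parts attached in a chain (all values assumed).
tokens = [
    {"id": 1, "head": 2, "upos": "ADJ", "deprel": "amod", "feats": "Hyph=Yes|Polarity=Pos"},
    {"id": 2, "head": 3, "upos": "ADJ", "deprel": "amod", "feats": "Hyph=Yes|Polarity=Pos"},
    {"id": 3, "head": 0, "upos": "ADJ", "deprel": "root", "feats": "Case=Nom|Polarity=Pos"},
]

print(hyph_agreement(tokens).most_common(10))
```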


Treebank Statistics (UD_Czech-CLTT)

This feature is language-specific. It occurs with 1 value: Yes.

10 tokens (0%) have a non-empty value of Hyph. 2 types (0%) occur at least once with a non-empty value of Hyph. 2 lemmas (0%) occur at least once with a non-empty value of Hyph. The feature is used with 1 part-of-speech tag: cs-pos/ADJ (10; 0% of instances).

ADJ

10 cs-pos/ADJ tokens (0% of all ADJ tokens) have a non-empty value of Hyph.

The most frequent other feature values with which ADJ and Hyph co-occurred: Degree=EMPTY (10; 100%), Polarity=Pos (10; 100%), Number=EMPTY (10; 100%), Animacy=EMPTY (10; 100%), Gender=EMPTY (10; 100%), Case=EMPTY (10; 100%).

ADJ tokens may have the following values of Hyph:


Hyph in other languages: [cs] [et] [pl] [pt]