Variant
: Variant
Sometimes there are multiple word forms for the same lemma and set of features.
The Variant
feature helps distinguish alternate forms.
In Russian adjectives may have a short form.
This feature only marks the non-standard short forms, hence there is only one value, Short
.
For the long standard forms the Variant
feature remains unspecified.
Short
: short form of adjectives
The short form is called nominal form of adjective (краткая форма прилагательных), as opposed to the long form, which is pronominal because it originated as a combination of a nominal form and a personal pronoun._
Examples
- красив “beautiful”, возможен “able”, нужен “necessary”, известен “known”, доволен “satisfied”, уверен “sure”, равен “equal”, готов “finished”, связан “connected”, виновен “guilty”
- Long equivalents: красивый, возможный, нужный, известный, довольный, уверенный, равный, готовый, связанный, виновный
Treebank Statistics (UD_Russian)
This feature is language-specific.
It occurs with 1 different values: Short
.
1040 tokens (1%) have a non-empty value of Variant
.
643 types (2%) occur at least once with a non-empty value of Variant
.
405 lemmas (2%) occur at least once with a non-empty value of Variant
.
The feature is used with 3 part-of-speech tags: ru-pos/VERB (820; 1% instances), ru-pos/ADJ (212; 0% instances), ru-pos/AUX (8; 0% instances).
VERB
820 ru-pos/VERB tokens (11% of all VERB
tokens) have a non-empty value of Variant
.
The most frequent other feature values with which VERB
and Variant
co-occurred: Mood=EMPTY (820; 100%), VerbForm=Part (820; 100%), Case=Nom (820; 100%), Person=EMPTY (820; 100%), Voice=Pass (819; 100%), Tense=Past (818; 100%), Aspect=Perf (817; 100%), Number=Sing (651; 79%), Animacy=Inan (647; 79%).
VERB
tokens may have the following values of Variant
:
Short
(820; 100% of non-emptyVariant
): расположен, назначен, основана, награждён, основан, расположена, расположено, принято, расположены, создана
Variant
seems to be lexical feature of VERB
. 100% lemmas (314) occur only with one value of Variant
.
ADJ
212 ru-pos/ADJ tokens (2% of all ADJ
tokens) have a non-empty value of Variant
.
The most frequent other feature values with which ADJ
and Variant
co-occurred: Case=Nom (212; 100%), Number=Sing (164; 77%), Animacy=Inan (152; 72%).
ADJ
tokens may have the following values of Variant
:
Short
(212; 100% of non-emptyVariant
): должен, известен, должна, должны, женат, известно, должно, известны, обязан, близок
Variant
seems to be lexical feature of ADJ
. 100% lemmas (90) occur only with one value of Variant
.
AUX
8 ru-pos/AUX tokens (1% of all AUX
tokens) have a non-empty value of Variant
.
The most frequent other feature values with which AUX
and Variant
co-occurred: Tense=Past (8; 100%), Number=Sing (8; 100%), Mood=EMPTY (8; 100%), Aspect=Perf (8; 100%), Voice=Pass (8; 100%), VerbForm=Part (8; 100%), Person=EMPTY (8; 100%), Gender=Masc (5; 63%).
AUX
tokens may have the following values of Variant
:
Short
(8; 100% of non-emptyVariant
): назначен, исполнено, найден, предусмотрена, признана, сертифицирован
Relations with Agreement in Variant
The 10 most frequent relations where parent and child node agree in Variant
:
ADJ –[conj]–> ADJ (10; 77%),
ADJ –[ccomp]–> ADJ (1; 100%).
Treebank Statistics (UD_Russian-SynTagRus)
This feature is language-specific.
It occurs with 1 different values: Short
.
13004 tokens (1%) have a non-empty value of Variant
.
3502 types (3%) occur at least once with a non-empty value of Variant
.
1810 lemmas (5%) occur at least once with a non-empty value of Variant
.
The feature is used with 2 part-of-speech tags: ru-pos/ADJ (8235; 1% instances), ru-pos/VERB (4769; 0% instances).
ADJ
8235 ru-pos/ADJ tokens (8% of all ADJ
tokens) have a non-empty value of Variant
.
The most frequent other feature values with which ADJ
and Variant
co-occurred: Degree=Pos (8235; 100%), Case=EMPTY (8233; 100%), Number=Sing (6564; 80%).
ADJ
tokens may have the following values of Variant
:
Short
(8235; 100% of non-emptyVariant
): нужно, должен, должны, должна, известно, необходимо, невозможно, должно, важно, трудно
Variant
seems to be lexical feature of ADJ
. 100% lemmas (894) occur only with one value of Variant
.
VERB
4769 ru-pos/VERB tokens (4% of all VERB
tokens) have a non-empty value of Variant
.
The most frequent other feature values with which VERB
and Variant
co-occurred: Case=EMPTY (4769; 100%), VerbForm=Part (4769; 100%), Voice=Pass (4769; 100%), Mood=EMPTY (4769; 100%), Person=EMPTY (4769; 100%), Tense=Past (4758; 100%), Aspect=Perf (4718; 99%), Number=Sing (3320; 70%).
VERB
tokens may have the following values of Variant
:
Short
(4769; 100% of non-emptyVariant
): связано, связаны, сделано, связана, связан, сказано, принято, написано, создан, создана
Variant
seems to be lexical feature of VERB
. 100% lemmas (917) occur only with one value of Variant
.
Relations with Agreement in Variant
The 10 most frequent relations where parent and child node agree in Variant
:
ADJ –[conj]–> ADJ (382; 85%),
ADJ –[advcl]–> ADJ (96; 79%),
ADJ –[parataxis]–> ADJ (53; 52%),
ADJ –[amod]–> ADJ (4; 100%),
ADJ –[appos]–> ADJ (2; 100%),
VERB –[appos]–> VERB (1; 100%).
Variant in other languages: [cs] [da] [nl] [pl] [ro] [ru] [sl]