Voice

This page still pertains to UD version 1.

`Voice`: voice

Voice is a feature of verbs that helps map the traditional syntactic functions, such as subject and object, to semantic roles, such as agent and patient.

`Act`: active voice

Prototypically, the subject of the verb is the doer of the action (agent), and the object is affected by the action (patient).

All active participles (in present and past form) are tagged Voice=Act. By default, the finite forms, infinitives and gerunds of non-reflexive verbs are also labeled Voice=Act, unless they are labeled Voice=Pass or Voice=Mid (see below).

Examples

Мы атаковали врага. “We attacked the enemy”

`Pass`: passive voice

The subject of the verb is affected by the action (patient). The doer (agent) is either a non-obligatory oblique phrase of the verb or not overtly expressed.

The passive participles (in present and past form) are tagged Voice=Pass. Finite forms of non-reflexive verbs are labeled Voice=Pass when they occur in the passive construction; in this case the form is marked with -sja, but the lemma is tagged as non-reflexive.

Examples

Мы были атакованы врагом. “We were attacked by the enemy”
Разработки лекарства ведутся несколькими международными компаниями. “Drug development is conducted by several international companies”

`Mid`: middle voice

The middle voice lies between active and passive. It is needed for reflexive verbs, in all forms except the active participle.

Examples

Я занялся музыкой. “I took up.Refl music”
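
The rules given above for Act, Pass and Mid can be read as a small decision procedure. The following is a minimal sketch in Python, not the procedure actually used to annotate the treebank. The function name `assign_voice` and its parameters are hypothetical, and the sketch assumes that participle voice, lemma reflexivity and the presence of a passive construction have already been determined elsewhere.

```python
from typing import Optional


def assign_voice(verb_form: str,
                 reflexive_lemma: bool,
                 participle_voice: Optional[str] = None,
                 passive_construction: bool = False) -> str:
    """Return a Voice value following the rules described on this page.

    verb_form            -- UD VerbForm value, e.g. "Fin", "Inf", "Part"
    reflexive_lemma      -- True if the lemma itself is reflexive
    participle_voice     -- "Act" or "Pass" for participles, else None
    passive_construction -- True for a -sja form of a non-reflexive lemma
                            used in the passive construction (assumed input)
    """
    # Participles carry their voice morphologically.
    if verb_form == "Part" and participle_voice in ("Act", "Pass"):
        return participle_voice
    # Reflexive verbs are Mid in all forms except the active participle.
    if reflexive_lemma:
        return "Mid"
    # Non-reflexive lemma with -sja in the passive construction.
    if passive_construction:
        return "Pass"
    # Default for finite forms, infinitives and gerunds.
    return "Act"


print(assign_voice("Fin", reflexive_lemma=False))                             # атаковали
print(assign_voice("Part", reflexive_lemma=False, participle_voice="Pass"))   # атакованы
print(assign_voice("Fin", reflexive_lemma=False, passive_construction=True))  # ведутся
print(assign_voice("Fin", reflexive_lemma=True))                              # занялся
```

Applied to the examples on this page, the four calls yield Act, Pass, Pass and Mid respectively.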

References

Anna Siewierska. 2013. Passive Constructions. In: Dryer, Matthew S. & Haspelmath, Martin (eds.) The World Atlas of Language Structures Online. Leipzig: Max Planck Institute for Evolutionary Anthropology. (http://wals.info/chapter/107)

Treebank Statistics (UD_Russian)

This feature is universal but the value Mid is language-specific. It occurs with 3 different values: Act, Mid, Pass.

3351 tokens (4%) have a non-empty value of Voice. 2220 types (8%) occur at least once with a non-empty value of Voice. 1202 lemmas (7%) occur at least once with a non-empty value of Voice. The feature is used with 2 part-of-speech tags: ru-pos/VERB (3188; 4% instances), ru-pos/AUX (163; 0% instances).
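
Counts of this kind can be reproduced from a treebank file in CoNLL-U format, where each token line has ten tab-separated columns and the morphological features sit in the sixth one (FEATS). The sketch below is one possible way to tally Voice values per part-of-speech tag; the function names are hypothetical, and the sample sentence is hand-annotated for illustration only and is not taken from the treebank.

```python
from collections import Counter


def parse_feats(feats: str) -> dict:
    """Turn a FEATS string such as 'Number=Plur|Voice=Pass' into a dict."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def voice_counts(conllu_text: str) -> Counter:
    """Count (UPOS, Voice) pairs over all regular token lines."""
    counts = Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword ranges (1-2) and empty nodes (1.1)
        voice = parse_feats(cols[5]).get("Voice")
        if voice is not None:
            counts[(cols[3], voice)] += 1
    return counts


# Illustrative sample only (hand-made annotation, UD v1 relation names).
rows = [
    ("1", "Мы", "мы", "PRON", "_", "Case=Nom|Number=Plur|Person=1", "3", "nsubjpass", "_", "_"),
    ("2", "были", "быть", "AUX", "_", "Mood=Ind|Number=Plur|Tense=Past|VerbForm=Fin", "3", "auxpass", "_", "_"),
    ("3", "атакованы", "атаковать", "VERB", "_", "Number=Plur|Tense=Past|VerbForm=Part|Voice=Pass", "0", "root", "_", "_"),
    ("4", "врагом", "враг", "NOUN", "_", "Case=Ins|Gender=Masc|Number=Sing", "3", "nmod", "_", "_"),
]
sample = "\n".join("\t".join(row) for row in rows)

print(voice_counts(sample))  # Counter({('VERB', 'Pass'): 1})
```

Summing the counter by tag gives per-tag totals comparable to the VERB and AUX figures reported here.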

`VERB`

3188 ru-pos/VERB tokens (44% of all VERB tokens) have a non-empty value of Voice.

The most frequent other feature values with which VERB and Voice co-occurred: Person=EMPTY (2628; 82%), Variant=EMPTY (2368; 74%), Number=Sing (2171; 68%), Tense=Past (2133; 67%), Mood=EMPTY (1944; 61%), VerbForm=Part (1813; 57%), Aspect=Perf (1774; 56%).

VERB tokens may have the following values of Voice:

Act (518; 16% of non-empty Voice): вышедший, вышедшая, находящиеся, означающее, относящиеся, погибших, составляющие, существующих, устанавливающий, Singing
Mid (1375; 43% of non-empty Voice): находится, родился, относится, находился, учился, находятся, становится, удалось, состоялся, вернулся
Pass (1295; 41% of non-empty Voice): расположен, назначен, основана, награждён, основан, расположена, расположено, принято, расположены, создана
EMPTY (4115): составляет, может, получил, входит, можно, имеет, есть, было, начал, занимает

Paradigm НАХОДИТЬСЯ	`Act`	`Mid`
`Animacy=Anim\|Case=Acc\|Number=Plur\|Tense=Pres\|VerbForm=Part`	находящиеся
`Animacy=Anim\|Case=Gen\|Number=Plur\|Tense=Pres\|VerbForm=Part`	находящихся
`Animacy=Anim\|Case=Nom\|Gender=Masc\|Number=Sing\|Tense=Past\|VerbForm=Part`	находившийся
`Animacy=Anim\|Case=Nom\|Number=Plur\|Tense=Pres\|VerbForm=Part`	находящиеся
`Animacy=Inan\|Case=Acc\|Gender=Masc\|Number=Sing\|Tense=Past\|VerbForm=Part`	находившийся
`Animacy=Inan\|Case=Acc\|Gender=Fem\|Number=Sing\|Tense=Pres\|VerbForm=Part`	находящуюся
`Animacy=Inan\|Case=Dat\|Number=Plur\|Tense=Past\|VerbForm=Part`	находившимся
`Animacy=Inan\|Case=Gen\|Number=Plur\|Tense=Pres\|VerbForm=Part`	находящихся
`Animacy=Inan\|Case=Ins\|Gender=Masc\|Number=Sing\|Tense=Pres\|VerbForm=Part`	находящемся
`Animacy=Inan\|Case=Ins\|Gender=Fem\|Number=Sing\|Tense=Past\|VerbForm=Part`	находившейся
`Animacy=Inan\|Case=Ins\|Gender=Fem\|Number=Sing\|Tense=Pres\|VerbForm=Part`	находящейся
`Animacy=Inan\|Case=Ins\|Gender=Neut\|Number=Sing\|Tense=Past\|VerbForm=Part`	находившимся
`Animacy=Inan\|Case=Loc\|Gender=Fem\|Number=Sing\|Tense=Pres\|VerbForm=Part`	находящейся
`Animacy=Inan\|Case=Loc\|Number=Plur\|Tense=Past\|VerbForm=Part`	находившихся
`Animacy=Inan\|Case=Nom\|Gender=Neut\|Number=Sing\|Tense=Past\|VerbForm=Part`	находившееся
`Animacy=Inan\|Case=Nom\|Number=Plur\|Tense=Pres\|VerbForm=Part`	находящиеся
`Gender=Masc\|Mood=Ind\|Number=Sing\|Tense=Past\|VerbForm=Fin`		находился
`Gender=Fem\|Mood=Ind\|Number=Sing\|Tense=Past\|VerbForm=Fin`		находилась
`Gender=Neut\|Mood=Ind\|Number=Sing\|Tense=Past\|VerbForm=Fin`		находилось
`Mood=Ind\|Number=Sing\|Person=3\|Tense=Pres\|VerbForm=Fin`		находится
`Mood=Ind\|Number=Plur\|Person=3\|Tense=Pres\|VerbForm=Fin`		находятся
`Mood=Ind\|Number=Plur\|Tense=Past\|VerbForm=Fin`		находились

Voice seems to be a lexical feature of VERB. 92% of lemmas (1105) occur with only one value of Voice.
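
The share of lemmas that occur with a single Voice value can be computed from (lemma, Voice) pairs. The following is a small sketch under that assumption; the function name `single_voice_lemmas` is hypothetical, and the input is a toy list built from forms mentioned on this page rather than real treebank counts.

```python
from collections import defaultdict


def single_voice_lemmas(pairs):
    """Return (lemmas with exactly one Voice value, lemmas with any Voice value)."""
    values = defaultdict(set)
    for lemma, voice in pairs:
        values[lemma].add(voice)
    single = sum(1 for seen in values.values() if len(seen) == 1)
    return single, len(values)


# Toy input: находиться appears with both Act and Mid, заняться only with Mid.
pairs = [
    ("находиться", "Act"),
    ("находиться", "Mid"),
    ("заняться", "Mid"),
]
single, total = single_voice_lemmas(pairs)
print(single, total)  # 1 2
```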

`AUX`

163 ru-pos/AUX tokens (16% of all AUX tokens) have a non-empty value of Voice.

The most frequent other feature values with which AUX and Voice co-occurred: VerbForm=Fin (140; 86%), Mood=Ind (140; 86%), Aspect=Imp (132; 81%), Number=Sing (127; 78%), Gender=EMPTY (123; 75%), Person=3 (109; 67%), Tense=Pres (109; 67%).

AUX tokens may have the following values of Voice:

Act (12; 7% of non-empty Voice): ставшие, Ставшая, бывшего, бывшие, бывшим, остававшееся, ставший, ставшим, ставших, являющегося
Mid (143; 88% of non-empty Voice): является, являются, являлся, являлась, считается, оказывается, остались, остаётся, явились, явилось
Pass (8; 5% of non-empty Voice): назначен, исполнено, найден, предусмотрена, признана, сертифицирован
EMPTY (840): был, было, были, была, стал, это, быть, будет, стала, стало

Paradigm ЯВЛЯТЬСЯ	`Act`	`Mid`
`Animacy=Anim\|Case=Gen\|Gender=Masc\|Number=Sing\|Tense=Pres\|VerbForm=Part`	являющегося
`Gender=Masc\|Mood=Ind\|Number=Sing\|Tense=Past\|VerbForm=Fin`		являлся
`Gender=Fem\|Mood=Ind\|Number=Sing\|Tense=Past\|VerbForm=Fin`		являлась
`Mood=Ind\|Number=Sing\|Person=3\|Tense=Pres\|VerbForm=Fin`		является
`Mood=Ind\|Number=Plur\|Person=3\|Tense=Pres\|VerbForm=Fin`		являются
`Mood=Ind\|Number=Plur\|Tense=Past\|VerbForm=Fin`		являлись
`VerbForm=Inf`		являться

Treebank Statistics (UD_Russian-SynTagRus)

This feature is universal but the value Mid is language-specific. It occurs with 3 different values: Act, Mid, Pass.

118006 tokens (12%) have a non-empty value of Voice. 31598 types (29%) occur at least once with a non-empty value of Voice. 5884 lemmas (15%) occur at least once with a non-empty value of Voice. The feature is used with 2 part-of-speech tags: ru-pos/VERB (110740; 11% instances), ru-pos/AUX (7266; 1% instances).

`VERB`

110740 ru-pos/VERB tokens (100% of all VERB tokens) have a non-empty value of Voice.

The most frequent other feature values with which VERB and Voice co-occurred: Case=EMPTY (99171; 90%), Gender=EMPTY (77180; 70%), Person=EMPTY (74711; 67%), VerbForm=Fin (70244; 63%), Mood=Ind (69379; 63%), Aspect=Imp (62831; 57%), Number=Sing (56828; 51%).

VERB tokens may have the following values of Voice:

Act (76159; 69% of non-empty Voice): может, есть, нет, могут, было, быть, говорит, сказал, сделать, стоит
Mid (21107; 19% of non-empty Voice): является, стало, стал, удалось, становится, стать, кажется, остается, приходится, оказалось
Pass (13474; 12% of non-empty Voice): считается, говорится, связано, используется, связаны, используются, сделано, связана, связанных, связан
EMPTY (1): и.о.

Paradigm говорить	`Act`	`Pass`
`Animacy=Anim\|Aspect=Imp\|Case=Acc\|Gender=Masc\|Number=Sing\|Tense=Past\|VerbForm=Part`	говорившего
`Animacy=Anim\|Aspect=Imp\|Case=Acc\|Number=Plur\|Tense=Pres\|VerbForm=Part`	говорящих
`Aspect=Imp\|Case=Acc\|Gender=Fem\|Number=Sing\|Tense=Past\|VerbForm=Part`	говорившую
`Aspect=Imp\|Case=Gen\|Gender=Masc\|Number=Sing\|Tense=Pres\|VerbForm=Part`	говорящего
`Aspect=Imp\|Case=Nom\|Gender=Masc\|Number=Sing\|Tense=Pres\|VerbForm=Part`	говорящий
`Aspect=Imp\|Gender=Masc\|Mood=Ind\|Number=Sing\|Tense=Past\|VerbForm=Fin`	говорил
`Aspect=Imp\|Gender=Fem\|Mood=Ind\|Number=Sing\|Tense=Past\|VerbForm=Fin`	говорила
`Aspect=Imp\|Gender=Neut\|Mood=Ind\|Number=Sing\|Tense=Past\|VerbForm=Fin`	говорило	говорилось
`Aspect=Imp\|Mood=Imp\|Number=Sing\|Person=2\|VerbForm=Fin`	говори
`Aspect=Imp\|Mood=Ind\|Number=Sing\|Person=1\|Tense=Pres\|VerbForm=Fin`	говорю
`Aspect=Imp\|Mood=Ind\|Number=Sing\|Person=2\|Tense=Pres\|VerbForm=Fin`	говоришь
`Aspect=Imp\|Mood=Ind\|Number=Sing\|Person=3\|Tense=Pres\|VerbForm=Fin`	говорит	говорится
`Aspect=Imp\|Mood=Ind\|Number=Plur\|Person=1\|Tense=Pres\|VerbForm=Fin`	говорим
`Aspect=Imp\|Mood=Ind\|Number=Plur\|Person=2\|Tense=Pres\|VerbForm=Fin`	говорите
`Aspect=Imp\|Mood=Ind\|Number=Plur\|Person=3\|Tense=Pres\|VerbForm=Fin`	говорят
`Aspect=Imp\|Mood=Ind\|Number=Plur\|Tense=Past\|VerbForm=Fin`	говорили
`Aspect=Imp\|Tense=Pres\|VerbForm=Conv`	говоря
`Aspect=Imp\|VerbForm=Inf`	говорить
`Aspect=Perf\|Gender=Masc\|Mood=Ind\|Number=Sing\|Tense=Past\|VerbForm=Fin`	поговорил
`Aspect=Perf\|Mood=Imp\|Number=Plur\|Person=2\|VerbForm=Fin`	Поговорите
`Aspect=Perf\|Mood=Ind\|Number=Sing\|Person=1\|Tense=Fut\|VerbForm=Fin`	поговорю
`Aspect=Perf\|Mood=Ind\|Number=Sing\|Person=3\|Tense=Fut\|VerbForm=Fin`	поговорит
`Aspect=Perf\|Mood=Ind\|Number=Plur\|Person=1\|Tense=Fut\|VerbForm=Fin`	поговорим
`Aspect=Perf\|Mood=Ind\|Number=Plur\|Tense=Past\|VerbForm=Fin`	поговорили
`Aspect=Perf\|Tense=Past\|VerbForm=Conv`	Поговорив
`Aspect=Perf\|VerbForm=Inf`	поговорить

`AUX`

7266 ru-pos/AUX tokens (100% of all AUX tokens) have a non-empty value of Voice.

The most frequent other feature values with which AUX and Voice co-occurred: Aspect=Imp (7266; 100%), VerbForm=Fin (6570; 90%), Mood=Ind (6547; 90%), Person=EMPTY (5382; 74%), Number=Sing (4885; 67%), Tense=Past (4691; 65%).

AUX tokens may have the following values of Voice:

Act (7266; 100% of non-empty Voice): было, был, были, будет, была, быть, будут, есть, будем, буду

Relations with Agreement in `Voice`

The 10 most frequent relations where parent and child node agree in Voice: VERB –[conj]–> VERB (8586; 65%), VERB –[xcomp]–> VERB (5294; 68%), VERB –[advcl]–> VERB (5132; 59%), VERB –[parataxis]–> VERB (2081; 58%), VERB –[aux]–> AUX (628; 76%), VERB –[ccomp]–> VERB (525; 67%), VERB –[dep]–> VERB (161; 55%), VERB –[advmod]–> VERB (93; 59%), VERB –[orphan]–> VERB (34; 54%), VERB –[cop]–> VERB (26; 79%).
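
Agreement figures of this kind can be obtained by comparing the Voice value of each node with that of its parent. The sketch below is one possible way to do so for a single sentence. The function name, the token dictionary layout and the toy sentence are assumptions made for illustration, and the sketch counts only pairs in which both nodes carry a Voice value, since the page does not state which denominator the percentages use.

```python
from collections import Counter


def voice_agreement(tokens):
    """Count parent-child pairs that agree in Voice, per relation type.

    tokens: list of dicts with keys id, head, upos, deprel, voice
            (voice is None when the feature is empty).
    Returns (agree, total), both Counters keyed by
    (parent UPOS, deprel, child UPOS).
    """
    by_id = {tok["id"]: tok for tok in tokens}
    agree, total = Counter(), Counter()
    for child in tokens:
        parent = by_id.get(child["head"])  # None for the root (head 0)
        if parent is None or child["voice"] is None or parent["voice"] is None:
            continue
        key = (parent["upos"], child["deprel"], child["upos"])
        total[key] += 1
        if parent["voice"] == child["voice"]:
            agree[key] += 1
    return agree, total


# Toy sentence skeleton, not taken from the treebank.
tokens = [
    {"id": 1, "head": 0, "upos": "VERB", "deprel": "root", "voice": "Act"},
    {"id": 2, "head": 1, "upos": "VERB", "deprel": "conj", "voice": "Act"},
    {"id": 3, "head": 1, "upos": "VERB", "deprel": "xcomp", "voice": "Mid"},
    {"id": 4, "head": 1, "upos": "NOUN", "deprel": "dobj", "voice": None},
]
agree, total = voice_agreement(tokens)
for key in total:
    print(key, agree[key], total[key])
```

Accumulating both counters over all sentences and dividing `agree[key]` by `total[key]` gives a per-relation agreement rate.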

