home fi/feat edit page issue tracker

This page still pertains to UD version 1.

Clitic: clitic

(Please note: this part of the documentation is not yet completed.)

Language-specific feature identifying clitics attached to the word.

Finnish has a number of particle clitics used to express questions, politeness, or focus. UD Finnish captures the presence of these clitics using the Clitic feature, which takes one or more of the following values, with multiple values expressing combinations, for example Clitic=Ko,S for -kos (-ko + -s) as in voikos.

Kin

Expresses focus. Can often be translated into English as also. Forms contrasting pair with -kaan.

Examples

Kaan

Expresses focus in negative contexts. Realized as -kaan or -kään. Forms contrasting pair with -kin.

Examples

Ko

Expresses a question. Realized as -ko or -kö.

Examples

Han

Realized as -han or -hän.

Examples

Pa

Realized as -pa or -pä.

Examples

S

TODO

Examples

Ka

Realized as -ka or -kä. Attached to the negative verb ei, serves also as a conjunction.

Examples

References


Treebank Statistics (UD_Finnish)

This feature is language-specific. It occurs with 7 different values: Han, Ka, Kaan, Kin, Ko, Pa, S. Some words have combined values of the feature; 4 combinations have been observed: Han|Ko, Han|Pa, Ko|S, Pa|S.

1661 tokens (1%) have a non-empty value of Clitic. 977 types (2%) occur at least once with a non-empty value of Clitic. 531 lemmas (2%) occur at least once with a non-empty value of Clitic. The feature is used with 11 part-of-speech tags: fi-pos/VERB (540; 0% instances), fi-pos/AUX (344; 0% instances), fi-pos/ADV (241; 0% instances), fi-pos/NOUN (221; 0% instances), fi-pos/PRON (192; 0% instances), fi-pos/ADJ (69; 0% instances), fi-pos/PROPN (22; 0% instances), fi-pos/SCONJ (12; 0% instances), fi-pos/ADP (10; 0% instances), fi-pos/NUM (9; 0% instances), fi-pos/CCONJ (1; 0% instances).

VERB

540 fi-pos/VERB tokens (2% of all VERB tokens) have a non-empty value of Clitic.

The most frequent other feature values with which VERB and Clitic co-occurred: InfForm=EMPTY (518; 96%), Degree=EMPTY (511; 95%), PartForm=EMPTY (511; 95%), Case=EMPTY (502; 93%), Voice=Act (498; 92%), VerbForm=Fin (489; 91%), Number=Sing (409; 76%), Person=3 (294; 54%), Tense=EMPTY (286; 53%).

VERB tokens may have the following values of Clitic:

Paradigm ollaHanHan,KoKaanKinKoKo,SPaPa,S
Case=Nom|Degree=Pos|Number=Sing|PartForm=Past|VerbForm=Partollutkaan
Mood=Cnd|Number=Sing|Person=3|VerbForm=FinOlisikohanOlisiko
Mood=Ind|Number=Sing|Person=0|Tense=Pres|VerbForm=FinOnpas
Mood=Ind|Number=Sing|Person=1|Tense=Pres|VerbForm=Finolenkin
Mood=Ind|Number=Sing|Person=3|Style=Coll|Tense=Pres|VerbForm=Finonkionks
Mood=Ind|Number=Sing|Person=3|Tense=Past|VerbForm=Finolihanolikaanolipas
Mood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=FinonkoOnpa
Mood=Ind|Number=Plur|Person=1|Tense=Pres|VerbForm=Finolemmekin

AUX

344 fi-pos/AUX tokens (3% of all AUX tokens) have a non-empty value of Clitic.

The most frequent other feature values with which AUX and Clitic co-occurred: VerbForm=Fin (339; 99%), Voice=Act (324; 94%), Number=Sing (291; 85%), Polarity=EMPTY (275; 80%), Person=3 (241; 70%), Mood=Ind (233; 68%), Tense=Pres (174; 51%).

AUX tokens may have the following values of Clitic:

Paradigm ollaHanKaanKinKoKo,SPaPa,S
Case=Nom|Degree=Pos|Number=Sing|PartForm=Past|VerbForm=Part|Voice=Actollutkaan
Connegative=Yes|Mood=Cnd|VerbForm=Finolisikaan
Connegative=Yes|Mood=Ind|Style=Coll|Tense=Pres|VerbForm=Finolekkaan
Connegative=Yes|Mood=Ind|Tense=Pres|VerbForm=Finolekaanolekin
Mood=Cnd|Number=Sing|Person=0|VerbForm=Fin|Voice=Actolisiko
Mood=Cnd|Number=Sing|Person=3|VerbForm=Fin|Voice=ActOlisihanolisikinolisikoOlisipa
Mood=Ind|Number=Sing|Person=0|Tense=Past|VerbForm=Fin|Voice=Actolihanolikin
Mood=Ind|Number=Sing|Person=0|Tense=Pres|VerbForm=Fin|Voice=Actonkinonkoonpa
Mood=Ind|Number=Sing|Person=1|Tense=Past|VerbForm=Fin|Voice=Actolinkinolinko
Mood=Ind|Number=Sing|Person=1|Tense=Pres|VerbForm=Fin|Voice=Actolenkaanolenkinolenko
Mood=Ind|Number=Sing|Person=2|Style=Coll|Tense=Pres|VerbForm=Fin|Voice=Actoot
Mood=Ind|Number=Sing|Person=2|Tense=Pres|VerbForm=Fin|Voice=Actoletkooletpa
Mood=Ind|Number=Sing|Person=3|Style=Coll|Tense=Pres|VerbForm=Fin|Voice=Actonkionks
Mood=Ind|Number=Sing|Person=3|Tense=Past|VerbForm=Fin|Voice=ActolihanolikaanolikinolikoOlikosolipa
Mood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin|Voice=ActOnhanonkaanonkinonkoonkosOnpaonpas
Mood=Ind|Number=Plur|Person=2|Style=Coll|Tense=Pres|VerbForm=Fin|Voice=Actootteko
Mood=Ind|Number=Plur|Person=3|Style=Coll|Tense=Past|VerbForm=Fin|Voice=Actolihan
Mood=Ind|Number=Plur|Person=3|Tense=Past|VerbForm=Fin|Voice=Actolivathanolivatkinolivatkoolivatpa
Mood=Ind|Number=Plur|Person=3|Tense=Pres|VerbForm=Fin|Voice=ActovathanovatkaanovatkinOvatko
Mood=Ind|Style=Coll|Tense=Pres|VerbForm=Fin|Voice=PassOllaas

ADV

241 fi-pos/ADV tokens (2% of all ADV tokens) have a non-empty value of Clitic.

ADV tokens may have the following values of Clitic:

Paradigm niinHanKaanKinPa
niinhänniinkäänniinkinNiinpä

NOUN

221 fi-pos/NOUN tokens (0% of all NOUN tokens) have a non-empty value of Clitic.

The most frequent other feature values with which NOUN and Clitic co-occurred: Number=Sing (153; 69%).

NOUN tokens may have the following values of Clitic:

Paradigm miesHanKaanKin
Case=Gen|Number=Plurmiestenkin
Case=Nom|Number=SingmieshänMieskin
Case=Nom|Number=Sing|Number[psor]=Sing|Person[psor]=1miehenikin
Case=Nom|Number=Sing|Person[psor]=3miehensäkään

Clitic seems to be lexical feature of NOUN. 95% lemmas (184) occur only with one value of Clitic.

PRON

192 fi-pos/PRON tokens (2% of all PRON tokens) have a non-empty value of Clitic.

The most frequent other feature values with which PRON and Clitic co-occurred: Person=EMPTY (152; 79%), Number=Sing (146; 76%).

PRON tokens may have the following values of Clitic:

Paradigm seHanKaanKinPaS
Case=Ade|Number=Singsilläkin
Case=Ade|Number=Plurniilläkin
Case=Ade|Number=Plur|Style=Collniilki
Case=Ela|Number=Singsiitähänsiitäkinsiitäs
Case=Ela|Number=Plurniistäkin
Case=Gen|Number=SingSenhänsenkäänsenkin
Case=Gen|Number=Plurniidenkin
Case=Ill|Number=Singsiihenkin
Case=Ine|Number=SingSiinäpä
Case=Nom|Number=Singsehänsekäänsekin
Case=Nom|Number=Plurnekin
Case=Par|Number=SingSitähänsitäkäänsitäkin

ADJ

69 fi-pos/ADJ tokens (1% of all ADJ tokens) have a non-empty value of Clitic.

The most frequent other feature values with which ADJ and Clitic co-occurred: Number=Sing (52; 75%), Degree=Pos (50; 72%).

ADJ tokens may have the following values of Clitic:

Paradigm hyväKaanKin
Degree=Pos|Number=Singhyvääkin
Degree=Cmp|Number=Singparempaakaanparempaakin
Degree=Cmp|Number=Plurparempiakaan

PROPN

22 fi-pos/PROPN tokens (0% of all PROPN tokens) have a non-empty value of Clitic.

The most frequent other feature values with which PROPN and Clitic co-occurred: Number=Sing (21; 95%).

PROPN tokens may have the following values of Clitic:

Paradigm SuomiKaanKin
Case=GenSuomenkaan
Case=IneSuomessakin
Case=NomSuomikin

Clitic seems to be lexical feature of PROPN. 94% lemmas (17) occur only with one value of Clitic.

SCONJ

12 fi-pos/SCONJ tokens (0% of all SCONJ tokens) have a non-empty value of Clitic.

SCONJ tokens may have the following values of Clitic:

Paradigm josKinKo
joskinjosko

ADP

10 fi-pos/ADP tokens (0% of all ADP tokens) have a non-empty value of Clitic.

The most frequent other feature values with which ADP and Clitic co-occurred: AdpType=Post (6; 60%).

ADP tokens may have the following values of Clitic:

Paradigm jälkeenKaanKin
jälkeenkäänjälkeenkin

NUM

9 fi-pos/NUM tokens (0% of all NUM tokens) have a non-empty value of Clitic.

The most frequent other feature values with which NUM and Clitic co-occurred: Number=Sing (9; 100%), NumType=Card (9; 100%).

NUM tokens may have the following values of Clitic:

Paradigm yksiKaanKin
Case=Ablyhdeltäkään
Case=Essyhtenäkin
Case=Nomyksikin
Case=Paryhtäkään

CCONJ

1 fi-pos/CCONJ tokens (0% of all CCONJ tokens) have a non-empty value of Clitic.

CCONJ tokens may have the following values of Clitic:


Treebank Statistics (UD_Finnish-FTB)

This feature is language-specific. It occurs with 7 different values: Han, Ka, Kaan, Kin, Ko, Pa, S. Some words have combined values of the feature; 8 combinations have been observed: Han|Ka, Han|Ko, Han|Pa, Ka|S, Kaan|Ko, Kin|Ko, Ko|S, Pa|S.

2655 tokens (2%) have a non-empty value of Clitic. 1596 types (4%) occur at least once with a non-empty value of Clitic. 710 lemmas (4%) occur at least once with a non-empty value of Clitic. The feature is used with 11 part-of-speech tags: fi-pos/VERB (1359; 1% instances), fi-pos/NOUN (322; 0% instances), fi-pos/PRON (277; 0% instances), fi-pos/ADV (198; 0% instances), fi-pos/AUX (157; 0% instances), fi-pos/PART (105; 0% instances), fi-pos/ADJ (90; 0% instances), fi-pos/DET (71; 0% instances), fi-pos/PROPN (49; 0% instances), fi-pos/NUM (19; 0% instances), fi-pos/ADP (8; 0% instances).

VERB

1359 fi-pos/VERB tokens (4% of all VERB tokens) have a non-empty value of Clitic.

The most frequent other feature values with which VERB and Clitic co-occurred: PartForm=EMPTY (1320; 97%), InfForm=EMPTY (1310; 96%), Voice=Act (1279; 94%), Case=EMPTY (1271; 94%), VerbForm=Fin (1270; 93%), Number=Sing (1055; 78%), Mood=Ind (762; 56%), Person=3 (735; 54%).

VERB tokens may have the following values of Clitic:

Paradigm ollaHanHan,KoKaanKinKoKo,SPaPa,SS
Case=Gen|Number=Sing|PartForm=Past|VerbForm=Part|Voice=Actolleenkaan
Case=Gen|Number=Sing|PartForm=Pres|VerbForm=Part|Voice=Actolevankaan
Case=Ine|InfForm=2|VerbForm=Inf|Voice=Actollessakaan
Case=Lat|InfForm=1|VerbForm=Inf|Voice=ActollakaanOllakoOllapa
Case=Nom|Number=Sing|PartForm=Past|Style=Coll|VerbForm=Part|Voice=Actollukkaanollukkiollukko
Case=Nom|Number=Sing|PartForm=Past|VerbForm=Part|Voice=Actollutkaan
Case=Nom|Number=Plur|PartForm=Past|VerbForm=Part|Voice=Actolleetkin
Connegative=Yes|Mood=Ind|Number=Sing|Tense=Past|VerbForm=Fin|Voice=Actollutkaan
Connegative=Yes|Mood=Ind|Style=Coll|Tense=Pres|VerbForm=Fin|Voice=Actookin
Connegative=Yes|Mood=Ind|Tense=Pres|VerbForm=Fin|Voice=Actolekaanolekin
Mood=Cnd|Number=Sing|Person=3|Style=Coll|VerbForm=Fin|Voice=ActOiskohanoliskinoisko, olisko
Mood=Cnd|Number=Sing|Person=3|VerbForm=Fin|Voice=ActOlisihanOlisikohanolisikinolisiko
Mood=Cnd|Number=Plur|Person=2|VerbForm=Fin|Voice=ActOlisitteko
Mood=Imp|Number=Sing|Person=2|VerbForm=Fin|Voice=Actolekin
Mood=Ind|Number=Sing|Person=1|Style=Coll|Tense=Pres|VerbForm=Fin|Voice=Actolenk, oonko, ooks, Oonksmä
Mood=Ind|Number=Sing|Person=1|Tense=Past|VerbForm=Fin|Voice=Actolinkinolinko
Mood=Ind|Number=Sing|Person=1|Tense=Pres|VerbForm=Fin|Voice=ActOlenhanolenko
Mood=Ind|Number=Sing|Person=2|Style=Coll|Tense=Pres|VerbForm=Fin|Voice=Actootko, ootsä, oleks, Ookkonää, Ooksää
Mood=Ind|Number=Sing|Person=2|Tense=Past|VerbForm=Fin|Voice=ActOlithanOlitkoOlitkos
Mood=Ind|Number=Sing|Person=2|Tense=Pres|VerbForm=Fin|Voice=ActOletkohanoletkaanoletkoOletkosOletpa
Mood=Ind|Number=Sing|Person=3|Style=Coll|Tense=Past|VerbForm=Fin|Voice=ActolikiiOliks
Mood=Ind|Number=Sing|Person=3|Style=Coll|Tense=Pres|VerbForm=Fin|Voice=Actonkionks, onk
Mood=Ind|Number=Sing|Person=3|Tense=Past|VerbForm=Fin|Voice=ActolihanOlikohanolikaanolikinoliko
Mood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin|Voice=ActonhanOnkohanonkaanonkinonkoonkosOnpa
Mood=Ind|Number=Plur|Person=1|Tense=Pres|VerbForm=Fin|Voice=Actolemmeko
Mood=Ind|Number=Plur|Person=2|Style=Coll|Tense=Past|VerbForm=Fin|Voice=ActOlitteks
Mood=Ind|Number=Plur|Person=2|Style=Coll|Tense=Pres|VerbForm=Fin|Voice=Actootteko, Oottekste
Mood=Ind|Number=Plur|Person=2|Tense=Pres|VerbForm=Fin|Voice=ActOletteko
Mood=Ind|Number=Plur|Person=3|Tense=Past|VerbForm=Fin|Voice=Actolivatkaan
Mood=Ind|Number=Plur|Person=3|Tense=Pres|VerbForm=Fin|Voice=Actovatkinovatko
Mood=Ind|Style=Coll|Tense=Pres|VerbForm=Fin|Voice=PassOllaas
Mood=Ind|Tense=Past|VerbForm=Fin|Voice=PassOltiinhanOltiinkin
Mood=Ind|Tense=Pres|VerbForm=Fin|Voice=PassOllaanpas
Mood=Pot|Number=Sing|Person=3|VerbForm=Fin|Voice=ActLiekö, lieneekö

NOUN

322 fi-pos/NOUN tokens (1% of all NOUN tokens) have a non-empty value of Clitic.

The most frequent other feature values with which NOUN and Clitic co-occurred: Number=Sing (238; 74%).

NOUN tokens may have the following values of Clitic:

Paradigm lapsiHanKaanKin
Case=Ela|Number=SingLapsestakin
Case=Ill|Number=PlurLapsiinhan
Case=Nom|Number=Singlapsikaanlapsikin
Case=Nom|Number=Plurlapsetkin

Clitic seems to be lexical feature of NOUN. 92% lemmas (234) occur only with one value of Clitic.

PRON

277 fi-pos/PRON tokens (3% of all PRON tokens) have a non-empty value of Clitic.

The most frequent other feature values with which PRON and Clitic co-occurred: Number=Sing (213; 77%), Person=EMPTY (184; 66%), Case=Nom (159; 57%).

PRON tokens may have the following values of Clitic:

Paradigm seHanKaanKaan,KoKinKoPaPa,S
Case=Adesilläkin
Case=ElasiitähänsiitäkäänSiitäkinSiitäpä
Case=Gensenhänsenkään
Case=IllsiihenkinSiihenkö
Case=Inesiinäkinsiinäpä
Case=Ine|Style=Collsiinähä
Case=NomsehänsekäänsekinseköSepäSepäs
Case=ParSitähänsitäkäänsitäkäänkösitäkinSitäkö

ADV

198 fi-pos/ADV tokens (2% of all ADV tokens) have a non-empty value of Clitic.

The most frequent other feature values with which ADV and Clitic co-occurred: PronType=EMPTY (120; 61%).

ADV tokens may have the following values of Clitic:

Paradigm miksiHanHan,KaHan,KoKoPa
miksihänMiksikähänMiksiköhänMiksikömiksipä

AUX

157 fi-pos/AUX tokens (5% of all AUX tokens) have a non-empty value of Clitic.

The most frequent other feature values with which AUX and Clitic co-occurred: Voice=Act (153; 97%), VerbForm=Fin (153; 97%), Mood=Ind (135; 86%), Number=Sing (135; 86%), Person=3 (119; 76%), Tense=Pres (93; 59%).

AUX tokens may have the following values of Clitic:

Paradigm ollaHanHan,KoKaanKinKoKo,SPaPa,S
Case=Lat|InfForm=1|VerbForm=Inf|Voice=Actollapa
Case=Nom|Number=Plur|PartForm=Past|VerbForm=Part|Voice=Actolleetkaan
Connegative=Yes|Mood=Ind|Number=Sing|Tense=Past|VerbForm=Fin|Voice=Actollutkaan
Connegative=Yes|Mood=Ind|Tense=Pres|VerbForm=Fin|Voice=Actolekaanolekin
Mood=Cnd|Number=Sing|Person=1|VerbForm=Fin|Voice=ActOlisinko
Mood=Cnd|Number=Sing|Person=2|VerbForm=Fin|Voice=ActOlisitpa
Mood=Cnd|Number=Sing|Person=3|Style=Coll|VerbForm=Fin|Voice=Actoisko, olisko
Mood=Cnd|Number=Sing|Person=3|VerbForm=Fin|Voice=Actolisikinolisiko
Mood=Cnd|Number=Plur|Person=3|VerbForm=Fin|Voice=Actolisivatko
Mood=Imp|Number=Sing|Person=2|VerbForm=Fin|Voice=Actolepa
Mood=Imp|Number=Sing|Person=3|VerbForm=Fin|Voice=Actolkoonkinolkoonpa
Mood=Ind|Number=Sing|Person=1|Tense=Past|VerbForm=Fin|Voice=ActolinkinolinkoOlinpa
Mood=Ind|Number=Sing|Person=1|Tense=Pres|VerbForm=Fin|Voice=Actolenkinolenko
Mood=Ind|Number=Sing|Person=2|Style=Coll|Tense=Pres|VerbForm=Fin|Voice=Actoleksä, ook
Mood=Ind|Number=Sing|Person=2|Tense=Pres|VerbForm=Fin|Voice=Actoletkinoletkooletpa
Mood=Ind|Number=Sing|Person=3|Style=Coll|Tense=Past|VerbForm=Fin|Voice=Actoliks, olik
Mood=Ind|Number=Sing|Person=3|Style=Coll|Tense=Pres|VerbForm=Fin|Voice=Actonkohaonks, onk
Mood=Ind|Number=Sing|Person=3|Tense=Past|VerbForm=Fin|Voice=ActOlihanolikaanolikinolikoOlikosolipaOlipas
Mood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin|Voice=ActonhanOnkohanonkaanonkinonkoonpaOnpas
Mood=Ind|Number=Plur|Person=1|Tense=Pres|VerbForm=Fin|Voice=Actolemmeko
Mood=Ind|Number=Plur|Person=3|Tense=Pres|VerbForm=Fin|Voice=Actovathanovatkaanovatkinovatko
Mood=Ind|Tense=Pres|VerbForm=Fin|Voice=PassOllaanhanollaanpas
Mood=Pot|Number=Sing|Person=3|VerbForm=Fin|Voice=ActLieneekö

PART

105 fi-pos/PART tokens (2% of all PART tokens) have a non-empty value of Clitic.

PART tokens may have the following values of Clitic:

Paradigm kylläHanKaanKinPaPa,S
_kyllähänkylläkäänkylläkinKylläpäkylläpäs
Style=Collkylhän

ADJ

90 fi-pos/ADJ tokens (1% of all ADJ tokens) have a non-empty value of Clitic.

The most frequent other feature values with which ADJ and Clitic co-occurred: Number=Sing (58; 64%).

ADJ tokens may have the following values of Clitic:

Paradigm omaKinPa
Case=Ela|Number=Plur|Style=Collomistaki
Case=Nom|Number=SingOmapa
Case=Nom|Number=Pluromatkin

Clitic seems to be lexical feature of ADJ. 94% lemmas (59) occur only with one value of Clitic.

DET

71 fi-pos/DET tokens (2% of all DET tokens) have a non-empty value of Clitic.

The most frequent other feature values with which DET and Clitic co-occurred: Number=Sing (43; 61%).

DET tokens may have the following values of Clitic:

Paradigm tämäHanKaanKinKo
Case=EssTänäkääntänäkin
Case=Gentämänkääntämänkin
Case=Inetässäkin
Case=NomTämähänTämäkäänTämäkö
Case=Par|Style=Colltätäkä

PROPN

49 fi-pos/PROPN tokens (1% of all PROPN tokens) have a non-empty value of Clitic.

The most frequent other feature values with which PROPN and Clitic co-occurred: Number=Sing (48; 98%).

PROPN tokens may have the following values of Clitic:

Clitic seems to be lexical feature of PROPN. 100% lemmas (41) occur only with one value of Clitic.

NUM

19 fi-pos/NUM tokens (1% of all NUM tokens) have a non-empty value of Clitic.

The most frequent other feature values with which NUM and Clitic co-occurred: NumType=Card (19; 100%), Number=Sing (18; 95%), Case=Nom (14; 74%).

NUM tokens may have the following values of Clitic:

Paradigm yksiKaanKin
Case=Essyhtenäkään
Case=Genyhdenkin
Case=Nomyksikäänyksikin

ADP

8 fi-pos/ADP tokens (0% of all ADP tokens) have a non-empty value of Clitic.

ADP tokens may have the following values of Clitic:


Clitic in other languages: [fi]