home edit page issue tracker

This page pertains to UD version 2.

UD for Czech

Tokenization and Word Segmentation


Instruction: Describe the general rules for delimiting words (for example, based on whitespace and punctuation) and exceptions to these rules. Specify whether words with spaces and/or multiword tokens occur. Include links to further language-specific documentation if available.


Morphology

Tags

This is an overview only. For more detailed discussion and examples, see the list of Czech POS tags and Czech features.


Instruction: Specify any unused tags. Explain what words are tagged as PART. Describe how the AUX-VERB and DET-PRON distinctions are drawn, and specify whether there are (de)verbal forms tagged as ADJ, ADV or NOUN. Include links to language-specific tag definitions if any.


Nominal Features

Degree and Polarity

Verbal Features

Pronouns, Determiners, Quantifiers

Other Features


Instruction: Describe inherent and inflectional features for major word classes (at least NOUN and VERB). Describe other noteworthy features. Include links to language-specific feature definitions if any.


Syntax

This is an overview only. For more detailed discussion and examples, see the list of Czech relations, as well as Czech-specific examples scattered across the documentation of constructions.

Core Arguments, Oblique Arguments and Adjuncts

Non-verbal Clauses

Relations Overview


Instruction: Give criteria for identifying core arguments (subjects and objects), and describe the range of copula constructions in nonverbal clauses. List all subtype relations used. Include links to language-specific relations definitions if any.


Treebanks

There are five Czech UD treebanks:


Instruction: Treebank-specific pages are generated automatically from the README file in the treebank repository and from the data in the latest release. Link to the respective *-index.html page in the treebanks folder, using the language code and the treebank code in the file name.