Web register annotation guidelines

The annotation task consists of two steps: deciding whether to accept or reject a document, and assigning one or more register labels to each accepted document.

When to accept or reject a document

When to give a document several labels

Short list of register labels and their abbreviations

Video instructions for annotating in Prodigy

Please note that

  • You can see what the original web page looks like by following the document URL shown in the annotator
  • The annotation decision should, however, be based on the text shown in the annotator
  • Some documents may be followed by a large number of comments. Please do not base your decision on those.

Quickstart

  1. Is the web page Machine translated or generated from a template?
  2. Is the web page Lyrical, such as songs or poems?
  3. Is the web page originally spoken? (Texts composed of more than 50% spoken quotes are classified as spoken)
  4. Is the web page Interactive discussion written by multiple participants in a discussion format (e.g. discussion or Q&A forum)? (Reader comments following e.g. an article or blog post are NOT included here)
  5. Is the purpose of the document to narrate or report on EVENTS? If yes, select one of the following registers:
  6. Is the purpose of the document to explain HOW-TO or INSTRUCTIONS?
    • If yes, is it a Recipe? If so, select Recipe.
    • If it is not a recipe, select Other how-to. These are typically step-by-step, objective instructions on how to do something.
  7. Is the purpose of the document to describe or explain INFORMATION? If yes, select one of the following registers:
  8. Is the purpose of the document to express OPINIONS? If yes, select one of the following registers:
  9. Is the purpose of the document to describe or explain FACTS WITH INTENT TO PERSUADE or MARKET? If yes, select one of the following registers:
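
The Quickstart questions above can be read as an ordered checklist. The following is a minimal sketch of that reading in Python, assuming the questions are asked in order and the first "yes" decides the top-level choice. The answer keys, the outcome strings, and the helper names are hypothetical placeholders rather than the official labels or abbreviations, and measuring the 50% rule in characters is also an assumption, since the guidelines do not name a unit. The sketch returns a single outcome and does not cover documents that receive several labels.

```python
from typing import Dict, Optional

# Ordered (answer key, outcome) pairs mirroring Quickstart steps 1-9.
# Keys and outcome strings are illustrative placeholders, not official labels.
QUICKSTART_STEPS = [
    ("machine_translated_or_template", "machine translated / generated"),
    ("lyrical", "lyrical"),
    ("originally_spoken", "spoken"),
    ("interactive_discussion", "interactive discussion"),
    ("narrates_events", "events (choose a sub-register)"),
    ("how_to", "how-to"),
    ("informational", "information (choose a sub-register)"),
    ("opinion", "opinions (choose a sub-register)"),
    ("persuasive", "persuade or market (choose a sub-register)"),
]


def mostly_spoken(spoken_quote_chars: int, total_chars: int) -> bool:
    """Step 3 rule: more than 50% of the text consists of spoken quotes.

    Counting characters is an assumption made for this sketch.
    """
    return total_chars > 0 and spoken_quote_chars / total_chars > 0.5


def quickstart(answers: Dict[str, bool], is_recipe: bool = False) -> Optional[str]:
    """Return the outcome of the first question answered 'yes', or None."""
    for key, outcome in QUICKSTART_STEPS:
        if answers.get(key, False):
            if key == "how_to":
                # Step 6 has only two options: Recipe or Other how-to.
                return "Recipe" if is_recipe else "Other how-to"
            return outcome
    return None


if __name__ == "__main__":
    # A song page that also expresses opinions: step 2 comes before step 8.
    print(quickstart({"lyrical": True, "opinion": True}))
    # An instructional page that is not a recipe.
    print(quickstart({"how_to": True}))
```

Because the steps are stored as an ordered list, earlier questions take priority over later ones, which matches reading the Quickstart from top to bottom.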