Paper ID: 2410.02281

Annotation Guidelines for Corpus Novelties: Part 1 -- Named Entity Recognition

Arthur Amalvy (LIA), Vincent Labatut (LIA)

The Novelties corpus is a collection of novels (and parts of novels) annotated for Named Entity Recognition (NER), among other tasks. This document describes the guidelines applied during its annotation. It contains the instructions used by the annotators, along with a number of examples drawn from the annotated novels that illustrate both expressions that should be marked as entities and expressions that should not.

Submitted: Oct 3, 2024