Quotation Extraction

Quotation extraction, the task of identifying and attributing quotes within text, is a growing area of natural language processing research focused on improving the accuracy and efficiency of automated quote identification and speaker attribution. Current work emphasizes advancements in model architectures, such as leveraging pre-trained language models and incorporating character embeddings to better handle complex scenarios like anaphora and implicit quotes, across diverse text types including novels and news articles. This research is significant for its applications in various fields, including media analysis (e.g., identifying influence networks), fact-checking (e.g., verifying source credibility), and literary studies (e.g., analyzing character relationships). The development of high-quality annotated corpora in multiple languages is also a key focus to facilitate further progress.

Papers