Paper ID: 2409.20466

Language Resources in Spanish for Automatic Text Simplification across Domains

Antonio Moreno-Sandoval, Leonardo Campillos-Llanos, Ana García-Serrano

This work describes the language resources and models developed for automatic simplification of Spanish texts in three domains: finance, medicine and history studies. We created several corpora in each domain, annotation and simplification guidelines, a lexicon of technical and simplified medical terms, datasets used in shared tasks for the financial domain, and two simplification tools. The methodology, resources and companion publications are publicly available on the website: this https URL.

Submitted: Sep 30, 2024