Perso Arabic Script

Perso-Arabic script research focuses on developing computational tools and techniques to effectively process and analyze the diverse writing systems using this script family. Current efforts concentrate on improving language identification within the script, particularly addressing challenges posed by multilingual contexts and inconsistent orthographies, often employing supervised machine learning models and finite-state transducers for tasks like normalization and transliteration. These advancements are crucial for improving natural language processing capabilities for numerous languages, particularly those with limited digital resources, impacting fields like machine translation and language modeling.

Papers