Tamil Language
Tamil, a Dravidian language with significant diglossia (literary vs. colloquial forms), is a focus of increasing research in natural language processing (NLP). Current efforts concentrate on developing accurate automatic speech recognition and dialect identification systems, employing machine learning models like convolutional neural networks (CNNs), recurrent neural networks (RNNs), and Gaussian Mixture Models (GMMs), often enhanced with attention mechanisms. This research is crucial for preserving linguistic diversity, improving human-computer interaction for Tamil speakers, and advancing multilingual NLP capabilities, particularly within the context of low-resource languages.
Papers
Zero-shot Code-Mixed Offensive Span Identification through Rationale Extraction
Manikandan Ravikiran, Bharathi Raja Chakravarthi
Findings of the Shared Task on Offensive Span Identification from Code-Mixed Tamil-English Comments
Manikandan Ravikiran, Bharathi Raja Chakravarthi, Anand Kumar Madasamy, Sangeetha Sivanesan, Ratnavel Rajalakshmi, Sajeetha Thavareesan, Rahul Ponnusamy, Shankar Mahadevan