Language Diarization

Language diarization (LD) aims to automatically identify the spoken language(s) and their temporal boundaries within a multi-speaker conversation, often in multilingual settings. Current research focuses on improving LD accuracy using techniques like spectral clustering, self-supervised learning with architectures such as WavLM, and integrating LD with speaker diarization and speech recognition systems, often employing implicit language modeling to handle low-resource languages. These advancements are crucial for improving the performance of speech technologies in diverse, real-world scenarios, such as multilingual transcription and cross-lingual information retrieval. The development of robust LD systems is vital for bridging the language gap in increasingly globalized communication.

Papers

September 16, 2024

TCG CREST System Description for the Second DISPLACE Challenge
Nikhil Raghav, Subhajit Saha, Md Sahidullah, Swagatam Das
Speaker Diarization Multi Speaker Voice Activity Detection Conformational Ensemble M2MeT Challenge Language Diarization

June 13, 2024

December 15, 2023

Fine-Tuned Self-Supervised Speech Representations for Language Diarization in Multilingual Code-Switched Speech
Geoffrey Frost, Emily Morris, Joshua Jansen van Vüren, Thomas Niesler
Self Supervised Speech Representation Code Switched Diarization Error Rate Language Diarization

November 21, 2023

Summary of the DISPLACE Challenge 2023 - DIarization of SPeaker and LAnguage in Conversational Environments
Shikha Baghel, Shreyas Ramoji, Somil Jain, Pratik Roy Chowdhuri, Prachi Singh, Deepu Vijayasenan, Sriram Ganapathy
Human Language Speech Data Speaker Diarization Speech Driven Refined Diarization Core Challenge Language Diarization

August 21, 2023

Implicit Self-supervised Language Representation for Spoken Language Diarization
Jagabandhu Mishra, S. R. Mahadeva Prasanna
Language Model Self Supervised Speaker Diarization Language Diarization

June 22, 2023

Implicit spoken language diarization
Jagabandhu Mishra, Amartya Chowdhury, S. R. Mahadeva Prasanna
Speaker Diarization Implicit Language Phoneme Recognition Diarization Error Rate Language Diarization Phonotactic Complexity

March 1, 2023

DISPLACE Challenge: DIarization of SPeaker and LAnguage in Conversational Environments
Shikha Baghel, Shreyas Ramoji, Sidharth, Ranjana H, Prachi Singh, Somil Jain, Pratik Roy Chowdhuri, Kaustubh Kulkarni, Swapnil Padhi, Deepu Vijayasenan, Sriram Ganapathy
Human Language Speaker Diarization Code Mixed Speech Driven Refined Diarization Core Challenge Language Diarization

October 26, 2022

Reducing Language confusion for Code-switching Speech Recognition with Token-level Language Diarization
Hexin Liu, Haihua Xu, Leibny Paola Garcia, Andy W. H. Khong, Yi He, Sanjeev Khudanpur
Automatic Speech Recognition Code Switching Automatic Speech Recognition Language Diarization

Language Diarization

Papers

TCG CREST System Description for the Second DISPLACE Challenge

The Second DISPLACE Challenge : DIarization of SPeaker and LAnguage in Conversational Environments

Exploring Spoken Language Identification Strategies for Automatic Transcription of Multilingual Broadcast and Institutional Speech

Fine-Tuned Self-Supervised Speech Representations for Language Diarization in Multilingual Code-Switched Speech

Summary of the DISPLACE Challenge 2023 - DIarization of SPeaker and LAnguage in Conversational Environments

Implicit Self-supervised Language Representation for Spoken Language Diarization

Implicit spoken language diarization

DISPLACE Challenge: DIarization of SPeaker and LAnguage in Conversational Environments

Reducing Language confusion for Code-switching Speech Recognition with Token-level Language Diarization