Speaker Detection
Speaker detection research focuses on identifying and tracking individual speakers in audio recordings, including recordings with several simultaneous speakers or heavy background noise. Current approaches use deep learning models, such as LSTM-CNNs and scene graph generation networks, to extract features from audio and, in some work, from brainwave (EEG) data, with the aim of improving accuracy and reducing latency. Applications range from assistive hearing technologies and home monitoring systems for the elderly to security and forensic analysis. Ongoing research aims to make detection more robust in challenging acoustic conditions and to make speaker identification faster and more accurate.