Speaker Recognition Task

Speaker recognition is the task of identifying individuals from their voice. Current research focuses on improving robustness to noisy environments and channel variations, often using deep learning architectures such as time-delay neural networks (TDNNs) and Transformers, sometimes combined with attention mechanisms and self-supervised speech representations such as wav2vec 2.0. This work is driven by the need for reliable speaker verification in diverse real-world conditions, with applications ranging from security and forensics to personalized user interfaces and accessibility technologies.

Papers

July 14, 2022

Deep versus Wide: An Analysis of Student Architectures for Task-Agnostic Knowledge Distillation of Self-Supervised Speech Models
Takanori Ashihara, Takafumi Moriya, Kohei Matsuura, Tomohiro Tanaka
Knowledge Distillation, Automatic Speech Recognition, Self Supervised, General Analysis, Self Supervised Speech Model, Speaker Identification, Speaker Recognition Task, Task Agnostic Distillation, Student Architecture

May 3, 2022

Efficient dynamic filter for robust and low computational feature extraction
Donghyeon Kim, Gwantae Kim, Bokyeung Lee, Jeong-gi Kwak, David K. Han, Hanseok Ko
Feature Extraction, Speaker Recognition Task, Dynamic Filter

April 8, 2022

April 6, 2022

A New Nonlinear speaker parameterization algorithm for speaker identification
Mohamed Chetouani, Marcos Faundez-Zanuy, Bruno Gas, Jean-Luc Zarader
Speaker Identification, Nonlinear Prediction, Speaker Recognition Task, Neural Network Initialization, Spatial Acoustic

March 28, 2022

Robust Speaker Recognition with Transformers Using wav2vec 2.0
Sergey Novoselov, Galina Lavrentyeva, Anastasia Avdeeva, Vladimir Volokhov, Aleksei Gusev
Transformer, Speaker Verification, Speaker Recognition Task, Unsupervised Speech Representation

February 24, 2022

On the relevance of bandwidth extension for speaker identification
Marcos Faundez-Zanuy, Mattias Nilsson, W. Bastiaan Kleijn
Speech Signal, Relative Relevance, Speaker Identification, Speaker Recognition Task, Mel Frequency Cepstral Coefficient, Bandwidth Extension

December 14, 2021

Explore Long-Range Context feature for Speaker Verification
Zhuo Li
Speaker Verification, Multi Head Self Attention, Sparse Attention, Long Range Context, Speaker Recognition Task, Separable Self Attention