Pitch Extractor
Pitch extraction, the process of identifying the fundamental frequency of a sound, is crucial for various applications in music information retrieval, speech processing, and audio synthesis. Current research focuses on improving the robustness of pitch extraction algorithms, particularly in challenging acoustic environments (e.g., reverberation, noise) and for complex signals like singing voices, often employing convolutional neural networks, Gammatone filterbanks, and attention-based mechanisms within multi-scale frameworks. These advancements are driving improvements in automatic music transcription, singing voice synthesis, and objective evaluation methods for pitch extraction algorithms themselves, ultimately leading to more accurate and reliable tools for scientific research and practical applications.
Papers
Measuring pitch extractors' response to frequency-modulated multi-component signals
Hideki Kawahara, Kohei Yatabe, Ken-Ichi Sakakibara, Tatsuya Kitamura, Hideki Banno, Masanori Morise
An objective test tool for pitch extractors' response attributes
Hideki Kawahara, Kohei Yatabe, Ken-Ichi Sakakibara, Tatsuya Kitamura, Hideki Banno, Masanori Morise