Stochastic Pitch Prediction

Stochastic pitch prediction models the fundamental frequency (F0) of sound probabilistically, as a distribution rather than a single point estimate, particularly in speech and music synthesis. Current research emphasizes robust end-to-end models, often built on variational autoencoders or other neural networks such as convolutional networks, that model pitch variability explicitly to improve the naturalness and diversity of generated audio. This work advances speech and music synthesis by enabling more realistic and expressive audio generation in applications such as text-to-speech systems and singing voice synthesis.
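As a rough illustration of the idea, the sketch below draws pitch contours from a per-frame Gaussian over log-F0 instead of returning only the mean. It is a minimal sketch under stated assumptions, not the method of any paper listed here: the function name `sample_pitch_contour`, the Gaussian-over-log-F0 parameterization, the `temperature` knob, and the example values are all hypothetical.

```python
import math
import random


def sample_pitch_contour(mean_log_f0, std_log_f0, temperature=1.0, seed=None):
    """Draw one pitch contour in Hz from per-frame Gaussians over log-F0.

    mean_log_f0, std_log_f0: per-frame mean and standard deviation of log-F0,
    standing in for the output of a hypothetical pitch predictor.
    temperature: scales the noise; 0.0 reduces to the deterministic mean.
    """
    rng = random.Random(seed)
    return [
        math.exp(mean + temperature * std * rng.gauss(0.0, 1.0))
        for mean, std in zip(mean_log_f0, std_log_f0)
    ]


# Assumed predictor output for five frames (illustrative values only).
mean_log_f0 = [math.log(f0) for f0 in (200.0, 210.0, 220.0, 215.0, 205.0)]
std_log_f0 = [0.05] * 5

deterministic = sample_pitch_contour(mean_log_f0, std_log_f0, temperature=0.0)
sample_a = sample_pitch_contour(mean_log_f0, std_log_f0, seed=0)
sample_b = sample_pitch_contour(mean_log_f0, std_log_f0, seed=1)
```

With `temperature=0.0` every call returns the same mean contour, while different seeds yield different but plausible contours for the same input, which is the source of the diversity the summary above refers to.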

Papers

June 14, 2024

Period Singer: Integrating Periodic and Aperiodic Variational Autoencoders for Natural-Sounding End-to-End Singing Voice Synthesis
Taewoo Kim, Choongsang Cho, Young Han Lee
Waveform Domain Vocal Performance Variational Recurrent Periodicity Detection Phoneme Alignment End to End Singing Voice Stochastic Pitch Prediction

May 13, 2024

PitcherNet: Powering the Moneyball Evolution in Baseball Video Analytics
Jerrin Bright, Bavesh Balaji, Yuhao Chen, David A Clausi, John S Zelek
Sport Video Stochastic Pitch Prediction Player Movement

May 28, 2023

Stochastic Pitch Prediction Improves the Diversity and Naturalness of Speech in Glow-TTS
Sewade Ogun, Vincent Colotte, Emmanuel Vincent
Speech Analysis Text to Speech Diversity Awareness Text to Speech Model Visual Naturalness Stochastic Pitch Prediction

October 28, 2022

Period VITS: Variational Inference with Explicit Pitch Modeling for End-to-end Emotional Speech Synthesis
Yuma Shirahata, Ryuichi Yamamoto, Eunwoo Song, Ryo Terashima, Jae-Min Kim, Kentaro Tachibana
End to End Variational Inference Speech Synthesis Prosodic Feature Periodicity Detection Vocoder Model Stochastic Pitch Prediction End to End Tt System

October 27, 2022

A Fast and Accurate Pitch Estimation Algorithm Based on the Pseudo Wigner-Ville Distribution
Yisi Liu, Peter Wu, Alan W Black, Gopala K. Anumanchipalli
Pseudo Profound Statement Wigner Ville Distribution Stochastic Pitch Prediction

June 29, 2022

Comparing Conventional Pitch Detection Algorithms with a Neural Network Approach
Anja Kroon
Pitch Estimation Neural Network Approach Stochastic Pitch Prediction
