Pitch Controllability

Pitch controllability research focuses on accurately manipulating and representing pitch information in audio signals, aiming to improve applications ranging from music information retrieval to speech synthesis. Current efforts involve developing novel model architectures, such as conditional autoencoders and transformers, often incorporating techniques like variational inference and data augmentation to enhance robustness and accuracy. These advancements are impacting diverse fields, improving the quality of synthesized speech, enabling more precise music analysis, and enhancing the performance of audio processing systems.

Papers