Timbre Perception

Timbre perception, the subjective experience of a sound's unique "color," is a crucial area of research in auditory science and music technology, aiming to understand how we perceive and represent timbre and how this perception can be manipulated and modeled. Current research focuses on developing machine learning models, including neural networks and autoencoders, to analyze and synthesize timbre, often employing techniques like self-supervised learning and disentanglement of timbre from other audio features (e.g., pitch, style). These advancements have implications for various applications, such as improving music transcription systems, creating realistic virtual instruments, and enhancing speech and singing voice conversion technologies.

Papers