Text to Music

Text-to-music research aims to generate musical audio or symbolic representations from textual descriptions, enabling users to create music through natural language. Current efforts focus on improving the quality and controllability of generated music using large language models (LLMs) to enhance datasets and refine diffusion models, as well as exploring model compression for wider accessibility. These advancements are significant for both music creation and the broader field of AI, offering new tools for composers and researchers while pushing the boundaries of cross-modal generation and representation learning.

Papers