Qwen Audio
Qwen Audio is a family of large-scale audio-language models designed to achieve universal audio understanding, enabling interaction with diverse audio types and tasks. Current research focuses on improving instruction-following capabilities through techniques like natural language prompting and optimized pre-training, as well as developing efficient architectures that handle varying audio resolutions and seamlessly integrate voice chat and audio analysis functionalities. These advancements are significant for advancing multimodal AI, offering potential for improved human-computer interaction and applications in areas like voice assistants, audio analysis tools, and accessibility technologies.
Papers
November 8, 2024
October 24, 2024
October 9, 2024
September 18, 2024
July 15, 2024
November 14, 2023
September 28, 2023
August 24, 2023