Text Description
Current research in text description focuses on leveraging the power of large language models (LLMs) and vision-language models (VLMs) to bridge the gap between textual descriptions and various modalities, including images, 3D scenes, sounds, and even robot designs. Key research areas involve generating realistic outputs from text prompts, improving the robustness of systems to noise and ambiguity, and developing methods for disentangling complex representations to enable finer control and editing. This work has significant implications for diverse fields, ranging from robotics and virtual reality to geology and materials science, by enabling more intuitive and efficient interaction with complex data and systems.
Papers
October 4, 2024
July 25, 2024
July 21, 2024
June 28, 2024
May 13, 2024
March 22, 2024
February 23, 2024
January 8, 2024
January 4, 2024
December 8, 2023
October 21, 2023
October 9, 2023
October 8, 2023
September 11, 2023
July 10, 2023
June 28, 2023
May 3, 2023
February 17, 2023
January 21, 2023