Text Description
Current research in text description focuses on leveraging the power of large language models (LLMs) and vision-language models (VLMs) to bridge the gap between textual descriptions and various modalities, including images, 3D scenes, sounds, and even robot designs. Key research areas involve generating realistic outputs from text prompts, improving the robustness of systems to noise and ambiguity, and developing methods for disentangling complex representations to enable finer control and editing. This work has significant implications for diverse fields, ranging from robotics and virtual reality to geology and materials science, by enabling more intuitive and efficient interaction with complex data and systems.
Papers
December 21, 2022
November 22, 2022
November 8, 2022
October 2, 2022
September 19, 2022
September 5, 2022
August 3, 2022
May 5, 2022
April 17, 2022
January 12, 2022