Textual Guidance
Textual guidance in AI focuses on using text descriptions to control the generation and manipulation of visual data, including images, videos, and 3D models. Current research emphasizes improving the accuracy and efficiency of this control, exploring techniques like cross-frame textual guidance for video generation and visual context-modulated prompts for diffusion models. This field is crucial for advancing AI's creative capabilities and mitigating safety concerns, such as generating inappropriate content, through methods like adversarial prompt detection. The development of robust and reliable textual guidance is vital for numerous applications, ranging from content creation to image editing and 3D modeling.
Papers
August 15, 2024
March 3, 2024
December 9, 2023
December 3, 2023
March 28, 2023