Instructional Visual Editing
Instructional visual editing focuses on modifying images based on textual instructions, aiming to create systems that accurately and efficiently translate natural language commands into visual changes. Current research emphasizes developing robust models, often leveraging large language and diffusion models, and creating diverse, high-quality datasets to train these models, including those incorporating human feedback for improved alignment with user preferences. This field is significant for its potential to improve image manipulation tools, enhance accessibility for non-experts, and address ethical concerns surrounding AI-generated imagery by enabling responsible editing of potentially harmful content.
Papers
August 3, 2024
May 7, 2024
April 8, 2024
March 16, 2023
November 17, 2022
December 6, 2021