CLIP Space

CLIP space, the embedding space generated by the CLIP (Contrastive Language–Image Pre-training) model, is being actively explored for its potential in enabling flexible and efficient text-guided image manipulation. Current research focuses on leveraging CLIP's ability to bridge the gap between text and image representations to develop methods for image editing and generation, often employing diffusion models or GANs, by manipulating CLIP embeddings directly or their differences (DeltaSpace). This approach offers advantages such as text-free training and zero-shot inference capabilities, reducing the reliance on large annotated datasets and improving the efficiency and versatility of image editing tools.

Papers

May 1, 2024

TexSliders: Diffusion-Based Texture Editing in CLIP Space
Julia Guerrero-Viu, Milos Hasan, Arthur Roullier, Midhun Harikumar, Yiwei Hu, Paul Guerrero, Diego Gutierrez, Belen Masia, Valentin Deschaintre
Generative Model Diffusion Generation Natural Video Texture Editing CLIP Space

October 12, 2023

DeltaSpace: A Semantic-aligned Feature Space for Flexible Text-guided Image Editing
Yueming Lyu, Kang Zhao, Bo Peng, Yue Jiang, Yingya Zhang, Jing Dong
Generative Model Text Guided Image Editing CLIP Space

March 11, 2023

DeltaEdit: Exploring Text-free Training for Text-Driven Image Manipulation
Yueming Lyu, Tianwei Lin, Fu Li, Dongliang He, Jing Dong, Tieniu Tan
Training Free Conditional Generative CLIP Embeddings Text Driven Image Manipulation CLIP Space

March 17, 2022

One-Shot Adaptation of GAN in Just One CLIP
Gihyun Kwon, Jong Chul Ye
Fine Tuning GAN Model Adaptation Concern Single CLIP CLIP Space

CLIP Space

Papers

TexSliders: Diffusion-Based Texture Editing in CLIP Space

DeltaSpace: A Semantic-aligned Feature Space for Flexible Text-guided Image Editing

DeltaEdit: Exploring Text-free Training for Text-Driven Image Manipulation

One-Shot Adaptation of GAN in Just One CLIP