Image Transformer
Image transformers leverage the power of self-attention mechanisms, initially developed for natural language processing, to analyze and manipulate images and videos. Current research focuses on improving efficiency (e.g., through techniques like group-shifted window attention and wavelet transforms), expanding applications (including image restoration, inpainting, generation, and video understanding), and addressing challenges like memory consumption and bias in model outputs. This rapidly evolving field is significantly impacting computer vision, enabling advancements in diverse areas such as medical image analysis, robotic interaction, and creative content generation.
Papers
November 15, 2024
October 9, 2024
September 16, 2024
September 10, 2024
August 21, 2024
June 30, 2024
June 28, 2024
June 11, 2024
July 28, 2023
July 16, 2023
June 29, 2023
June 6, 2023
April 13, 2023
April 10, 2023
March 17, 2023
March 8, 2023
August 3, 2022
July 6, 2022
July 5, 2022