Visual Relation
Visual relation understanding in computer vision aims to enable machines to comprehend the relationships between objects within images and videos, mirroring human visual perception. Current research focuses on improving the accuracy and efficiency of visual relation detection and generation using various deep learning architectures, including transformers, graph neural networks, and diffusion models, often incorporating techniques like active perception and knowledge graphs to enhance performance. This field is crucial for advancing artificial intelligence, with applications ranging from scene understanding and image captioning to more complex tasks like robotic manipulation and medical image analysis.
Papers
June 18, 2022
June 11, 2022
May 21, 2022
April 24, 2022
April 6, 2022
April 3, 2022
December 10, 2021