Object Localization
Object localization, the task of precisely determining the location and extent of objects within an image or scene, is a core problem in computer vision with applications ranging from robotics to medical imaging. Current research emphasizes improving robustness and accuracy under challenging conditions like distribution shifts (e.g., varying weather or viewpoints) and limited data, often employing convolutional neural networks (CNNs), transformers, and graph-based methods for feature extraction and object representation. These advancements are crucial for enhancing the reliability and performance of numerous applications, including autonomous navigation, object manipulation by robots, and medical image analysis.
Papers
I see what you hear: a vision-inspired method to localize words
Mohammad Samragh, Arnav Kundu, Ting-Yao Hu, Minsik Cho, Aman Chadha, Ashish Shrivastava, Oncel Tuzel, Devang Naik
Robust Object Detection in Remote Sensing Imagery with Noisy and Sparse Geo-Annotations (Full Version)
Maximilian Bernhard, Matthias Schubert