Mask Prediction
Mask prediction, a core task in computer vision, aims to accurately delineate objects or regions of interest within images or videos, generating pixel-level masks. Current research focuses on improving mask prediction accuracy and efficiency across diverse applications, employing techniques like prototype-based methods for point clouds, diffusion models for high-fidelity image synthesis, and transformer-based architectures for instance and panoptic segmentation. These advancements are driving progress in various fields, including autonomous driving (HD map construction), medical image analysis (segmentation with uncertainty quantification), and video understanding (instance segmentation and tracking). The development of robust and efficient mask prediction methods is crucial for advancing numerous applications requiring precise object localization and segmentation.
Papers
What is Point Supervision Worth in Video Instance Segmentation?
Shuaiyi Huang, De-An Huang, Zhiding Yu, Shiyi Lan, Subhashree Radhakrishnan, Jose M. Alvarez, Abhinav Shrivastava, Anima Anandkumar
Texture-Preserving Diffusion Models for High-Fidelity Virtual Try-On
Xu Yang, Changxing Ding, Zhibin Hong, Junhao Huang, Jin Tao, Xiangmin Xu