Semantic Segmentation
Semantic segmentation, the task of assigning a semantic label to each pixel in an image, aims to achieve precise pixel-level scene understanding. Current research emphasizes improving accuracy and efficiency across diverse data modalities (RGB, depth, lidar, hyperspectral, and time series) and challenging conditions (low light, adverse weather, imbalanced datasets), often employing advanced architectures like transformers and diffusion models alongside innovative loss functions and training strategies. This field is crucial for numerous applications, including autonomous driving, medical image analysis, remote sensing, and robotics, driving advancements in both model robustness and interpretability.
Papers
Self-Correlation and Cross-Correlation Learning for Few-Shot Remote Sensing Image Semantic Segmentation
Linhan Wang, Shuo Lei, Jianfeng He, Shengkun Wang, Min Zhang, Chang-Tien Lu
Learning Semantic Segmentation with Query Points Supervision on Aerial Images
Santiago Rivier, Carlos Hinojosa, Silvio Giancola, Bernard Ghanem
Panoptic Vision-Language Feature Fields
Haoran Chen, Kenneth Blomqvist, Francesco Milano, Roland Siegwart
MMSFormer: Multimodal Transformer for Material and Semantic Segmentation
Md Kaykobad Reza, Ashley Prater-Bennette, M. Salman Asif
Towards Comparable Knowledge Distillation in Semantic Image Segmentation
Onno Niemann, Christopher Vox, Thorben Werner
BroadCAM: Outcome-agnostic Class Activation Mapping for Small-scale Weakly Supervised Applications
Jiatai Lin, Guoqiang Han, Xuemiao Xu, Changhong Liang, Tien-Tsin Wong, C. L. Philip Chen, Zaiyi Liu, Chu Han
EGIC: Enhanced Low-Bit-Rate Generative Image Compression Guided by Semantic Segmentation
Nikolai Körber, Eduard Kromer, Andreas Siebert, Sascha Hauke, Daniel Mueller-Gritschneder, Björn Schuller
Diffusion Model is Secretly a Training-free Open Vocabulary Semantic Segmenter
Jinglong Wang, Xiawei Li, Jing Zhang, Qingyuan Xu, Qin Zhou, Qian Yu, Lu Sheng, Dong Xu
DeepTriNet: A Tri-Level Attention Based DeepLabv3+ Architecture for Semantic Segmentation of Satellite Images
Tareque Bashar Ovi, Shakil Mosharrof, Nomaiya Bashree, Md Shofiqul Islam, Muhammad Nazrul Islam
Performance Analysis of Various EfficientNet Based U-Net++ Architecture for Automatic Building Extraction from High Resolution Satellite Images
Tareque Bashar Ovi, Nomaiya Bashree, Protik Mukherjee, Shakil Mosharrof, Masuma Anjum Parthima
A Re-Parameterized Vision Transformer (ReVT) for Domain-Generalized Semantic Segmentation
Jan-Aike Termöhlen, Timo Bartels, Tim Fingscheidt
SVQNet: Sparse Voxel-Adjacent Query Network for 4D Spatio-Temporal LiDAR Semantic Segmentation
Xuechao Chen, Shuangjie Xu, Xiaoyi Zou, Tongyi Cao, Dit-Yan Yeung, Lu Fang