Segmentation Based Approach
Segmentation-based approaches aim to partition images into meaningful regions, facilitating analysis and interpretation across diverse fields. Current research emphasizes the development and application of advanced deep learning architectures, including U-Net variants, transformers (like Mamba), and foundation models (like SAM), often combined with innovative loss functions and data augmentation techniques to address challenges such as class imbalance and limited annotated data. These methods are proving impactful in various applications, from medical image analysis (e.g., tumor detection, organ segmentation) and remote sensing (e.g., crop field mapping, flood detection) to other domains requiring precise object delineation. The ongoing focus is on improving accuracy, efficiency, and explainability, particularly in scenarios with scarce or heterogeneous data.
Papers
Incorporating Feature Pyramid Tokenization and Open Vocabulary Semantic Segmentation
Jianyu Zhang, Li Zhang, Shijian Li
M$^3$-VOS: Multi-Phase, Multi-Transition, and Multi-Scenery Video Object Segmentation
Zixuan Chen, Jiaxin Li, Liming Tan, Yejie Guo, Junxuan Liang, Cewu Lu, Yonglu Li
Query-centric Audio-Visual Cognition Network for Moment Retrieval, Segmentation and Step-Captioning
Yunbin Tu, Liang Li, Li Su, Qingming Huang
3D Graph Attention Networks for High Fidelity Pediatric Glioma Segmentation
Harish Thangaraj, Diya Katariya, Eshaan Joshi, Sangeetha N
CAD-Unet: A Capsule Network-Enhanced Unet Architecture for Accurate Segmentation of COVID-19 Lung Infections from CT Images
Yijie Dang, Weijun Ma, Xiaohu Luo
Open-Vocabulary High-Resolution 3D (OVHR3D) Data Segmentation and Annotation Framework
Jiuyi Xu, Meida Chen, Andrew Feng, Yangming Shi, Zifan Yu
HSDA: High-frequency Shuffle Data Augmentation for Bird's-Eye-View Map Segmentation
Calvin Glisson, Qiuxiao Chen
RefSAM3D: Adapting SAM with Cross-modal Reference for 3D Medical Image Segmentation
Xiang Gao, Kai Lu
UNet++ and LSTM combined approach for Breast Ultrasound Image Segmentation
Saba Hesaraki, Morteza Akbari, Ramin Mousa
CLIP-TNseg: A Multi-Modal Hybrid Framework for Thyroid Nodule Segmentation in Ultrasound Images
Xinjie Sun, Boxiong Wei, Yalong Jiang, Liquan Mao, Qi Zhao