Convolutional Neural Network
Convolutional Neural Networks (CNNs) are a class of deep learning models designed for processing grid-like data, excelling in image analysis and related tasks. Current research focuses on improving CNN efficiency and robustness, exploring architectures like EfficientNet and Swin Transformers, as well as novel approaches such as Mamba models to address limitations in computational cost and long-range dependency capture. This active field of research has significant implications across diverse applications, including medical image analysis (e.g., cancer detection, Alzheimer's diagnosis), damage assessment, and art forgery detection, demonstrating the power of CNNs for automating complex visual tasks.
Papers
T-Mamba: A unified framework with Long-Range Dependency in dual-domain for 2D & 3D Tooth Segmentation
Jing Hao, Yonghui Zhu, Lei He, Moyun Liu, James Kit Hon Tsoi, Kuo Feng Hung
Harnessing The Power of Attention For Patch-Based Biomedical Image Classification
Gousia Habib, Shaima Qureshi, Malik ishfaq
Instance-Aware Group Quantization for Vision Transformers
Jaehyeon Moon, Dohyung Kim, Junyong Cheon, Bumsub Ham
Illicit object detection in X-ray images using Vision Transformers
Jorgen Cani, Ioannis Mademlis, Adamantia Anna Rebolledo Chrysochoou, Georgios Th. Papadopoulos
SMOF: Streaming Modern CNNs on FPGAs with Smart Off-Chip Eviction
Petros Toupas, Zhewen Yu, Christos-Savvas Bouganis, Dimitrios Tzovaras
Scalable Lipschitz Estimation for CNNs
Yusuf Sulehman, Tingting Mu
The Topos of Transformer Networks
Mattia Jacopo Villani, Peter McBurney
Learning CNN on ViT: A Hybrid Model to Explicitly Class-specific Boundaries for Domain Adaptation
Ba Hung Ngo, Nhat-Tuong Do-Tran, Tuan-Ngoc Nguyen, Hae-Gon Jeon, Tae Jong Choi
Multi-scale Unified Network for Image Classification
Wenzhuo Liu, Fei Zhu, Cheng-Lin Liu
Automated Report Generation for Lung Cytological Images Using a CNN Vision Classifier and Multiple-Transformer Text Decoders: Preliminary Study
Atsushi Teramoto, Ayano Michiba, Yuka Kiriyama, Tetsuya Tsukamoto, Kazuyoshi Imaizumi, Hiroshi Fujita
ELGC-Net: Efficient Local-Global Context Aggregation for Remote Sensing Change Detection
Mubashir Noman, Mustansar Fiaz, Hisham Cholakkal, Salman Khan, Fahad Shahbaz Khan
Rotate to Scan: UNet-like Mamba with Triplet SSM Module for Medical Image Segmentation
Hao Tang, Lianglun Cheng, Guoheng Huang, Zhengguang Tan, Junhao Lu, Kaihong Wu
Order of Compression: A Systematic and Optimal Sequence to Combinationally Compress CNN
Yingtao Shen, Minqing Sun, Jianzhe Lin, Jie Zhao, An Zou
Integrating Mamba Sequence Model and Hierarchical Upsampling Network for Accurate Semantic Segmentation of Multiple Sclerosis Legion
Kazi Shahriar Sanjid, Md. Tanzim Hossain, Md. Shakib Shahariar Junayed, Dr. Mohammad Monir Uddin