Direct Convolution
Direct convolution, a fundamental operation in convolutional neural networks (CNNs), is being actively researched to improve efficiency and accuracy across diverse applications. Current efforts focus on optimizing convolution through architectural innovations like dilated convolutions, attention mechanisms integrated with convolutions, and novel data layouts for improved hardware performance, as well as exploring alternatives to traditional convolutions, such as using semirings or table lookups. These advancements aim to enhance the speed and accuracy of CNNs for tasks ranging from medical image analysis and object detection to speech processing and large language model efficiency, ultimately impacting various scientific fields and practical applications.
Papers
Convolutions, Transformers, and their Ensembles for the Segmentation of Organs at Risk in Radiation Treatment of Cervical Cancer
Vangelis Kostoulas, Peter A. N. Bosman, Tanja Alderliesten
Model Stitching: Looking For Functional Similarity Between Representations
Adriano Hernandez, Rumen Dangovski, Peter Y. Lu, Marin Soljacic