Shift Equivariance

Shift equivariance, a property where a model's output shifts proportionally to input shifts, is a crucial concept in machine learning, particularly for image and signal processing. Current research focuses on enhancing shift equivariance in convolutional neural networks (CNNs) and vision transformers (ViTs) by addressing limitations introduced by downsampling, pooling, and positional encoding, often employing techniques like polyphase sampling and adaptive anchoring. This research aims to improve model robustness to small input variations, leading to more reliable predictions and increased performance in applications such as image classification, segmentation, and time-series analysis. Achieving true shift equivariance is vital for building more robust and generalizable models across diverse datasets and applications.

Papers