Paper ID: 2501.02040 • Published Jan 3, 2025

A Separable Self-attention Inspired by the State Space Model for Computer Vision

Juntao Zhang, Shaogeng Liu, Kun Bian, You Zhou, Pei Zhang, Jianning Liu, Jun Zhou, Bingyan Liu
Mamba is an efficient State Space Model (SSM) with linear computational complexity. Although SSMs are not well suited to non-causal data, Vision Mamba (ViM) methods still demonstrate good performance on tasks such as image classification and object detection. Recent studies have shown a rich theoretical connection between state space models and attention variants. We propose a novel separable self-attention method that, for the first time, introduces some of Mamba's effective design concepts into separable self-attention. To ensure a fair comparison with ViMs, we introduce VMINet, a simple yet powerful prototype architecture constructed solely by stacking our novel attention modules with the most basic down-sampling layers. Notably, VMINet differs significantly from the conventional Transformer architecture. Our experiments demonstrate that VMINet achieves competitive results on image classification and high-resolution dense prediction tasks. Code is available at: \url{this https URL}.
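
The abstract describes the architecture only at a high level, so the following is a minimal, hypothetical PyTorch sketch rather than the authors' implementation: it shows a generic linear-complexity separable self-attention block and a VMINet-like stage that stacks such blocks after a basic strided-convolution down-sampling layer. All module names, the residual connection, and the hyper-parameters are assumptions for illustration; the paper's Mamba-inspired design details are not reproduced here.

```python
# Hypothetical sketch, not the authors' code: separable self-attention with
# linear complexity, stacked with a basic strided-conv down-sampling layer.
import torch
import torch.nn as nn
import torch.nn.functional as F


class SeparableSelfAttention(nn.Module):
    """Linear-complexity attention: a single context vector is built from
    softmax-normalized per-token scores and broadcast back to every token."""

    def __init__(self, dim):
        super().__init__()
        self.to_scores = nn.Linear(dim, 1)   # one context score per token
        self.to_key = nn.Linear(dim, dim)
        self.to_value = nn.Linear(dim, dim)
        self.proj = nn.Linear(dim, dim)

    def forward(self, x):                     # x: (B, N, C)
        scores = F.softmax(self.to_scores(x), dim=1)                   # (B, N, 1)
        context = (scores * self.to_key(x)).sum(dim=1, keepdim=True)   # (B, 1, C)
        out = F.relu(self.to_value(x)) * context                       # broadcast to tokens
        return self.proj(out)


class VMINetLikeStage(nn.Module):
    """One stage: strided-conv down-sampling followed by attention blocks
    (structure assumed from the abstract's description)."""

    def __init__(self, in_ch, out_ch, depth):
        super().__init__()
        self.down = nn.Conv2d(in_ch, out_ch, kernel_size=3, stride=2, padding=1)
        self.blocks = nn.ModuleList(SeparableSelfAttention(out_ch) for _ in range(depth))

    def forward(self, x):                      # x: (B, C, H, W)
        x = self.down(x)
        B, C, H, W = x.shape
        tokens = x.flatten(2).transpose(1, 2)  # (B, H*W, C)
        for blk in self.blocks:
            tokens = tokens + blk(tokens)      # residual connection (assumed)
        return tokens.transpose(1, 2).reshape(B, C, H, W)


if __name__ == "__main__":
    stage = VMINetLikeStage(in_ch=3, out_ch=64, depth=2)
    print(stage(torch.randn(2, 3, 224, 224)).shape)  # -> torch.Size([2, 64, 112, 112])
```

Because the context vector is a single weighted sum over tokens, the cost per block grows linearly with the number of tokens, which is the property that makes both separable self-attention and SSM-based ViMs attractive for high-resolution dense prediction.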