Learning Fine-Grained Features for Pixel-wise Video Correspondences [2308.03040]