Multi-modal Stance Detection: New Datasets and Model [2402.14298]