Integrating Audio-Visual Features for Multimodal Deepfake Detection [2310.03827]