Bag of Tricks for Multimodal AutoML with Image, Text, and Tabular Data [2412.16243]