Heterogeneous Tabular Data
Heterogeneous tabular data, characterized by diverse data types within a single dataset, presents significant challenges for machine learning. Current research focuses on developing robust models, including graph neural networks, transformers, and enhanced tree-based ensembles, capable of handling this heterogeneity and improving predictive performance. These efforts are driven by the prevalence of such data in various domains and aim to overcome limitations of traditional methods, leading to more accurate and interpretable models for diverse applications. Furthermore, research is exploring techniques for synthetic data generation and cross-dataset pretraining to address data scarcity and improve generalization capabilities.
Papers
November 4, 2024
October 28, 2024
September 22, 2024
August 14, 2024
June 7, 2024
June 1, 2024
March 7, 2024
November 7, 2023
October 31, 2023
October 30, 2023
September 29, 2023
May 23, 2023
February 13, 2023
November 28, 2022
September 2, 2022