Tabular Data
Tabular data, ubiquitous in various fields, presents unique challenges for machine learning due to its structured nature and mixed data types. Current research focuses on improving model performance through techniques like self-supervised learning (e.g., JEPA), generative models (e.g., GANs, VAEs, diffusion models) for data augmentation and synthesis, and the integration of large language models (LLMs) for enhanced feature extraction and data generation. These advancements aim to address limitations in existing methods, such as gradient boosted decision trees, and improve accuracy, efficiency, and robustness in applications ranging from medical diagnosis to anomaly detection and scientific simulations.
Papers
October 1, 2024
September 30, 2024
September 25, 2024
September 20, 2024
September 13, 2024
September 11, 2024
September 6, 2024
August 30, 2024
August 28, 2024
August 20, 2024
August 17, 2024
August 12, 2024
August 8, 2024
August 6, 2024
July 18, 2024
July 17, 2024
July 16, 2024
July 15, 2024