Distribution Data
Distribution data, encompassing both in-distribution (ID) and out-of-distribution (OOD) data, is a critical area of machine learning research focused on improving model robustness and reliability. Current research emphasizes developing methods for detecting and handling OOD data, including techniques that leverage graph theory, contrastive learning, and diffusion models, as well as adapting existing models through reweighting and fine-tuning strategies. This work is crucial for building safer and more dependable AI systems across various applications, from autonomous vehicles to medical image analysis, by mitigating the risks associated with unexpected or unseen data. A key challenge remains effectively handling imbalanced datasets and complex real-world distribution shifts.
Papers
On the Inherent Robustness of One-Stage Object Detection against Out-of-Distribution Data
Aitor Martinez-Seras, Javier Del Ser, Alain Andres, Pablo Garcia-Bringas
Unlearning in- vs. out-of-distribution data in LLMs under gradient-based method
Teodora Baluta, Pascal Lamblin, Daniel Tarlow, Fabian Pedregosa, Gintare Karolina Dziugaite