Distribution Data
Distribution data, encompassing both in-distribution (ID) and out-of-distribution (OOD) data, is a critical area of machine learning research focused on improving model robustness and reliability. Current research emphasizes developing methods for detecting and handling OOD data, including techniques that leverage graph theory, contrastive learning, and diffusion models, as well as adapting existing models through reweighting and fine-tuning strategies. This work is crucial for building safer and more dependable AI systems across various applications, from autonomous vehicles to medical image analysis, by mitigating the risks associated with unexpected or unseen data. A key challenge remains effectively handling imbalanced datasets and complex real-world distribution shifts.
Papers
Out-of-distribution Detection Learning with Unreliable Out-of-distribution Sources
Haotian Zheng, Qizhou Wang, Zhen Fang, Xiaobo Xia, Feng Liu, Tongliang Liu, Bo Han
Fast and Interpretable Face Identification for Out-Of-Distribution Data Using Vision Transformers
Hai Phan, Cindy Le, Vu Le, Yihui He, Anh Totti Nguyen
Towards out-of-distribution generalizable predictions of chemical kinetics properties
Zihao Wang, Yongqiang Chen, Yang Duan, Weijiang Li, Bo Han, James Cheng, Hanghang Tong
Fairness-enhancing mixed effects deep learning improves fairness on in- and out-of-distribution clustered (non-iid) data
Son Nguyen, Adam Wang, Albert Montillo