Data Corruption
Data corruption, encompassing errors and inconsistencies in datasets, poses a significant challenge across diverse machine learning applications. Current research focuses on developing robust algorithms and model architectures, such as those based on sequence modeling and robust statistical methods (e.g., Huber loss, quantile estimators), to mitigate the impact of corrupted data on model performance and reliability. This work is crucial for improving the trustworthiness and generalizability of AI systems across various domains, from reinforcement learning and hypothesis testing to natural language processing and building energy assessment, where data quality significantly impacts the accuracy and utility of results.
Papers
November 1, 2024
July 5, 2024
May 30, 2024
May 24, 2024
April 14, 2024
October 31, 2023
October 19, 2023
September 20, 2023
September 16, 2023
August 9, 2022
July 4, 2022
May 2, 2022
March 19, 2022