Data Auditing
Data auditing in machine learning focuses on verifying the ethical and responsible use of data in model training, aiming to detect unauthorized or biased data usage. Current research emphasizes developing methods to identify data provenance, particularly within federated learning and for various model types (including image classifiers and language models), often employing membership inference attacks and techniques from information theory and causal inference. These auditing techniques are crucial for ensuring model fairness, accountability, and transparency, impacting both the development of responsible AI and the legal compliance of data-driven applications.
Papers
September 28, 2024
July 21, 2024
November 10, 2023
April 6, 2023
April 5, 2023
November 23, 2022
October 25, 2022