Data Contamination

Data contamination, the unintentional or deliberate inclusion of evaluation data in the training data of machine learning models, particularly large language models (LLMs), undermines the reliability of benchmark results. Current research focuses on developing robust detection methods that identify contamination across model architectures, including transformers and autoencoders, using techniques such as membership inference attacks, perplexity analysis, and internal activation probing. Addressing data contamination is crucial for trustworthy LLM evaluation and for reliable progress in the field, affecting both scientific understanding and the development of robust, generalizable AI systems.
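
To make the perplexity-based approach concrete, below is a minimal sketch of how a benchmark item might be screened for contamination: if a model assigns a text unusually low perplexity, it may have memorized that text during training. The model name, threshold value, and `flag_contaminated` helper are illustrative assumptions, not a method from any specific paper; published detectors typically calibrate against reference distributions rather than a fixed cutoff.

```python
# Sketch of perplexity-based contamination screening with a Hugging Face
# causal LM. Benchmark items the model predicts "too well" (low perplexity)
# are flagged as possibly seen during training. Model and threshold are
# illustrative assumptions.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "gpt2"  # stand-in; any causal LM works
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name)
model.eval()

def perplexity(text: str) -> float:
    """Token-level perplexity of `text` under the model."""
    enc = tokenizer(text, return_tensors="pt")
    with torch.no_grad():
        # Passing labels makes the model return mean cross-entropy loss;
        # perplexity is exp(loss).
        loss = model(**enc, labels=enc["input_ids"]).loss
    return torch.exp(loss).item()

def flag_contaminated(benchmark_items, threshold=15.0):
    """Flag items whose perplexity falls below a (tuned) cutoff.

    The threshold is a hypothetical value; in practice it would be
    calibrated against held-out text known to be uncontaminated.
    """
    return [(item, ppl) for item in benchmark_items
            if (ppl := perplexity(item)) < threshold]

suspects = flag_contaminated([
    "The quick brown fox jumps over the lazy dog.",
    "Question: What is the capital of France? Answer: Paris.",
])
for text, ppl in suspects:
    print(f"ppl={ppl:.1f}  {text!r}")
```

A low score alone is weak evidence, since short or formulaic text is easy to predict; detection papers therefore often compare perplexity on the original item against paraphrased or reordered variants, or combine it with membership inference signals.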

Papers