Mutation Testing

Mutation testing assesses the effectiveness of test suites by introducing artificial faults into software or models, evaluating their detection rate. Current research focuses on adapting this technique to various machine learning domains, including large language models, chatbots, and deep neural networks deployed on hardware accelerators, investigating optimal mutation operators and evaluating the performance of different mutation-based model selection methods against established techniques like cross-validation. This work is significant for improving the reliability and trustworthiness of increasingly complex AI systems across diverse applications, ultimately contributing to more robust and dependable AI deployments.

Papers