Bias Discovery

Bias discovery in machine learning focuses on identifying and understanding how models unfairly discriminate against certain groups, hindering their performance and potentially causing harm. Current research emphasizes detecting biases in various model architectures, including large vision-language models and those used in mental health applications, employing techniques like counterfactual analysis, latent space exploration, and keyword-based explanations to uncover spurious correlations within data. This work is crucial for improving model fairness, robustness, and trustworthiness across diverse applications, ultimately leading to more equitable and reliable AI systems.

Papers