Bias Score

Bias scores are quantitative metrics designed to assess and quantify biases present in various machine learning models and datasets, aiming to promote fairness and mitigate discriminatory outcomes. Current research focuses on developing novel bias score methodologies, particularly for text data (e.g., analyzing language models and documents) and image data (e.g., in medical image analysis), often employing techniques like topic modeling, word embeddings, and deep learning architectures such as transformers. These advancements are crucial for improving the reliability and ethical implications of AI systems across diverse applications, from news classification and healthcare to legal and scientific domains, by enabling the identification and mitigation of biases that might otherwise lead to unfair or inaccurate predictions.

Papers