Space Bias

Space bias, or the uneven representation of different groups or features within a model's learned space, is a significant problem across machine learning domains, hindering fairness and accuracy. Current research focuses on identifying and mitigating these biases in various models, including large language models (LLMs) and computer vision systems, using techniques like activation steering, adversarial training, and data augmentation strategies designed to improve feature representation and balance class distributions. Understanding and addressing space bias is crucial for developing reliable and equitable AI systems, impacting fields ranging from natural language processing and recommendation systems to medical diagnosis and autonomous driving.

Papers