Stereotype Content

Stereotype content research investigates how biases and stereotypes are represented and perpetuated within large language models (LLMs) and other AI systems, aiming to understand and mitigate their harmful societal impact. Current research focuses on identifying and quantifying these biases across various modalities (text, images), languages, and demographic groups, often employing techniques like adversarial attacks and explainable AI methods to analyze model behavior and develop mitigation strategies. This work is crucial for ensuring fairness and equity in AI applications, impacting fields ranging from education and healthcare to hiring and criminal justice, by promoting the development of less biased and more responsible AI systems.

Papers

March 27, 2022

Reinforcement Guided Multi-Task Learning Framework for Low-Resource Stereotype Detection
Rajkumar Pujari, Erik Oveson, Priyanka Kulkarni, Elnaz Nouri
Hate Speech Detection Stereotype Content Multi Task Reinforcement Learning Offensive Language Detection Stereotype Detection Misogyny Detection

March 24, 2022

Gender and Racial Stereotype Detection in Legal Opinion Word Embeddings
Sean Matthews, John Hudzina, Dawn Sepehr
Natural Language Processing Word Embeddings Gender Bias Stereotype Content Gender Inclusive Text

March 6, 2022

A Survey on Bias and Fairness in Natural Language Processing
Rajas Bansal
Natural Language Processing Timely Survey Procedural Fairness Absolute Stance Bias NLP Model Stereotype Content

January 10, 2022

Quantifying Gender Bias in Consumer Culture
Reihane Boghrati, Jonah Berger
Absolute Stance Bias Gender Bias Stereotype Content Consumer Behavior Cultural Evolution

December 1, 2021

CO-STAR: Conceptualisation of Stereotypes for Analysis and Reasoning
Teyun Kwon, Anandha Gopalan
General Analysis Complex Reasoning Hate Speech Stereotype Content Offensive Content Co Training Actor Loss Conceptual Tool

November 20, 2021

Exploring Language Patterns in a Medical Licensure Exam Item Bank
Swati Padhee, Kimberly Swygert, Ian Micir
Natural Language Processing Stereotype Content Biased Behavior Language Bias Language Pattern State Medical Licensing Examination