Social Desirability

Social desirability bias, the tendency to present oneself in a favorable light, is a long-standing concern in psychological research and is now being actively investigated in large language models (LLMs). Current research focuses on detecting and quantifying this bias in LLMs such as GPT-3 and GPT-4, mainly by administering personality assessments, and on analyzing how training data affects agent behavior. The findings highlight the limitations of using LLMs as proxies for human participants and underscore the need for methods that mitigate this bias so that AI systems produce reliable results. The implications extend to the broader trustworthiness and ethics of AI in social science research and beyond.

Papers

October 20, 2024

Exploring Social Desirability Response Bias in Large Language Models: Evidence from GPT-4 Simulations
Sanguk Lee, Kai-Qi Yang, Tai-Quan Peng, Ruth Heo, Hui Liu
Absolute Stance Bias Evidence Piece Human Like User Willingness Social Desirability

May 9, 2024

Large Language Models Show Human-like Social Desirability Biases in Survey Responses
Aadesh Salecha, Molly E. Ireland, Shashanka Subrahmanya, João Sedoc, Lyle H. Ungar, Johannes C. Eichstaedt
Large Language Model Personality Measurement Survey Response Social Desirability

May 6, 2024

Select to Perfect: Imitating desired behavior from large multi-agent data
Tim Franzmeyer, Edith Elkind, Philip Torr, Jakob Foerster, Joao Henriques
Multi Agent AI Agent Behavior Explanation Imitation Policy Collective Navigation Social Desirability

June 7, 2023

Personality testing of GPT-3: Limited temporal reliability, but highlighted social desirability of GPT-3's personality instruments results
Bojana Bodroza, Bojana M. Dinic, Ljubisa Bojic
Chatbot Response Human Like GPT 3 Personality Classification Conversational Assistant Personality Measurement AI Bot Prosocial Behavior Social Desirability
