Policy Value
Policy value research focuses on aligning artificial intelligence systems, particularly large language models (LLMs), with human values and societal norms. Current work emphasizes robust evaluation frameworks and benchmarks that assess this alignment across diverse contexts, using techniques such as Bayesian inverse reinforcement learning and generative evolving testing. It also explores transformer-based models for imputing missing data in value-related datasets. This research is important for mitigating potential harms from AI systems and for supporting responsible development and deployment, with applications ranging from news recommendation to healthcare and education.