Policy Value
Policy value research focuses on aligning artificial intelligence systems, particularly large language models (LLMs), with human values and societal norms. Current research emphasizes developing robust evaluation frameworks and benchmarks to assess this alignment across diverse contexts, employing techniques like Bayesian inverse reinforcement learning and generative evolving testing, as well as exploring the use of transformer-based models for imputation of missing data in value-related datasets. This work is crucial for mitigating potential harms from AI systems and ensuring responsible development and deployment, impacting fields ranging from news recommendation to healthcare and education.
Papers
November 7, 2022
October 31, 2022
October 26, 2022
October 19, 2022
October 15, 2022
September 28, 2022
August 24, 2022
June 30, 2022
May 4, 2022
March 25, 2022
March 15, 2022
February 28, 2022
February 3, 2022
January 13, 2022