Truthful Space
"Truthful Space" in AI research focuses on developing large language models (LLMs) that reliably produce accurate and honest responses, avoiding both unintentional errors ("hallucinations") and deliberate deception. Current research emphasizes evaluating and improving LLM truthfulness through various methods, including analyzing internal model representations, developing new evaluation benchmarks (like TruthfulQA), and designing techniques to filter misleading information or steer models towards truthful generation. This work is crucial for building trust in LLMs and ensuring their safe and responsible deployment in diverse applications, ranging from question answering to decision support systems.
Papers
December 11, 2024
December 1, 2024
November 7, 2024
October 21, 2024
October 17, 2024
October 9, 2024
September 27, 2024
September 13, 2024
July 19, 2024
July 16, 2024
June 19, 2024
May 22, 2024
April 8, 2024
March 12, 2024
February 28, 2024
February 27, 2024
February 11, 2024
February 9, 2024
February 7, 2024