LLM Reasoning
Research on Large Language Model (LLM) reasoning focuses on improving the ability of LLMs to perform complex, multi-step reasoning tasks, often by augmenting them with techniques like chain-of-thought prompting, reinforcement learning (RL), and integration with symbolic reasoning methods. Current efforts concentrate on enhancing the accuracy and reliability of LLM reasoning, addressing issues like hallucination and inconsistent performance across different domains and tasks, often through improved credit assignment in RL and the development of novel evaluation metrics. These advancements are significant because reliable LLM reasoning is crucial for building trustworthy AI systems across diverse applications, from robotics and healthcare to scientific discovery and decision support.
Papers
CoT Rerailer: Enhancing the Reliability of Large Language Models in Complex Reasoning Tasks through Error Detection and Correction
Guangya Wan, Yuqi Wu, Jie Chen, Sheng Li
LLMs are Superior Feedback Providers: Bootstrapping Reasoning for Lie Detection with Self-Generated Feedback
Tanushree Banerjee, Richard Zhu, Runzhe Yang, Karthik Narasimhan
CaLMQA: Exploring culturally specific long-form question answering across 23 languages
Shane Arora, Marzena Karpinska, Hung-Ting Chen, Ipsita Bhattacharjee, Mohit Iyyer, Eunsol Choi
LLM-ARC: Enhancing LLMs with an Automated Reasoning Critic
Aditya Kalyanpur, Kailash Karthik Saravanakumar, Victor Barres, Jennifer Chu-Carroll, David Melville, David Ferrucci
The CLRS-Text Algorithmic Reasoning Language Benchmark
Larisa Markeeva, Sean McLeish, Borja Ibarz, Wilfried Bounsi, Olga Kozlova, Alex Vitvitskyi, Charles Blundell, Tom Goldstein, Avi Schwarzschild, Petar Veličković
ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search
Dan Zhang, Sining Zhoubian, Ziniu Hu, Yisong Yue, Yuxiao Dong, Jie Tang