Reasoning Ability
Reasoning ability in large language models (LLMs) is a burgeoning research area focused on evaluating and enhancing the capacity of these models to perform multi-step inferences and solve complex problems requiring logical deduction and inductive learning. Current research emphasizes benchmarking LLMs on diverse tasks, including mathematical reasoning, commonsense reasoning, and following procedures, often employing techniques like chain-of-thought prompting and knowledge distillation to improve performance. Understanding and improving LLM reasoning is crucial for building more reliable and trustworthy AI systems with broader applications across various fields, from scientific discovery to decision-making support.
Papers
PHAnToM: Persona-based Prompting Has An Effect on Theory-of-Mind Reasoning in Large Language Models
Fiona Anting Tan, Gerard Christopher Yeo, Kokil Jaidka, Fanyou Wu, Weijie Xu, Vinija Jain, Aman Chadha, Yang Liu, See-Kiong Ng
ProTrix: Building Models for Planning and Reasoning over Tables with Sentence Context
Zirui Wu, Yansong Feng
NPHardEval4V: A Dynamic Reasoning Benchmark of Multimodal Large Language Models
Lizhou Fan, Wenyue Hua, Xiang Li, Kaijie Zhu, Mingyu Jin, Lingyao Li, Haoyang Ling, Jinkui Chi, Jindong Wang, Xin Ma, Yongfeng Zhang
Understanding Reasoning Ability of Language Models From the Perspective of Reasoning Paths Aggregation
Xinyi Wang, Alfonso Amayuelas, Kexun Zhang, Liangming Pan, Wenhu Chen, William Yang Wang
Enhancing Textbook Question Answering Task with Large Language Models and Retrieval Augmented Generation
Hessa Abdulrahman Alawwad, Areej Alhothali, Usman Naseem, Ali Alkhathlan, Amani Jamal
Exploring the Reasoning Abilities of Multimodal Large Language Models (MLLMs): A Comprehensive Survey on Emerging Trends in Multimodal Reasoning
Yiqi Wang, Wentao Chen, Xiaotian Han, Xudong Lin, Haiteng Zhao, Yongfei Liu, Bohan Zhai, Jianbo Yuan, Quanzeng You, Hongxia Yang
The Impact of Reasoning Step Length on Large Language Models
Mingyu Jin, Qinkai Yu, Dong Shu, Haiyan Zhao, Wenyue Hua, Yanda Meng, Yongfeng Zhang, Mengnan Du