Reasoning Datasets
Reasoning datasets are collections of problems designed to evaluate and improve the reasoning capabilities of large language models (LLMs). Current research focuses on creating larger, more diverse datasets encompassing various reasoning types (mathematical, commonsense, logical) and incorporating multimodal data (text and images). These datasets, coupled with techniques like chain-of-thought prompting, process supervision, and tool augmentation (e.g., integrating external calculators or search engines), aim to enhance LLMs' ability to solve complex problems. The development of robust reasoning datasets is crucial for advancing LLM capabilities and ensuring their reliable application in diverse fields, including healthcare and education.
Papers
Make Your Decision Convincing! A Unified Two-Stage Framework: Self-Attribution and Decision-Making
Yanrui Du, Sendong Zhao, Haochun Wang, Yuhan Chen, Rui Bai, Zewen Qiang, Muzhen Cai, Bing Qin
Teaching Language Models to Self-Improve through Interactive Demonstrations
Xiao Yu, Baolin Peng, Michel Galley, Jianfeng Gao, Zhou Yu