Multi-Step Reasoning
Multi-step reasoning research aims to improve the ability of large language models (LLMs) to solve complex problems that require several sequential inference steps. Current work concentrates on how LLMs plan, execute, and verify these steps, often using techniques such as chain-of-thought prompting, structured planning with world models, and the integration of external tools or knowledge graphs. Progress here matters for applications ranging from automated problem-solving and decision-making to more sophisticated question answering and human-computer interaction. Developing robust benchmarks and evaluation metrics is also a key focus, since they allow different approaches to be compared rigorously and progress to be tracked.
Papers
LlamaV-o1: Rethinking Step-by-step Visual Reasoning in LLMs
Omkar Thawakar, Dinura Dissanayake, Ketan More, Ritesh Thawkar, Ahmed Heakl, Noor Ahsan, Yuhao Li, Mohammed Zumri, Jean Lahoud, Rao Muhammad Anwer, Hisham Cholakkal, Ivan Laptev, Mubarak Shah, Fahad Shahbaz Khan, Salman Khan
Multi-Step Reasoning in Korean and the Emergent Mirage
Guijin Son, Hyunwoo Ko, Dasol Choi
InfiGUIAgent: A Multimodal Generalist GUI Agent with Native Reasoning and Reflection
Yuhang Liu, Pengxiang Li, Zishu Wei, Congkai Xie, Xueyu Hu, Xinchen Xu, Shengyu Zhang, Xiaotian Han, Hongxia Yang, Fei Wu
Understanding Before Reasoning: Enhancing Chain-of-Thought with Iterative Summarization Pre-Prompting
Dong-Hai Zhu, Yu-Jie Xiong, Jia-Chen Zhang, Xi-Jiong Xie, Chun-Ming Xia
Unlocking Video-LLM via Agent-of-Thoughts Distillation
Yudi Shi, Shangzhe Di, Qirui Chen, Weidi Xie
Think-to-Talk or Talk-to-Think? When LLMs Come Up with an Answer in Multi-Step Reasoning
Keito Kudo, Yoichi Aoki, Tatsuki Kuribayashi, Shusaku Sone, Masaya Taniguchi, Ana Brassard, Keisuke Sakaguchi, Kentaro Inui