Reasoning Datasets
Reasoning datasets are collections of problems designed to evaluate and improve the reasoning capabilities of large language models (LLMs). Current research focuses on creating larger, more diverse datasets encompassing various reasoning types (mathematical, commonsense, logical) and incorporating multimodal data (text and images). These datasets, coupled with techniques like chain-of-thought prompting, process supervision, and tool augmentation (e.g., integrating external calculators or search engines), aim to enhance LLMs' ability to solve complex problems. The development of robust reasoning datasets is crucial for advancing LLM capabilities and ensuring their reliable application in diverse fields, including healthcare and education.
Papers
Integrating Arithmetic Learning Improves Mathematical Reasoning in Smaller Models
Neeraj Gangwar, Suma P Bhat, Nickvash KaniUniversity of Illinois Urbana-ChampaignCommonsense Reasoning in Arab Culture
Abdelrahman Sadallah, Junior Cedric Tonga, Khalid Almubarak, Saeed Almheiri, Farah Atif, Chatrine Qwaider, Karima Kadaoui, Sara Shatnawi+2MBZUAI●SDAIA●Al-Balqa Applied University●Khalifa University