State of the Art LLM
State-of-the-art Large Language Models (LLMs) are rapidly evolving, focusing on improving performance across diverse tasks and domains, including finance, healthcare, and process engineering. Research emphasizes enhancing reasoning capabilities, particularly for multi-step problems, through techniques like incorporating external symbolic working memory and modular architectures with specialized expert models (e.g., Mixture of Experts). These advancements are significant because they enable more reliable and efficient LLM applications, ranging from automating complex processes to providing personalized user experiences and improving access to information in various fields.
47papers
Papers
April 1, 2025
First Field-Trial Demonstration of L4 Autonomous Optical Network for Distributed AI Training Communication: An LLM-Powered Multi-AI-Agent Solution
Yihao Zhang, Qizhi Qiu, Xiaomin Liu, Dianxuan Fu, Xingyu Liu, Leyan Fei, Yuming Cheng, Lilin Yi, Weisheng Hu, Qunbi ZhugeShanghai Jiao Tong UniversityRecitation over Reasoning: How Cutting-Edge Language Models Can Fail on Elementary School-Level Reasoning Problems?
Kai Yan, Yufei Xu, Zhengyin Du, Xuesong Yao, Zheyu Wang, Xiaowen Guo, Jiecao ChenByteDance Seed●University of Illinois Urbana-Champaign
February 18, 2025
February 12, 2025
February 11, 2025
AI-VERDE: A Gateway for Egalitarian Access to Large Language Model-Based Resources For Educational Institutions
Paul Mithun, Enrique Noriega-Atala, Nirav Merchant, Edwin SkidmoreMiniF2F in Rocq: Automatic Translation Between Proof Assistants -- A Case Study
Jules Viennot, Guillaume Baudart, Emilio Jesùs Gallego Arias, Marc Lelarge
February 4, 2025
LLM-ProS: Analyzing Large Language Models' Performance in Competitive Problem Solving
Md Sifat Hossain, Anika Tabassum, Md. Fahim Arefin, Tarannum Shaila ZamanEvalita-LLM: Benchmarking Large Language Models on Italian
Bernardo Magnini, Roberto Zanoli, Michele Resta, Martin Cimmino, Paolo Albano, Marco Madeddu, Viviana Patti
February 1, 2025