LLM Agent
LLM agents are autonomous systems that combine large language models with the ability to interact with their environment, achieving complex tasks through planning, tool use, and iterative refinement. Current research focuses on improving their reliability and safety, including aligning their behavior with human values, enhancing their decision-making processes (e.g., through Q-value models and tree search algorithms), and mitigating vulnerabilities to adversarial attacks. This field is significant because it bridges the gap between theoretical AI and practical applications, impacting diverse areas such as game development, software testing, healthcare, and financial markets by automating tasks and improving decision-making.
Papers
Identifying Performance-Sensitive Configurations in Software Systems through Code Analysis with LLM Agents
Zehao Wang, Dong Jae Kim, Tse-Hsun Chen
Ask-before-Plan: Proactive Language Agents for Real-World Planning
Xuan Zhang, Yang Deng, Zifeng Ren, See-Kiong Ng, Tat-Seng Chua
CodeNav: Beyond tool-use to using real-world codebases with LLM agents
Tanmay Gupta, Luca Weihs, Aniruddha Kembhavi
MASAI: Modular Architecture for Software-engineering AI Agents
Daman Arora, Atharv Sonwane, Nalin Wadhwa, Abhav Mehrotra, Saiteja Utpala, Ramakrishna Bairi, Aditya Kanade, Nagarajan Natarajan
Watch Every Step! LLM Agent Learning via Iterative Step-Level Process Refinement
Weimin Xiong, Yifan Song, Xiutian Zhao, Wenhao Wu, Xun Wang, Ke Wang, Cheng Li, Wei Peng, Sujian Li
AGILE: A Novel Reinforcement Learning Framework of LLM Agents
Peiyuan Feng, Yichen He, Guanhua Huang, Yuan Lin, Hanchong Zhang, Yuchen Zhang, Hang Li
ALI-Agent: Assessing LLMs' Alignment with Human Values via Agent-based Evaluation
Jingnan Zheng, Han Wang, An Zhang, Tai D. Nguyen, Jun Sun, Tat-Seng Chua
CRISPR-GPT: An LLM Agent for Automated Design of Gene-Editing Experiments
Kaixuan Huang, Yuanhao Qu, Henry Cousins, William A. Johnson, Di Yin, Mihir Shah, Denny Zhou, Russ Altman, Mengdi Wang, Le Cong
Testing and Understanding Erroneous Planning in LLM Agents through Synthesized User Inputs
Zhenlan Ji, Daoyuan Wu, Pingchuan Ma, Zongjie Li, Shuai Wang