LLM Agent
LLM agents are autonomous systems that combine large language models with the ability to interact with their environment, achieving complex tasks through planning, tool use, and iterative refinement. Current research focuses on improving their reliability and safety, including aligning their behavior with human values, enhancing their decision-making processes (e.g., through Q-value models and tree search algorithms), and mitigating vulnerabilities to adversarial attacks. This field is significant because it bridges the gap between theoretical AI and practical applications, impacting diverse areas such as game development, software testing, healthcare, and financial markets by automating tasks and improving decision-making.
Papers
VisualAgentBench: Towards Large Multimodal Models as Visual Foundation Agents
Xiao Liu, Tianjie Zhang, Yu Gu, Iat Long Iong, Yifan Xu, Xixuan Song, Shudan Zhang, Hanyu Lai, Xinyi Liu, Hanlin Zhao, Jiadai Sun, Xinyue Yang, Yu Yang, Zehan Qi, Shuntian Yao, Xueqiao Sun, Siyi Cheng, Qinkai Zheng, Hao Yu, Hanchen Zhang, Wenyi Hong, Ming Ding, Lihang Pan, Xiaotao Gu, Aohan Zeng, Zhengxiao Du, Chan Hee Song, Yu Su, Yuxiao Dong, Jie Tang
Can We Rely on LLM Agents to Draft Long-Horizon Plans? Let's Take TravelPlanner as an Example
Yanan Chen, Ali Pesaranghader, Tanmana Sadhu, Dong Hoon Yi