AI Agent
AI agents are autonomous systems that perceive their environment, reason about it, and act to achieve specified goals. Current research focuses on improving agent capabilities through self-improvement mechanisms (e.g., recursive self-modification), stronger search algorithms (e.g., Monte Carlo Tree Search), and the integration of large language models (LLMs) for reasoning and tool use. The field also bears directly on AI safety and reliability, including robustness to adversarial attacks and responsible deployment across applications ranging from traffic modeling to personalized search engines.
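The perceive, reason, act cycle described above can be illustrated with a toy example. The following is a minimal sketch under stated assumptions, not an implementation from any of the listed papers: the `LineWorld` environment, the `choose_action` rule, and the `run_agent` loop are all hypothetical names introduced for illustration, and the "reasoning" step is a hand-written rule standing in for whatever planner, search procedure, or LLM a real agent would use.

```python
from dataclasses import dataclass


@dataclass
class LineWorld:
    """Hypothetical environment: the agent stands on an integer number line."""
    position: int = 0

    def observe(self) -> int:
        # Perception: the agent sees only its current position.
        return self.position

    def step(self, action: int) -> None:
        # Acting: action is -1 (left), 0 (stay), or +1 (right).
        self.position += action


def choose_action(observation: int, goal: int) -> int:
    """Reasoning stand-in: move one step toward the goal, or stay if there."""
    if observation < goal:
        return 1
    if observation > goal:
        return -1
    return 0


def run_agent(env: LineWorld, goal: int, max_steps: int = 100) -> int:
    """Run the perceive-reason-act loop and return the number of moves made."""
    for steps in range(max_steps):
        observation = env.observe()                 # perceive
        action = choose_action(observation, goal)   # reason
        if action == 0:                             # goal reached, stop early
            return steps
        env.step(action)                            # act
    return max_steps


if __name__ == "__main__":
    env = LineWorld(position=0)
    moves = run_agent(env, goal=3)
    print(f"reached position {env.position} in {moves} moves")
```

In a more realistic agent, `choose_action` is the piece that would be replaced by search (for example Monte Carlo Tree Search) or an LLM call, while the surrounding loop keeps roughly the same shape.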
Papers
Eliza: A Web3 friendly AI Agent Operating System
Shaw Walters, Sam Gao, Shakker Nerd, Feng Da, Warren Williams, Ting-Chien Meng, Hunter Han, Frank He, Allen Zhang, Ming Wu, Timothy Shen, Maxwell Hu, Jerry Yan
AIOpsLab: A Holistic Framework to Evaluate AI Agents for Enabling Autonomous Clouds
Yinfang Chen, Manish Shetty, Gagan Somashekar, Minghua Ma, Yogesh Simmhan, Jonathan Mace, Chetan Bansal, Rujia Wang, Saravan Rajmohan
A hybrid marketplace of ideas
Tomer Jordi Chaffer, Dontrail Cotlage, Justin Goldston
QuArch: A Question-Answering Dataset for AI Agents in Computer Architecture
Shvetank Prakash, Andrew Cheng, Jason Yik, Arya Tschand, Radhika Ghosal, Ikechukwu Uchendu, Jessica Quaye, Jeffrey Ma, Shreyas Grampurohit, Sofia Giannuzzi, Arnav Balyan, Fin Amin, Aadya Pipersenia, Yash Choudhary, Ankita Nayak, Amir Yazdanbakhsh, Vijay Janapa Reddi
TheAgentCompany: Benchmarking LLM Agents on Consequential Real World Tasks
Frank F. Xu, Yufan Song, Boxuan Li, Yuxuan Tang, Kritanjali Jain, Mengxue Bao, Zora Z. Wang, Xuhui Zhou, Zhitong Guo, Murong Cao, Mingyang Yang, Hao Yang Lu, Amaad Martin, Zhe Su, Leander Maben, Raj Mehta, Wayne Chi, Lawrence Jang, Yiqing Xie, Shuyan Zhou, Graham Neubig
Tree-of-Code: A Hybrid Approach for Robust Complex Task Planning and Execution
Ziyi Ni, Yifan Li, Daxiang Dong
CUAL: Continual Uncertainty-aware Active Learner
Amanda Rios, Ibrahima Ndiour, Parual Datta, Jerry Sydir, Omesh Tickoo, Nilesh Ahuja
LMAgent: A Large-scale Multimodal Agents Society for Multi-user Simulation
Yijun Liu, Wu Liu, Xiaoyan Gu, Yong Rui, Xiaodong He, Yongdong Zhang
Brain-inspired AI Agent: The Way Towards AGI
Bo Yu, Jiangning Wei, Minzhen Hu, Zejie Han, Tianjian Zou, Ye He, Jun Liu