Autonomous Agent
Autonomous agents are software or robotic systems capable of independent decision-making and action within their environment, aiming to achieve specified goals. Current research heavily focuses on leveraging large language models (LLMs) and reinforcement learning (RL) algorithms, often combined with techniques like Monte Carlo Tree Search and contrastive learning, to enhance agent capabilities in diverse tasks such as game testing, network security, and robotic navigation. This field is significant due to its potential to automate complex processes across various sectors, from optimizing industrial workflows to improving safety and efficiency in autonomous vehicles and robotics. The development of robust benchmarks and frameworks for evaluating agent performance and safety is a key area of ongoing investigation.
Papers
CRAB: Cross-environment Agent Benchmark for Multimodal Language Model Agents
Tianqi Xu, Linyao Chen, Dai-Jie Wu, Yanjun Chen, Zecheng Zhang, Xiang Yao, Zhiqiang Xie, Yongchao Chen, Shilong Liu, Bochen Qian, Anjie Yang, Zhaoxuan Jin, Jianbo Deng, Philip Torr, Bernard Ghanem, Guohao Li
Agentless: Demystifying LLM-based Software Engineering Agents
Chunqiu Steven Xia, Yinlin Deng, Soren Dunn, Lingming Zhang
Tree Search for Language Model Agents
Jing Yu Koh, Stephen McAleer, Daniel Fried, Ruslan Salakhutdinov
Ontology-Enhanced Decision-Making for Autonomous Agents in Dynamic and Partially Observable Environments
Saeedeh Ghanadbashi, Fatemeh Golpayegani
Knowing What Not to Do: Leverage Language Model Insights for Action Space Pruning in Multi-agent Reinforcement Learning
Zhihao Liu, Xianliang Yang, Zichuan Liu, Yifan Xia, Wei Jiang, Yuanyu Zhang, Lijuan Li, Guoliang Fan, Lei Song, Bian Jiang
AndroidWorld: A Dynamic Benchmarking Environment for Autonomous Agents
Christopher Rawles, Sarah Clinckemaillie, Yifan Chang, Jonathan Waltz, Gabrielle Lau, Marybeth Fair, Alice Li, William Bishop, Wei Li, Folawiyo Campbell-Ajala, Daniel Toyama, Robert Berry, Divya Tyamagundlu, Timothy Lillicrap, Oriana Riva
Human-Agent Cooperation in Games under Incomplete Information through Natural Language Communication
Shenghui Chen, Daniel Fried, Ufuk Topcu