Agent Smith
Research on "Agent Smith" (a placeholder name, as the provided papers don't refer to a specific entity named Agent Smith) focuses on developing autonomous AI agents capable of complex reasoning and interaction within various environments, leveraging large language models (LLMs) as their core decision-making component. Current research emphasizes improving agent capabilities through techniques like knowledge graph integration, multi-agent collaboration, and the incorporation of error-correction mechanisms, often within specialized frameworks designed for specific tasks (e.g., medical question answering, social simulation, or software engineering). This work is significant for advancing AI capabilities in complex domains and improving the reliability and safety of autonomous systems, with potential applications ranging from scientific research to healthcare and industrial automation.
Papers
Agents on the Bench: Large Language Model Based Multi Agent Framework for Trustworthy Digital Justice
Cong Jiang, Xiaolei Yang
GeAR: Graph-enhanced Agent for Retrieval-augmented Generation
Zhili Shen, Chenxin Diao, Pavlos Vougiouklis, Pascual Merita, Shriram Piramanayagam, Damien Graux, Dandan Tu, Zeren Jiang, Ruofei Lai, Yang Ren, Jeff Z. Pan
Explainable Multi-Modal Data Exploration in Natural Language via LLM Agent
Farhad Nooralahzadeh, Yi Zhang, Jonathan Furst, Kurt Stockinger
INVESTORBENCH: A Benchmark for Financial Decision-Making Tasks with LLM-based Agent
Haohang Li, Yupeng Cao, Yangyang Yu, Shashidhar Reddy Javaji, Zhiyang Deng, Yueru He, Yuechen Jiang, Zining Zhu, Koduvayur Subbalakshmi, Guojun Xiong, Jimin Huang, Lingfei Qian, Xueqing Peng, Qianqian Xie, Jordan W. Suchow
Contrato360 2.0: A Document and Database-Driven Question-Answer System using Large Language Models and Agents
Antony Seabra, Claudio Cavalcante, Joao Nepomuceno, Lucas Lago, Nicolaas Ruberg, Sergio Lifschitz
PC Agent: While You Sleep, AI Works -- A Cognitive Journey into Digital World
Yanheng He, Jiahe Jin, Shijie Xia, Jiadi Su, Runze Fan, Haoyang Zou, Xiangkun Hu, Pengfei Liu
SafeAgentBench: A Benchmark for Safe Task Planning of Embodied LLM Agents
Sheng Yin, Xianghe Pang, Yuanzhuo Ding, Menglan Chen, Yutong Bi, Yichen Xiong, Wenhao Huang, Zhen Xiang, Jing Shao, Siheng Chen
RareAgents: Autonomous Multi-disciplinary Team for Rare Disease Diagnosis and Treatment
Xuanzhong Chen, Ye Jin, Xiaohao Mao, Lun Wang, Shuyang Zhang, Ting Chen