Agent Capability
Agent capability research focuses on evaluating and enhancing the performance of artificial intelligence agents across diverse tasks, aiming to create more robust, adaptable, and reliable systems. Current research emphasizes developing novel agent architectures, such as those incorporating self-improvement mechanisms or normative modules, and improving evaluation methods through techniques like value function decomposition and benchmark creation tailored to specific domains (e.g., cybersecurity, biomedical science). These advancements are crucial for mitigating risks associated with increasingly capable AI systems and for advancing the broader field of artificial intelligence through more rigorous evaluation and improved agent design.
Papers
February 23, 2023
December 15, 2022
November 29, 2022
November 23, 2022
November 7, 2022
November 4, 2022
October 6, 2022
September 13, 2022
June 24, 2022
January 10, 2022
January 9, 2022
December 14, 2021