Web Task

Web task automation research focuses on enabling AI agents to perform complex tasks within web environments with human-like adaptability and efficiency. Current efforts concentrate on agents that combine large language models (LLMs) with reinforcement learning (RL) and search algorithms such as Monte Carlo Tree Search (MCTS), often adding techniques such as workflow memory and policy stacking to improve performance and generalization across diverse websites and tasks. The goal is more robust and versatile agents capable of handling real-world web-based operations, with applications in fields such as automated software development and data entry.

Papers