SGDR:面向Web智能体的在线技能学习方法
阅读原文· arxiv.orgSGDR是一种面向Web智能体的在线技能学习方法,能在执行过程中按步骤动态复用技能。它包含三个组件:滑动窗口提取将完成轨迹转化为可调用子过程;双文本-代码表示连接技能检索与可执行动作;状态接地动态检索机制同时匹配任务目标与当前网页状态。在WebArena五个领域上,SGDR搭配GPT-4.1的平均成功率达37.5%,搭配Qwen3-4B达24.3%,分别相对最强基线提升10.6%和10.0%。代码已开源。
Language agents increasingly rely on reusable skills to improve multi-step web automation across related tasks. A growing line of work studies online skill learning, where agents continually induce skills from previous task trajectories and reuse them in future tasks on the fly. However, existing methods mainly reuse skills at the task-level: a fixed set of skills is retrieved based on the initial task instruction and then held fixed throughout execution. This static strategy is misaligned with web execution, where the appropriate next action depends not only on the task goal but also on the current webpage state, which often transitions into situations that the initial skills fail to cover. To address this gap, we propose State-Grounded Dynamic Retrieval (SGDR), an online skill learning method that enables stepwise skill reuse for web agents. SGDR consists of three components: a sliding-window extraction process that turns completed trajectories into reusable sub-procedures invokable at intermediate execution states, a dual text-code representation that connects skill retrieval with executable action, and a state-grounded dynamic retrieval mechanism that matches skills to both the task goal and the current webpage state. Experiments on WebArena across five domains show that SGDR consistently outperforms strong baselines, achieving average success rates of 37.5% with GPT-4.1 and 24.3% with Qwen3-4B, corresponding to relative gains of 10.6% and 10.0% over the strongest baseline, respectively. The code is available at https://github.com/plusnli/skill-dynamic-retrieval.