# EvoDS：自进化自主数据科学智能体，带有技能学习与上下文管理

- 来源：HuggingFace Daily Papers（社区热门论文）
- 发布时间：2026-06-02 08:00
- AIHOT 分数：51
- AIHOT 链接：https://aihot.virxact.com/items/cmq0dttfu05assltrv0p71eik
- 原文链接：https://arxiv.org/abs/2606.03841

## AI 摘要

EvoDS 是一个自进化自主数据科学智能体，通过智能体强化学习实现技能扩展与长期上下文自适应管理。核心包括自主技能获取（ASA）机制与自适应上下文压缩（ACC）策略，前者用于合成、验证和复用可执行技能，后者将上下文管理转化为学习控制问题。采用两阶段多智能体训练方案。理论证明其分层设计降低工具选择错误，优化目标符合信息瓶颈原理。在四个基准测试中，EvoDS 平均优于现有开源数据科学智能体 28.9%，并消除 token 溢出失败。代码与数据已开源。

## 正文

Recent progress in Large Language Model (LLM) agents has enabled promising advances in automated data science. However, existing approaches remain fundamentally limited by their static action sets and lack of principled long-horizon context management, hindering their ability to accumulate reusable experience across tasks and operate reliably in multi-stage, iterative data science pipelines. To address these challenges, we introduce EvoDS, a self-evolving autonomous data science agent that learns to expand its skills and adaptively managing long-term context through agentic reinforcement learning. Specifically, EvoDS introduces two key strategies: (1) Autonomous Skill Acquisition (ASA) mechanism, which enables agents to synthesize, validate, and reuse executable skills; and (2) Adaptive Context Compression (ACC) strategy, which treats context management as a learned control problem rather than passive truncation. These strategies are orchestrated within a two-stage multi-agent training scheme, enabling EvoDS to autonomously improve over time. Theoretically, we prove that EvoDS's hierarchical design reduces tool-selection error, and its optimization objective aligns with an information bottleneck principle, ensuring efficient context use. Empirically, EvoDS outperforms state-of-the-art open-source data science agents by an average of 28.9% across four diverse benchmarks while eliminating out-of-token failures. Our code and data are available at https://github.com/usail-hkust/EvoDS.
