# DeNovoSWE： 扩展长周期环境以从零生成完整仓库

- 来源：HuggingFace Daily Papers（社区热门论文）
- 发布时间：2026-06-09 08:00
- AIHOT 分数：69
- AIHOT 链接：https://aihot.virxact.com/items/cmq8ws2lk06ipslldfrrnd6hd
- 原文链接：https://arxiv.org/abs/2606.10728

## AI 摘要

DeNovoSWE 是一个大规模完整仓库生成数据集，包含4,818个高质量实例，每个实例要求从文档生成完整仓库。该数据集通过沙盒智能体工作流自动构建，无需人工标注，采用分治与批评修复策略以及难度感知轨迹过滤保证质量。微调 Qwen3-30B-A3B 后，在 BeyondSWE-Doc2Repo 基准上的得分从5.8%提升至47.2%。

## 正文

As the capabilities of LLM-based code agents continue to advance, their expected role is expanding beyond localized bug fixing in existing codebases toward architecting and implementing complete software repositories from high-level specifications. However, training agents for such long-horizon software engineering tasks remains difficult due to the scarcity of large-scale, verifiable whole-repository generation data. In this paper, we introduce DeNovoSWE, a large-scale dataset for whole-repository generation. DeNovoSWE comprises 4,818 high-quality instances, where each instance requires generating a complete repository from documentation. Our dataset is automatically constructed through a carefully designed sandboxed agentic workflow, enabling scalable curation without human annotation. DeNovoSWE is constructed with "divide and conquer" and critic-repair philosophy. To balance data quality and diversity, we further introduce a difficulty-aware trajectory filtering strategy. Fine-tuning Qwen3-30B-A3B on DeNovoSWE substantially improves long-horizon SWE performance, raising its score on the challenging BeyondSWE-Doc2Repo benchmark from 5.8% to 47.2%.