# Crafter：一个用于从多样化输入生成可编辑科学图表的多智能体框架

- 来源：HuggingFace Daily Papers（社区热门论文）
- 发布时间：2026-05-28 08:00
- AIHOT 分数：51
- AIHOT 链接：https://aihot.virxact.com/items/cmpw15rry03fpslukz5p0h2ze
- 原文链接：https://arxiv.org/abs/2605.30611

## AI 摘要

科学图表生成是论文准备中最耗时的环节之一，现有自动化系统仅支持单一图表类型和纯文本输入，且输出不可编辑。为此，研究团队提出了Crafter，一个多智能体框架，它能适配多种图表类型和输入条件，生成出版质量的图表。其配套工具CraftEditor可将生成的栅格图转换为可编辑的SVG文件。同时发布的CraftBench评测基准涵盖了三种图表类型和四种输入条件。实验显示，Crafter在PaperBanana-Bench和CraftBench上均优于独立生成器和现有智能体基线，其组件贡献也得到了验证。代码与基准已开源。

## 正文

Scientific figures are among the most effective means of communicating complex research ideas, yet producing publication-quality illustrations remains one of the most labor-intensive parts of paper preparation. Existing automated systems each target a single figure type under text-only input, leaving the diversity of types and conditions researchers actually use unaddressed; their raster outputs further cannot be locally revised. Because scientific figures are structured compositions of discrete semantic components, the localized errors generators produce on such layouts demand not a stronger backbone but a harness. We instantiate this harness in two complementary systems: Crafter, a multi-agent harness for figure generation that generalizes across figure types and input conditions without architectural changes, and CraftEditor, which applies the same pattern to convert raster outputs into editable SVGs. Moreover, we introduce CraftBench, a benchmark spanning three figure types and four input conditions with human quality annotation. Experiments show that Crafter substantially outperforms both standalone generators and the agentic baseline on PaperBanana-Bench and CraftBench, with ablations confirming each component's independent contribution; CraftEditor faithfully converts outputs into editable SVGs that surpass all baselines. Our code and benchmark are available at https://github.com/HaozheZhao/Crafter.
