# 迈向自动化科学评审：Google 的 Paper Assistant Tool（PAT）

- 来源：HuggingFace Daily Papers（社区热门论文）
- 发布时间：2026-06-26 08:00
- AIHOT 分数：55
- AIHOT 链接：https://aihot.virxact.com/items/cmqylfgci02xwsliv1ex105yf
- 原文链接：https://arxiv.org/abs/2606.28277

## AI 摘要

Google 发布 Paper Assistant Tool（PAT），一个用于深度科学评审和验证的智能体 AI 框架。PAT 能摄取完整学术论文，生成综合评估，包括检查理论结果、验证实验、提出改进建议和识别潜在缺陷。通过利用推理扩展技术，PAT 在 SPOT 基准上对数学错误的召回率比零样本提升 34%。该工具已在 STOC 和 ICML 两大计算机科学会议上作为作者预提交工具进行试点，可发现关键错误并提出实质性改进，在保留评审员对结果控制权的同时减轻其认知负担。

## 正文

Artificial intelligence is driving a revolution in scientific discovery, accelerating everything from hypothesis generation to mathematical theorem proving. However, this rapid acceleration is creating a systemic challenge: traditional human peer review cannot scale to match the influx of AI-assisted science. Ultimately, to resolve this tension, we must also deploy AI to accelerate the verification and review process itself. To frame the discussion around this transition, we propose a taxonomy consisting of four progressive levels of AI-human collaboration in scientific evaluation, and discuss various trade-offs involved with each. As a step toward this future, we introduce the Paper Assistant Tool (PAT), an agentic AI framework built for deep scientific review and verification. PAT ingests full scientific manuscripts and produces a comprehensive evaluation, checking theoretical results, validating experiments, suggesting improvements, and identifying potential flaws. By utilizing inference scaling techniques, PAT is able to identify deeper issues than a single model call alone, achieving a 34% improvement over zero-shot recall on mathematical errors in the SPOT benchmark. Pilot deployments of PAT as a pre-submission tool for authors at two major Computer Science conferences -- STOC and ICML -- demonstrate its ability to identify critical errors and suggest substantive improvements to research papers. By catching errors early, PAT eases the cognitive burden placed on referees, while preserving their control over the outcomes of the review process.
