似乎GPT-5.2在同行评审中达到了专家水平:45位科学家花费469小时,评估了人类与AI对82篇论文的评审。 “令人惊讶的是,当前的AI评审甚至能与《自然》官方同行评审中的顶级评审人相媲美……”尽管并非没有弱点。
Seems GPT-5.2 reaches expert level in peer review: 45 scientists took 469 hours evaluating human &; AI reviews on 82 papers.
"Surprisingly, current AI reviewers are competitive even with the top-rated reviewers in Nature's official peer review…" though not without weaknesses.