# 前沿LLM在医学评估中超越专业临床AI工具

- 来源：Ethan Mollick (@emollick)
- 发布时间：2026-06-12 22:43
- AIHOT 分数：72
- AIHOT 链接：https://aihot.virxact.com/items/cmqb22z3y00d4sl9zkeyvfxhm
- 原文链接：https://x.com/emollick/status/2065444925483692192

## AI 摘要

一项发表在Nature Medicine的研究显示，通用前沿大语言模型（Google、OpenAI、Anthropic）在医学信息评估中全面优于专门的临床AI工具（OpenEvidence和UpToDate）。12名美国临床医生进行随机盲测，Frontier LLMs在三项评估中均胜出。临床AI工具的表现与自动启用的Google Search AI Overview在RCQ测试中相当。

## 正文

There has been a push to use OpenEvidence AI for doctors. But this paper suggests general models are much better： "Frontier LLMs outperformed clinical AI tools in all three evaluations. Clinical AI tools performed comparably to auto-enabled Google Search AI Overview on the RCQ."

### 引用推文

> Eric Topol：For medical information, general AI frontier models (Google, OpenAI, Anthropic) outperformed specialized @EvidenceOpen and @UpToDate as assessed by 12 US clinic...
