Ethan Mollick (@emollick)

2026-06-13 03:16 (20 days ago)

AI summary

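Claude Fable 5 performs strongly on the FrontierMath benchmark (Tiers 1-4, v2), scoring 87% on Tiers 1-3 and 88% on Tier 4, continuing the trend of rapid improvement in the mathematical ability of Anthropic models. The main post comments: "The shape of the graph is getting very familiar."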

The shape of the graph is getting very familiar.

Epoch AI: Claude Fable 5 scores very well on FrontierMath: Tiers 1-4 (v2), reaching 87% on Tiers 1-3 and 88% on Tier 4. This continues a streak of Anthropic models improv...

Tags: Anthropic, Reasoning, Evaluations/Benchmarks
