Claude 的新模型在"出错时更'诚实'"

2026-05-29 01:00·35天前·Jay Peters

AI 摘要

Anthropic 在周四发布了其最新模型 Claude Opus 4.8。新模型在生成错误内容时，更倾向于主动标示不确定性，并减少做出无根据的断言。在内部评估中，其产出未经证实断言的可能性比前代模型降低约 4 倍。

原文 · 未翻译

News

Anthropic

Claude’s new model is more ‘honest’ when it messes up

Opus 4.8 is launching on Thursday.

Anthropic is releasing Claude Opus 4.8 on Thursday, and the company is touting the model’s “honesty.”

According to Anthropic, it trains “all [its] models to be honest — for instance, to avoid making claims that they can’t support.” But it notes that “a general problem with AI models is that they sometimes jump to conclusions, confidently presenting their work as making progress despite thin evidence.”

The AI lab claims that early testers have found that Opus 4.8 “is more likely to flag uncertainties about its work and less likely to make unsupported claims.” In the company’s evaluations, Opus 4.8 is “around 4x less likely than its predecessor to allow flaws in code it’s written to pass unremarked.”

In addition to the honesty improvements, with Opus 4.8, users can direct the amount of effort Claude puts into a task. Higher-effort responses will use more tokens, giving users the option of lower-effort responses if they don’t want to burn through their rate limits as quickly.

Anthropic is also launching a feature called “dynamic workflows” in research preview, which the company says will let Claude “take on even bigger tasks.” With dynamic workflows, “Claude can plan the work and then run hundreds of parallel subagents in a single session (and with Opus 4.8, the agents can run for even longer). It then verifies its outputs before reporting back to the user.”

Jay Peters

Anthropic

News

The Verge：AI（RSS）

68导出 Markdown

Claude 的新模型在"出错时更'诚实'"

2026-05-29 01:00·35天前·Jay Peters

阅读原文· theverge.com

AI 摘要

原文 · 保持原样，未翻译

News

Anthropic

Claude’s new model is more ‘honest’ when it messes up

Opus 4.8 is launching on Thursday.

Anthropic is releasing Claude Opus 4.8 on Thursday, and the company is touting the model’s “honesty.”