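Anthropic has released Claude Opus 4.8. Its fast mode is 2.5x faster and 3x cheaper. On the agentic terminal coding benchmark, its score rose from 66.1% to 74.6%, and it is the new leader on the GDPval-AA benchmark. A new "dynamic workflows" feature, available through Claude Code, breaks large engineering tasks into tens to hundreds of parallel subtasks that multiple AI agents handle together and cross-check. According to the official announcement, this version improves on judgment, honesty, and the ability to work independently, and it is available starting today at the same price.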
Claude Opus 4.8 dropped.
- 2.5x faster fast mode, which is also 3x cheaper
- Agentic terminal coding rises from 66.1% to 74.6% (+8.5 points), the biggest benchmark jump over Opus 4.7
- New "dynamic workflows" feature that allows it to tackle very large-scale problems.
- The new leader on our GDPval-AA benchmark for agentic real-world work tasks
The dynamic workflows in Claude Code will break a massive engineering task into many smaller jobs, run them through tens to hundreds of parallel subagents, and check the results before handing anything back.
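To make the split, fan-out, and check pattern concrete, here is a minimal Python sketch. It is not Claude Code's API or implementation: every name in it (`split_task`, `run_subagent`, `verify`, `run_workflow`) is hypothetical, and the "subagent" is a toy function that squares numbers, standing in for real agent work.

```python
from concurrent.futures import ThreadPoolExecutor


def split_task(items, chunk_size):
    """Break one large job into smaller, independent chunks."""
    return [items[i:i + chunk_size] for i in range(0, len(items), chunk_size)]


def run_subagent(chunk):
    """Hypothetical stand-in for a subagent: squares each number in its chunk."""
    return [x * x for x in chunk]


def verify(chunk, result):
    """Independently check a subagent's output before accepting it."""
    return len(result) == len(chunk) and all(
        r == x * x for x, r in zip(chunk, result)
    )


def run_workflow(items, chunk_size=10, max_workers=8):
    """Split the job, run chunks in parallel, verify, then merge the results."""
    chunks = split_task(items, chunk_size)
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # pool.map returns results in the same order as the input chunks.
        results = list(pool.map(run_subagent, chunks))
    failed = [
        i for i, (chunk, result) in enumerate(zip(chunks, results))
        if not verify(chunk, result)
    ]
    if failed:
        # Nothing is handed back unless every chunk passes its check.
        raise RuntimeError(f"chunks failed verification: {failed}")
    return [value for result in results for value in result]
```

Calling `run_workflow(list(range(25)))` splits the input into three chunks, processes them concurrently, and returns the merged list only if every chunk passes `verify`. A real system would presumably retry or reassign failed chunks rather than raise, but that behavior is not described in the post.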