# Opus 4.8 Released, Sets a New High Score on SWE-Bench Pro

- Source: Yuchen Jin (@Yuchenj_UW)
- Published: 2026-05-29 00:57
- AIHOT score: 72
- AIHOT link: https://aihot.virxact.com/items/cmppr3mx800e7slm6fmy8tbql
- Original link: https://x.com/Yuchenj_UW/status/2060042830559756407

## AI Summary

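Opus 4.8 scores 69.2% on SWE-Bench Pro, 10 points higher than GPT-5.5.

The most interesting part of the release blog is "Dynamic Workflows":

"This new feature (currently in research preview) allows Claude to take on bigger tasks in Claude Code. Claude can plan the work and then run hundreds of parallel subagents in a single session (with Opus 4.8, the agents can run for longer). It verifies its outputs before reporting back to the user."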
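As a rough illustration of the plan, fan out, verify pattern the quote describes, the sketch below uses Python's `asyncio`. It is not how Claude Code implements Dynamic Workflows, which the post does not detail. Every name here (`plan`, `run_subagent`, `verify`, `run_workflow`, and the `max_parallel` parameter) is hypothetical, and the subagent is a stub that only echoes its input.

```python
import asyncio

# Hypothetical names throughout: none of these functions belong to
# Claude Code or any Anthropic API. They only illustrate the pattern.


def plan(task: str, n_parts: int) -> list[str]:
    """Split a large task into independent subtasks (placeholder planner)."""
    return [f"{task} [part {i + 1}/{n_parts}]" for i in range(n_parts)]


async def run_subagent(subtask: str) -> str:
    """Stand-in for a subagent; a real one would call a model here."""
    await asyncio.sleep(0)  # yield control so subagents can interleave
    return f"done: {subtask}"


def verify(subtask: str, output: str) -> bool:
    """Placeholder check that an output corresponds to its subtask."""
    return output == f"done: {subtask}"


async def run_workflow(task: str, n_parts: int, max_parallel: int = 100) -> list[str]:
    subtasks = plan(task, n_parts)
    limit = asyncio.Semaphore(max_parallel)  # cap concurrent subagents

    async def bounded(subtask: str) -> str:
        async with limit:
            return await run_subagent(subtask)

    # Fan out: gather returns results in the same order as its inputs.
    outputs = await asyncio.gather(*(bounded(s) for s in subtasks))

    # Verify every output before reporting back to the caller.
    failed = [s for s, o in zip(subtasks, outputs) if not verify(s, o)]
    if failed:
        raise RuntimeError(f"{len(failed)} subtask(s) failed verification")
    return list(outputs)


results = asyncio.run(run_workflow("migrate tests", n_parts=300))
```

The semaphore is one simple way to bound how many subagents run at once within a single session; the verification step runs only after all subagents finish, so nothing is reported until every output has been checked.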

## Full Text

Opus 4.8 scores 69.2% on SWE-Bench Pro, 10 points higher than GPT-5.5.

Most interesting part of the release blog is "Dynamic Workflows":

"This new feature, available in research preview, allows Claude to take on even bigger tasks in Claude Code. Claude can plan the work and then run hundreds of parallel subagents in a single session (and with Opus 4.8, the agents can run for even longer). It then verifies its outputs before reporting back to the user."