文章更新提醒:我们发布《Finding Miscompiles for Fun, Not Profit》的次日,Anthropic发布了Opus 4.8和Claude Code中的ultracode模式。我们的初步实验表明,两者结合在过滤低严重性漏洞方面显著更优,且发现中高严重性漏洞的成本可能仅为本文所述工作流的1/5(误差范围极大)。(1/2)🧵
ARTICLE UPDATE ALERT: The day after we published Finding Miscompiles for Fun, Not Profit, Anthropic released Opus 4.8 and ultracode mode in Claude Code. Our preliminary experiments indicate that together these are significantly better at filtering out low-severity bugs, and that the cost per medium-to-high severity bug found is maybe 1/5 (with VERY large error bars) that of the workflow described in this article. (1/2)🧵