The efficiency frontier!
Where do you think GPT-5.6 will land?
Claude Opus 4.8 has landed on DeepSWE Bench, posting a 58% Pass@1 and taking #2 overall behind GPT-5.5. It continues a broader trend: slightly behind on raw sco...