atomic.chat桌面应用对Claude Sonnet 5、Opus 4.8、Sonnet 4.6及GPT 5.5进行对比测试。使用同一提示词构建三个HTML5物理碰撞演示(汽车撞墙、破坏球毁屋、投石机砸城)。Sonnet 5在全部测试中与GPT 5.5和Opus 4.8表现相当,其中破坏球场景胜Opus 4.8,投石机场景胜GPT 5.5。Sonnet 5仅用15,047 tokens($0.15),GPT 5.5使用31,152 tokens($0.94),成本低约6倍;Opus 4.8使用23,063 tokens($0.58),Sonnet 4.6使用25,824 tokens($0.39)。Sonnet 5 token消耗最少,图形细节仍有提升空间。
atomic【.】chat, a desktop app that runs LLMs locally, ran a very revealing comparison for Claude Sonnet 5, Claude Opus 4.8, Claude Sonnet 4.6, and GPT 5.5.
Claude Sonnet 5 just matched GPT 5.5 on 3 physics coding demos at 6x lower cost.
Also spent minimum number of tokens.
- Sonnet 5: 15,047 tokens, $0.15
- Opus 4.8: 23,063 tokens, $0.58