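Kim thinks GPT-5.6 may offer better value for money than Fable 5, but since Fable already has a newer version, 5.1, Fable remains the better model in the short term. @synthwavedd's review makes several points: GPT-5.6 inherits the weaker base model of 5.5; the largest configuration (Sol Ultra) can beat Fable, but Fable is better in real-world use; there is serious reward hacking, and OpenAI publishes benchmarks selectively; the price of 5/30 (per million tokens) undercuts Fable's 10/50, but Fable completes more tasks with fewer tokens; and Terra and Luna look excellent on price-performance on TBench 2.1 but may feel worse in actual use. Kim is also worried about not being able to get access to GPT-5.6 in Europe.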
That reads like a solid initial assessment. GPT-5.6 will probably offer a better price-performance ratio than Fable 5. However, given the recent announcement that Fable 5 already has a newer version (5.1?), it seems logical that Fable will remain the better overall model for the time being.
What's far worse, though, is that I can only hope I'll get access to GPT-5.6 in Europe at all.
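On the pricing point, here is a rough sketch of how much more token-efficient Fable would need to be for the higher list price to even out. It assumes that "5/30" and "10/50" mean input/output price per million tokens, and that Fable's token usage shrinks by the same factor on input and output. The workload numbers and function names are made up for illustration; they are not from the review.

```python
def cost(input_tokens, output_tokens, price_in, price_out):
    """Cost of a workload, given per-million-token input/output prices."""
    return (input_tokens * price_in + output_tokens * price_out) / 1_000_000


def break_even_ratio(input_tokens, output_tokens, cheaper, pricier):
    """Fraction of the cheaper model's tokens the pricier model may use
    before it becomes the more expensive option for this workload.

    Assumes input and output token counts scale by the same factor.
    """
    return (cost(input_tokens, output_tokens, *cheaper)
            / cost(input_tokens, output_tokens, *pricier))


# Prices as quoted in the review; input/output interpretation is an assumption.
GPT_56 = (5.0, 30.0)
FABLE_5 = (10.0, 50.0)

# Hypothetical workload: 200k input tokens, 50k output tokens on GPT-5.6.
ratio = break_even_ratio(200_000, 50_000, GPT_56, FABLE_5)
print(f"Fable breaks even if it uses at most {ratio:.0%} of GPT-5.6's tokens")
```

Under these assumptions the break-even ratio always lands between 0.5 (input-only workloads) and 0.6 (output-only workloads), so Fable would have to get the same work done with roughly half the tokens before it wins on cost alone.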