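OpenAI has released a limited preview of GPT-5.6 Sol (frontier model), Terra (balanced everyday model), and Luna (fast, low-cost model). After testing Sol, swyx rated it very highly, saying it should not be seen as just a "cyber" release: it is the new SOTA workhorse model, completely replacing Opus for 80% of his tasks. The key data point: Sol is competitive with Mythos Preview while using only about 1/3 of the output tokens. swyx notes that the OAI post-training team has shifted the reasoning Pareto frontier substantially without disclosing how, and that this is now the single most important competitive advantage in agentic models for enterprise. He considers this point release a far bigger jump than 5.4→5.5, and argues it should arguably have been named GPT-6 outright.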
have been testing 5.6 for a while and VERY happy with it. DO NOT view this as just a "cyber" release, it is the new sota workhorse model, completely replacing opus for 80% of tasks for me
GPT-5.6 Sol is competitive with Mythos Preview using only ~1/3 of the output tokens.
this is a very key line. OAI posttraining team has shifted the reasoning pareto frontier by A LOT and they aren't saying anything about how they did it because this is the single most important competitive advantage right now in agentic models for enterprise.