近期开发者社区对Codex的评价显著转好,许多观点认为搭配GPT-5.5的Codex表现优异,其部分使用体验甚至常被优先选择。与此同时,新发布的智能体编码基准测试DeepSWE显示,GPT-5.5在此评测中位列第一。该基准测试旨在打破顶尖模型在公开排行榜上能力相近的表象,更真实地反映模型在开发者日常任务中的实际差异。
It's truly amazing to see how the general sentiment has shifted in favor of Codex.
I'm reading so many posts saying that Codex is really good now with GPT-5.5, and that Claude Code is regularly preferred.
(I've become a huge Codex fan myself).
At the same time, the new DeepSWE benchmark shows that GPT-5.5 is now ranked number one in this measurement as well.