Chubby♨️@kimmonismus

2026-05-27 05:44 · 37 days ago

AI Summary

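Developer-community sentiment toward Codex has recently improved markedly. Many posts argue that Codex paired with GPT-5.5 performs very well, and that its user experience is often even the preferred choice. Meanwhile, the newly released agentic coding benchmark DeepSWE ranks GPT-5.5 first. The benchmark aims to break through the appearance that top models are close in capability on public leaderboards, and to reflect more faithfully how models actually differ on developers' everyday tasks.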

It's truly amazing to see how the general sentiment has shifted in favor of Codex.

I'm reading so many posts saying that Codex is really good now with GPT-5.5, and that Claude Code is regularly preferred.

(I've become a huge Codex fan myself.)

At the same time, the new DeepSWE benchmark shows that GPT-5.5 is now ranked number one in this measurement as well.

Serena Ge (Datacurve): Today we're releasing DeepSWE, a new standard for agentic coding benchmarks. On public leaderboards, top models often look relatively close in capability. DeepS...

