# Codex风评逆转：GPT-5.5领跑新基准测试DeepSWE

- 来源：Chubby♨️ (@kimmonismus)
- 发布时间：2026-05-27 05:44
- AIHOT 分数：62
- AIHOT 链接：https://aihot.virxact.com/items/cmpn6mioc0uxbsl01kyyi72qm
- 原文链接：https://x.com/kimmonismus/status/2059390325077004549

## AI 摘要

近期开发者社区对Codex的评价显著转好，许多观点认为搭配GPT-5.5的Codex表现优异，其部分使用体验甚至常被优先选择。与此同时，新发布的智能体编码基准测试DeepSWE显示，GPT-5.5在此评测中位列第一。该基准测试旨在打破顶尖模型在公开排行榜上能力相近的表象，更真实地反映模型在开发者日常任务中的实际差异。

## 正文

It's truly amazing to see how the general sentiment has shifted in favor of Codex.

I'm reading so many posts saying that Codex is really good now with GPT-5.5， and that Claude Code is regularly preferred.

（I've become a huge Codex fan myself）.

At the same time， the new DeepSWE benchmark shows that GPT-5.5 is now ranked number one in this measurement as well.

### 引用推文

> Serena Ge (Datacurve)：Today we're releasing DeepSWE, a new standard for agentic coding benchmarks. On public leaderboards, top models often look relatively close in capability. DeepS...