Rohan Paul@rohanpaul_ai · 6月23日75OpenAI’s new GPT-5.5-Cyber just beat Mythos 5 on CyberGym.
CyberGym measures whether an agent can reproduce known software vulnerabilities, so this is quite a strong signal for defensive vulnerability analysis of models.
OpenAI also launched a major push to use GPT-5.5-Cyber and human security teams to fix open source bugs before AI bug-hunting tools flood maintainers with low-quality reports.
Vulnerability discovery is becoming much easier, so the scarce part is now remediation, which means confirming the bug, proving reachability, writing a fix, testing it, and giving humans enough evidence to merge safely.
OpenAI’s initiative is to use GPT-5.5-Cyber as a defensive security worker inside Codex.
It scans code, checks whether a vulnerability is real and reachable, writes a patch, tests the patch, and gives humans evidence to approve it.
Daybreak is OpenAI’s new cybersecurity initiative to help trusted defenders find, verify, and patch vulnerable software much faster using AI.
The new checkpoint of GPT-5.5-Cyber, are all part of the company's limited “Trusted Access for Cyber” program and do not involve a public release.
译OpenAI 新模型 GPT-5.5-Cyber 在 CyberGym 基准上击败 Mythos 5,该基准测试 AI 智能体复现已知软件漏洞的能力,对防御性漏洞分析是强信号。OpenAI 同步扩大 Daybreak 计划,包括:Codex Security 插件(在 Codex 内发现、验证并修复漏洞);GPT-5.5-Cyber 完整版(供受信任防御者使用);Cyber Partner Program(赋能安全公司构建基于 OpenAI 能力的安防产品);Patch the Planet(与维护者合作保护关键开源项目)。本轮模型和计划属于“Trusted Access for Cyber”项目,不公开发布。OpenAI 旨在用 GPT-5.5-Cyber 作为 Codex 内的防御性安全工人,自动扫描代码、确认漏洞真实可达、编写补丁并测试,