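DeepReinforce has released Ornith-1.0, a family of open-source models built for agentic coding. The lineup spans 9B Dense, 31B Dense, 35B MoE, and 397B MoE variants, fine-tuned from gemma4 and qwen3.5. Training follows a self-improvement strategy in which reinforcement learning generates both the solutions and the task scaffolds. The flagship 397B MoE matches Claude Opus 4.7 on coding benchmarks, while the 9B Dense is optimized for edge devices. Reported scores include 77.5 on Terminal-Bench 2.1, 82.4 on SWE-Bench verified, 62.2 on SWE-Bench Pro, and 48.2 on NL2Repo, among others. All models are released under the MIT license for both commercial and research use.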
DeepReinforce has released Ornith-1.0, its new self-improving family of open-source models designed for agentic coding.
Ornith-1.0 learns to write its own task scaffolds during training rather than relying on human-designed harnesses.
The 397B MoE flagship can match Claude Opus 4.7 on coding benchmarks, and the compact 9B Dense variant is optimized for edge devices.