🚨 AI News | TestingCatalog @testingcatalog

2026-06-25 22:32 · 7 days ago

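AI Summary

DeepReinforce has released the Ornith-1.0 series of open-source models, built for agentic coding. The lineup covers 9B Dense, 31B Dense, 35B MoE, and 397B MoE, fine-tuned from gemma4 and qwen3.5. The models use a self-improving training strategy in which reinforcement learning generates both the solutions and the task scaffolds. The flagship 397B MoE matches Claude Opus 4.7 on coding benchmarks, and the 9B Dense is optimized for edge devices. Reported scores include 77.5 on Terminal-Bench 2.1, 82.4 on SWE-Bench verified, 62.2 on SWE-Bench Pro, and 48.2 on NL2Repo, among others. All models are open-sourced under the MIT license for commercial and research use.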

DeepReinforce has released Ornith-1.0, their new self-improving family of open-source models designed for agentic coding.

Ornith-1.0 learns to write its own task scaffolds during training rather than relying on human-designed harnesses.

The 397B MoE flagship can match Claude Opus 4.7 on coding benchmarks, and the compact 9B Dense variant is optimized for edge devices.

OrnithAloha! 🌺 Meet Ornith-1.0, a family of open-source LLMs specialized for agentic coding. Ornith-1.0 spans the full parameter sizes including 9B Dense, 31B Dense,...

Tags: Open-source ecosystem · Model release · Coding
