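The ii team has released the open-source Zenith harness, which uses adaptive self-improvement to push base models to the top of the FrontierSWE benchmark, surpassing Fable on complex tasks that take hours or days (such as training protein prediction models and optimizing compilers). The team also teased that GLM 5.2 is coming soon.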
We have seen multi-model harnesses for cheaper & faster tasks
What about for the hardest challenges?
What about open source?
Proud to share the latest update to our Zenith harness, taking models you can use today above Fable on tasks that take hours or days