AI 摘要
看着所有模型依托快速改进的后训练陆续发布,显然我们需要一个完全开放的实验室,展示现代后训练中应优先拉动哪些杠杆。 现有的完全开放方案如 olmo 3 正迅速落后。糟糕的均衡。
Watching all the model releases come out on the back of quickly improving post-training makes it clear we need a fully open lab showing the high-priority levers to pull on modern post-training.
Existing fully open recipes like olmo 3 quickly falling behind. A bad equilibrium.