Nathan Lambert@natolambert

2026-06-16 21:44·16天前

AI 摘要

Nathan Lambert与Finbarr Timbers合作推出新播客，系统调研GLM 5.1、Kimi K2.6、DeepSeek V4、Xiaomi MiMo V2.5、Nemotron Ultra等模型的最新后训练方法。核心讨论包括：行业转向多教师在线策略蒸馏（MOPD）的原因；Olmo风格配方需改进的方向；后训练如何适配大型组织；以及在AGI早期阶段的职业建议。播客章节涵盖历史回顾、2026年模型配方（MiMo Flash、DeepSeek V4、GLM 5、Kimi K2.6等）及开放式后训练讨论。

New podcast with @finbarrtimbers！ We survey the latest post-training recipes， from GLM 5.1， Kimi K2.6， DeepSeek V4， Xiaomi MiMo V2.5， Nemotron Ultra， etc. and discuss： - Why the industry slowly shifted to multi-teacher on-policy distillation （MOPD）. - What an Olmo-style recipe would need improvements in - How post-training works / suits larger organizational efforts - Career advice in the foothills of the singularity - and other topics

I heard y'all wanted me to start doing this， so making some time when I'm in funemployment！

Chapters：

00：00 Introduction & Olmo reflections 06：28 Post-train recipes review （history） 23：00 2026's model recipes （MiMo Flash， DeepSeek V4， GLM 5， Kimi K2.6， etc.） 39：05 Open-ended post-training discussions 48：22 Career advice in the LLM race

Links below， please follow @interconnectsai and like and subscribe and buy my book？