Nathan Lambert@natolambert

2026-06-24 22:25·8天前

AI 摘要

Nathan Lambert 发布了一期 AI 基础讲座视频，使用 GLM 5.2 模型生成。内容涵盖语言模型概述、LM Head、Softmax 与对数概率、训练样本结构、概率计算、后训练中的三种掩码、解码、交叉熵损失、优化与微调、预训练到 SFT 流程、KL 散度与熵、Sigmoid 与成对似然、强化学习 MDP 框架等。视频包含时间戳目录，并持续收集观众问题用于后续 Q&A。

Another quick lecture -- I've been asked many times for prereq's to my book and what you should know， so built a little lecture （with GLM 5.2） to cover some more basics.

Topics include：

00：00 Introduction & Course Prerequisites 01：37 Language Models Overview 02：47 The LM Head 04：29 Softmax & Log-Probabilities 06：13 Anatomy of an LM Training Example 06：37 Computing LLM Probabilities （+Phoebe the Dog） 09：52 Three Common Masks in Post-Training 11：03 A Small Decoding Review 12：14 Training an LM： Cross-Entropy 13：23 Optimization & Fine-Tuning 13：55 Pretraining to Midtraining to SFT Pipeline 15：25 Probability Essentials： KL Divergence & Entropy 19：36 Sigmoid & Pairwise Likelihood 20：29 Reinforcement Learning Framing （MDP） 22：28 Transitioning Tools into Post-Training 23：12 Recommended Resources & Wrap-Up  Happy learning and I'm still taking questions from during the course for Q&A videos.

安全/对齐教程/实践

在 X 查看原推导出 Markdown

Nathan Lambert@natolambert · X

49导出 Markdown

2026-06-24 22:25·8天前

在 X 看原推· x.com

AI 摘要

Another quick lecture -- I've been asked many times for prereq's to my book and what you should know， so built a little lecture （with GLM 5.2） to cover some more basics.

Topics include：