Nathan Lambert@natolambert

2026-04-15 01:19·79天前

AI 摘要

我的一个信念是，教育应该尽可能免费且广泛地传播，尤其是对于 LLMs/AI 这样动态且关键的技术。我很自豪有这样一群朋友：如果我做付费课程，他们会与我绝交。 [引用 @natolambert]：很高兴为我的书推出配套的免费 RLHF 课程。作为开始，我已发布： - Welcome video - Lecture 1: Overview of RLHF & Post-training - Lecture 2: IFT, Reward Models, Rejection Sampling - Lecture 3: RL Math - Lecture 4: RL Implementation 我将在整个课程中添加问答视频，深入讲解需要展开的主题，并可能涵盖一些太新且仍在变动、无法印刷的内容。预计未来几个月总共会有10-15个视频。与此同时，本书代码的开发工作也在加速。现在是构建 Post-training 方法基础的好时机。 YT 播放列表和课程页面见下方。

One of my passions is that education should be dispersed freely and as widely as possible， especially for technologies as dynamic and crucial as LLMs/AI.

I'm proud to have friends who would disown me if I did a paywalled course.

Nathan LambertExcited to launch the accompanying free RLHF Course for my book. To kick it off, I've released: - Welcome video - Lecture 1: Overview of RLHF & Post-training - ...

教程/实践数据/训练

在 X 查看原推导出 Markdown

Nathan Lambert@natolambert · X

导出 Markdown

2026-04-15 01:19·79天前

在 X 看原推· x.com

AI 摘要

One of my passions is that education should be dispersed freely and as widely as possible， especially for technologies as dynamic and crucial as LLMs/AI.

I'm proud to have friends who would disown me if I did a paywalled course.