One of my passions is that education should be dispersed freely and as widely as possible, especially for technologies as dynamic and crucial as LLMs/AI.
I'm proud to have friends who would disown me if I did a paywalled course.

[Quoting @natolambert]: Happy to launch a free RLHF course to accompany my book. To start, I've released:
- Welcome video
- Lecture 1: Overview of RLHF & Post-training
- Lecture 2: IFT, Reward Models, Rejection Sampling
- Lecture 3: RL Math
- Lecture 4: RL Implementation

I'll add Q&A videos throughout the course that go deeper on topics needing more explanation, and that may cover material too new and still changing to put in print. Expect 10-15 videos in total over the coming months.

Meanwhile, development of the book's code is also accelerating. Now is a good time to build a foundation in Post-training methods.

The YT playlist and course page are below.