AI 摘要
OpenAI拥有所谓的人类强化学习,这相当于说他们有一大批人员查看ChatGPT的输出,然后判断其是否合适。本质上他们是在训练AI撒谎。 — Elon Musk
"OpenAI have what's called human reinforcement learning, which is another way of saying that they have a whole bunch of people that look at the output of ChatGPT and then say whether that's okay or not okay. Essentially they are training the AI to lie."
- Elon Musk