Cursor 正用当前版 Composer 训练下一代 Composer,形成递归自我改进循环。训练大型模型需要大量 RL 数据(模型通过“游戏”提升能力),新模型能自动配置开发环境(如自动安装依赖、修复故障)。Composer 2 在环境配置能力上显著优于版本 1,模型越强,越擅长创造训练其继任者的条件。Cursor 的 autoinstall 系统让前代 Composer 设置 RL 训练环境,使下一代专注于解决更难题,每一代都解锁先前版本不具备的能力。
We're training the next version of Composer… with Composer!
The model is always learning from itself. This kind of "recursive self-improvement" might sound new, but it's been happening for many months!
For example, training big models requires creating *lots* of data for RL - essentially games the model plays to improve at any task you can grade.
The newest models can configure their own environments to make those games playable (auto-installing dependencies, fixing broken setups).
Composer 2 was *dramatically* better at this than version 1. So the better the model gets, the better it gets at creating the conditions to train its successor.