Train with autoregression & convert weights to diffusion for inference.
Most researchers agree that autoregression is best when memory bandwidth is cheap and diffusion is best when FLOPS are cheap. They also admit the future of comp...
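
To make the bandwidth-versus-FLOPs trade-off concrete, here is a rough back-of-envelope sketch in Python. It is not taken from the post: the function names, the 2-FLOPs-per-parameter-per-token rule of thumb, the 2-byte weights, and the example sizes are all assumptions chosen for illustration. It only counts weight reads at batch size 1 and ignores attention, KV-cache, and activation traffic.

```python
from dataclasses import dataclass


@dataclass
class DecodeCost:
    flops: float       # total floating-point operations for the whole generation
    bytes_read: float  # total weight bytes streamed from memory

    @property
    def intensity(self) -> float:
        """Arithmetic intensity in FLOPs per byte of weights read."""
        return self.flops / self.bytes_read


def autoregressive_cost(params: float, new_tokens: int,
                        bytes_per_param: float = 2.0) -> DecodeCost:
    # Assumption: one forward pass per generated token (KV cache in use),
    # so each pass streams every weight once to compute a single position.
    passes = new_tokens
    return DecodeCost(
        flops=2.0 * params * new_tokens,
        bytes_read=passes * params * bytes_per_param,
    )


def diffusion_cost(params: float, new_tokens: int, denoising_steps: int,
                   bytes_per_param: float = 2.0) -> DecodeCost:
    # Assumption: one forward pass per denoising step, and each pass
    # recomputes all new_tokens positions in parallel.
    passes = denoising_steps
    return DecodeCost(
        flops=2.0 * params * new_tokens * denoising_steps,
        bytes_read=passes * params * bytes_per_param,
    )


# Hypothetical sizes: a 1B-parameter model generating 256 tokens,
# with 32 denoising steps for the diffusion variant.
ar = autoregressive_cost(params=1e9, new_tokens=256)
diff = diffusion_cost(params=1e9, new_tokens=256, denoising_steps=32)

flops_ratio = diff.flops / ar.flops            # extra compute diffusion spends
bytes_ratio = ar.bytes_read / diff.bytes_read  # weight traffic diffusion saves
```

Under these assumptions the autoregressive run sits at about 1 FLOP per byte of weights, while the diffusion run sits at about 256 FLOPs per byte: it spends 32 times more compute but streams 8 times fewer weight bytes. That is the sense in which diffusion pays off when FLOPs are the cheap resource and autoregression pays off when memory bandwidth is.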