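AI Summary
Analyzes the generalization of reasoning SFT along three conditional dimensions (the optimization process, data composition, and model capability), re-examining how supervised fine-tuning generalizes on reasoning tasks and which factors most influence it.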
Rethinking Generalization in Reasoning SFT
A Conditional Analysis on Optimization, Data, and Model Capability
paper: https://huggingface.co/papers/2604.06628