AI 摘要
KnowRL 通过强化学习与最小充分知识指导来提升大语言模型的推理能力 论文: https://huggingface.co/papers/2604.12627 https://t.co/vnNFqXJ8hY
KnowRL
Boosting LLM Reasoning via Reinforcement Learning with Minimal-Sufficient Knowledge Guidance
paper: https://huggingface.co/papers/2604.12627