研究通过贪婪剪枝方法(逐个删除对模型似然度影响最小的token)评估LLM推理token的功能重要性。发现符号数学token比语法叙述更能经受剪枝,表明模型内部存在重要性排序。重要性具有动态性,早期可丢弃的token可能在上下文减少后变得关键。注意力模式可预测剪枝分数,说明功能重要性在模型内部可见。该发现有助于使chain-of-thought更可解释,而非仅仅缩短长度。
Reasoning tokens in LLMs are not equal.
Models seem to know which parts of their own reasoning matter most.
What survives pruning is usually the part doing actual computational work, not the fluent narration wrapped around it.
The method is clever in a plain way. Start with a full chain of thought, delete one token at a time, and keep deleting whichever removal hurts the model's likelihood least. The resulting order becomes a functional ranking, not of what sounds important to us, but of what the model itself seems to need.
Here's the interesting part.
If a model's reasoning were just verbose decoration, pruning should look mostly random once you preserve the answer. Instead, the paper finds structure. Symbolic math tokens survive pruning far more than grammar, narration, and referential bookkeeping, which means the model is not treating all tokens as equally useful.