研究发现,当语言模型面对困难问题时,其内部“脑活动”会收缩到更少的路径中。模型在感到困惑时会压缩内部思考,表现为从广泛分散的神经元激活,坍缩为最终处理层中高度集中的信号。这是因为系统放弃了稳健的分布式记忆,将计算强制压缩到狭小的专门空间以应对陌生挑战。关键在于,这种收缩效应可被量化为一个原始数值,从而无需猜测问题对AI是否过难。通过读取此内部信号,便能自动为系统提供恰如其分的“垫脚石”以辅助其解决问题。
Researchers found that when language models face harder questions, their internal brain activity literally shrinks into fewer paths.
Language models actually compress their internal thinking when they get confused, and we can use that to help them.
Standard AI models usually spread their thinking across many artificial neurons when they confidently recognize familiar information.
The team discovered that if you confuse a model with tricky math or conflicting facts, this broad activation collapses into a highly concentrated signal in its final processing layer.
This shrinking happens because the system drops its robust distributed memory and forces the computation into a tiny specialized space to survive the unfamiliar challenge.