Equilibrium Reasoners：学习吸引子实现可扩展推理

2026-05-20 08:00·44天前

AI 摘要

Equilibrium Reasoners (EqR) 提出一种无需外部验证器的测试时计算扩展框架。其核心假设是，可泛化的推理能力源于学习任务条件下的吸引子，即稳定不动点对应有效解的潜在动力系统。EqR通过深度（更多迭代）与广度（聚合多条随机轨迹）两个维度扩展内部动力。实验表明，测试时扩展的收益与向解对齐吸引子的收敛强度紧密相关。该框架使模型能根据任务难度自适应分配计算：简单案例在1至5次迭代内收敛，复杂案例则受益于大规模扩展。通过展开相当于40,000层，可扩展潜在推理在Sudoku-Extreme任务上将准确率从2.6%提升至超过99%。

原文 · 未翻译

Scaling test-time compute by iteratively updating a latent state has emerged as a powerful paradigm for reasoning. Yet the internal mechanisms that enable these iterative models to generalize beyond memorized patterns remain unclear. We hypothesize that generalizable reasoning arises from learning task-conditioned attractors: latent dynamical systems whose stable fixed points correspond to valid solutions. We formalize this process through Equilibrium Reasoners (EqR), which enable test-time scaling without external verifiers or task-specific priors. EqR scales internal dynamics along two axes: depth, by running more iterations, and breadth, by aggregating stochastic trajectories from multiple initializations. Empirically, gains from test-time scaling are tightly coupled with stronger convergence toward solution-aligned attractors. This attractor perspective allows neural networks to adaptively allocate test-time compute based on task difficulty. While simple cases converge within 1 to 5 iteration steps, harder cases benefit from massive test-time scaling. By unrolling up to the equivalent of 40,000 layers, scalable latent reasoning boosts accuracy from 2.6% for feedforward models to over 99% on Sudoku-Extreme. These results suggest that learned attractor landscapes provide a useful mechanistic lens for understanding scalable reasoning in iterative latent models.

HuggingFace Daily Papers（社区热门论文）

54导出 Markdown

Equilibrium Reasoners：学习吸引子实现可扩展推理

2026-05-20 08:00·44天前

阅读原文· arxiv.org

AI 摘要

原文 · 保持原样，未翻译