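OpenAI's general-purpose reasoning model recently resolved the planar unit distance conjecture (an Erdős conjecture), which had stood for decades, by connecting algebraic number theory with plane geometry. The key point is that the model is not a specialized theorem-proving engine: its success depended on longer and deeper test-time computation rather than simply on more training data. This result suggests that frontier large models already contain latent mathematical research ability, and that the current bottleneck stems partly from how long, and in what way, a model is allowed to "think". The way forward is not for AI to replace human judgment but to widen the territory of thought before human judgment begins, and so to advance scientific discovery and innovation.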
A general-purpose LLM can produce frontier research when given enough test-time compute.
Here, a general-purpose OpenAI model, not a specialized theorem prover, connected algebraic number theory to plane geometry and used that bridge to resolve a decades-old conjecture.
This shows that frontier models may already contain useful latent mathematical competence, and that the bottleneck is partly how long and how well they are allowed to think.