Rohan Paul@rohanpaul_ai

2026-06-14 22:30·18天前

AI 摘要

一项新研究发现，当前提升AI随时间表现的方法存在盲点：LLM智能体实际上并不理解或应用抽象规则总结，而是仅依赖直接复制原始逐步骤历史日志。实验显示，当研究者将浓缩的规则总结替换为随机垃圾文本时，智能体表现无下降；但破坏逐步执行历史则导致明显失败。这表明智能体只是在机械模仿过往步骤，而非真正从教训中学习。论文质疑需重新设计AI记忆机制，因为当前系统仅是模仿而非理解。

Researchers found our current approach to making AI smarter over time has a giant blind spot.

AI is not actually understanding or applying high-level abstract lessons at all.

Developers spend massive amounts of time building systems that condense past AI mistakes into neat little rules for the future.

This paper proves that the AI essentially throws those rules in the trash and only looks at raw historical logs.

Modern LLM systems try to get better over time by storing past tasks as either raw step-by-step histories or condensed summary rules. The study tested if these agents actually use their stored memories by secretly swapping the correct tips with random garbage text.

When the step-by-step histories were messed up， the AI failed hard， proving it heavily relies on copying exact past actions.

But when researchers completely corrupted the condensed summary rules， the AI kept acting normally and showed zero performance drop.

If an AI cannot apply an abstract lesson to a new situation， it is not truly reasoning or learning.

This raises the question if the entire AI industry need to rethink how memory works because right now these agents are just mimicking instead of understanding.

----

arxiv. org/abs/2601.22436

"LLM Agents Are Not Always Faithful Self-Evolvers"

智能体推理论文/研究

在 X 查看原推导出 Markdown

Rohan Paul@rohanpaul_ai · X

59导出 Markdown