该研究提出"artifacts"概念,指环境中记录历史信息的可观察痕迹(如路径),并证明其可减少智能体需存储的历史信息。Artifact Reduction Theorem指出,当当前观察能保证过去事件发生时,无需同时存储两者即可预测未来。在五个导航场景中,能看到空间痕迹的智能体只需更少内部容量即可学习强策略(适用于linear Q-learning和DQN),且随机、次优或渐褪的路径同样有效。这表明记忆可外化于环境并通过感知读取,为智能体设计提供了除增加模型规模外的新思路。
This paper formalizes a simple idea: sometimes the world remembers for an agent, so the agent can remember less.
The problem is that AI research usually treats memory as something stored inside the agent, even when the environment may quietly keep useful records of earlier events.
The key idea is an artifact, which is a current observation that reveals something about the past, like a visible path that tells the agent where it has already been, and the paper proves that such artifacts can reduce how much history must be represented.
Once that exists, the Artifact Reduction Theorem says part of history has become redundant. If seeing X now guarantees Y happened earlier, you do not need to store both to predict what comes next.