# 静态基准测记忆而非智能

- 来源：François Chollet (@fchollet)
- 发布时间：2026-06-27 01:08
- AIHOT 分数：47
- AIHOT 链接：https://aihot.virxact.com/items/cmqv7e5zj0aqasl80dn6yrwiv
- 原文链接：https://x.com/fchollet/status/2070554884999692698

## AI 摘要

如果你的基准测试依赖于静态数据集或从训练时已知的静态分布中采样，那么它本质上衡量的是记忆/检索。如果你需要的是检索基准测试，那倒也无妨，但不要将其与智能混淆。

## 正文

If your benchmark relies on a static dataset or sampling from a static distribution densely known at training time， then it is fundamentally measuring memorization/retrieval. Which might be fine if you're looking for a retrieval benchmark！ But don't confuse it with intelligence.