# When a new dataset comes out， I get excited and check it out and then only realize that this is anot…

- 来源：Lilian Weng (@lilianweng)
- 发布时间：2025-05-13 04:51
- AIHOT 链接：https://aihot.virxact.com/items/cmnxjn7to00elsl9o1ayilsee
- 原文链接：https://x.com/lilianweng/status/1922031826731716997

## AI 摘要

当新数据集发布时，我会很兴奋地去查看，然后才意识到这又是一个元混合数据集，结合了其他现有数据集的集合。我的大脑立刻反应："我去……数据污染！" 请不要有元元混合数据集了 :lolsob:

## 正文

When a new dataset comes out， I get excited and check it out and then only realize that this is another meta-mixed dataset combining a collections of other existing datasets. My brain immediately acts like "oh fork … contamination！" No meta-meta-mixed dataset plzzzz ：lolsob：
