耶鲁大学和芝加哥大学最新论文基于11,683篇真实论文构造对照测试,让LLM基于与人类相同的先前工作提出研究动机和方法。结果发现,人类研究者想法模式多样(如解释机制、测试失败、测量证据、构建系统、提升效率),仅12.1%属于“连接已有工作”类;而LLM生成的同类想法占比高达47.1%至64.2%,频率约为人类的4至5倍。即使增加推理步骤(额外CoT),这一连接偏好反而更强,说明LLM倾向于优化已有配方,而非探索多样化的研究路径。
This is the prompt Yale and Univ of Chicago researchers used when asking LLMs for new research ideas.
Feed LLMs prior work, ask for ideas, then measure how repetitive the ideas get.
The surprising finding is that LLMs often treat research ideation as connecting what already exists, while humans use a wider set of problem-finding moves.
LLM-generated ideas reveal a bias toward safe bridge-and-combine proposals.