DiscoPER:基于迭代元反思的自主科学发现框架
阅读原文· arxiv.orgDiscoPER 是一个大语言模型驱动的自主科学发现框架。它无需预设研究目标,动态生成代码探索数据集,且每个发现必须通过统计检验。框架引入二阶元反思机制,周期性分析自身已有发现,识别结构模式、混淆和认知空白,主动将假设探索重定向到未知区域。结合工具使用,可处理多模态来源(如图像)的信息。在 iNatDisco 生态基准上,DiscoPER 恢复 8/9 已知模式,假设支持率 72.7%,优于经典因果发现与 LLM 引导基线。消融实验证实随数据规模扩展及二阶元反思的收益。
Autonomous scientific discovery systems offer the potential to accelerate research by automating the process of hypothesis generation and validation. However, current systems operate within constrained search spaces or require predefined research questions, limiting their capacity for true open-ended inquiry. Furthermore, while they generate hypotheses iteratively, they largely lack the ability to explicitly synthesize their own accumulated findings to uncover complex, interconnected phenomena. We introduce DiscoPER, an autonomous large language model-powered framework that conducts open-ended research by dynamically generating and executing code to explore datasets without pre-specified research objectives. To ensure rigorous scientific validity, every proposed discovery must pass statistical testing. To overcome the limitations of isolated search, our framework introduces a second-order reasoning mechanism that periodically analyzes its own accumulated discoveries. By treating prior discoveries as empirical data, DiscoPER identifies structural patterns, confounds, and epistemic gaps, actively redirecting hypothesis exploration toward uncharted regions of the search space. The search space is further expanded by incorporating tool use, enabling the system to explore hypotheses beyond structured metadata by seamlessly processing and extracting useful information from multimodal sources like images. Evaluated on iNatDisco, a new multimodal ecological knowledge benchmark with pattern-level ground truth obtained from peer-reviewed literature, DiscoPER recovers 8 of 9 known patterns with a 72.7% hypothesis support rate, outperforming both classical causal discovery and LLM-guided baselines. Ablations show that DiscoPER scales with more data, and confirms the benefits of second-order meta-reflection.