OpenAI 发布 LifeSciBench 生命科学基准

elvis@omarsar0

2026-06-18 23:23·2天前

AI 摘要

OpenAI 推出 LifeSciBench，用于衡量 AI 支持真实世界生命科学研究的能力。该基准与 173 位生物技术与制药科学家共同开发，包含 750 个专家编写任务，覆盖七种生物研究流程。DAIR.AI 的 Elvis Saravia 推荐阅读，并指出通用模型在处理复杂结构时仍然失败，而面向科学研究的专用模型表现显著更优。

Recommended reading.

Great insights， especially in areas where general-purpose models continue to fail， like dealing with complex structures. It also highlights that for scientific research， specialized models are winning big time.

OpenAIIntroducing LifeSciBench, a benchmark for measuring and improving how well AI supports real-world life science research. Developed with 173 scientists from biot...

OpenAI评测/基准

在 X 查看原推

elvis@omarsar0 · X

2026-06-18 23:23·2天前

AI 摘要