世界模型被视为AI继大语言模型后的关键新范式,过去18个月已获百亿美元投资,其核心承诺是通过规模化数据推动机器人基础模型发展。然而,该术语目前被广泛滥用,含义模糊。本文系统阐述了世界模型的五大特质,对比了不同技术路径,探讨了其在机器人及其他领域的应用与未来机遇。领域参与者包括谷歌Genie、特斯拉Optimus等巨头产品,以及众多专注世界模型或机器人基础模型的初创公司。它很可能成为未来十年的奠基性技术之一。
This is the single best read on World Models and one of the most important reads in AI.
$10B has flowed into "world models" in the last 18mos, from Yann LeCun to FeiFei Li. The promise is, like LLMs, world models will provide the data it takes to scale robotics foundation models, and solve robotics.
..but the word has been abused to mean one of many things.
This post unpacks: - What 5 traits makes a world model? - How do the different approaches stack up? - What is it used for within and beyond robotics? - Where is the opportunity? - Citations to research, news and blog posts
Companies / products in the space include: - BigCo products: Google Genie, Tesla Optimus, Nvidia DreamDojo, DreamZero, Microsoft Muse - Pure world model: AMI Labs, World Labs, Runway, Rhoda, Decart, Spaitial, Odyssey, Embo, Dream Labs, OneWorld - Robot foundation model cos: Skild, Physical Intelligence, Figure, Mind