推文观点认为,将自动驾驶视为专注于避障的低维行动空间二维机器人,能更快产生实际影响。Waymo世界模型的核心不止于视频生成,更是对连续、高维、多模态嘈杂信号的建模。该模型基于Google DeepMind的Genie 3构建,能创建大规模、超逼真的驾驶模拟。通过模拟如龙卷风、飞机降落高速公路等极端罕见场景,Waymo Driver可在真实遭遇前进行针对性训练,从而显著提升系统应对复杂情况的能力,加速自动驾驶技术的安全部署与成熟。
self-driving <as a 2D robot with a low-dim action space that focused mostly on avoidance rather than interaction> will reach real-world impact faster than anything else. the really cool part is that the world model isn't just about videos; it's about modeling continuous, high-dimension, and noisy signals of all kinds. that's what "multimodal" actually means. congrats to @maxjiang93, xander, bo, and the whole waymo team 👏