通义千问推出Qwen-Robot Suite,包含三个基础模型:Qwen-RobotNav统一5种导航任务(指令跟随、点目标、物体目标、目标追踪、自动驾驶),具备可控观测协议和智能体工具接口;Qwen-RobotManip实现异构机器人统一状态-动作空间,基于38,100+小时开源语料预训练;Qwen-RobotWorld是单一世界模型,支持20+具身形态,通过自然语言动作接口预测物理世界未来(涵盖操作、驾驶、导航)。三个模型可独立使用或组合,构成通用智能体的底层工具包。
📣 Introducing the Qwen-Robot Suite - Qwen-RobotNav, Qwen-RobotManip, Qwen-RobotWorld, three foundation models, a full stack for embodied intelligence.
🧭 Qwen-RobotNav - the gateway to mobility. • Unifies 5 navigation tasks in one model: instruction following, point-goal, object-goal, target tracking, autonomous driving • Controllable observation protocol • Tool interface for agentic systems
🤖 Qwen-RobotManip - the foundation of interaction. • Unified state-action space across heterogeneous robots • Camera-frame delta poses for coherent cross-embodiment training • Pretrained on a 38,100+ hour open-source corpus
🌍 Qwen-RobotWorld - infinite worlds for physical agents. • Single world model, 20+ embodiments • Natural-language action interface • Predicts physically grounded futures across manipulation, driving, and navigation
Each model is independently useful, and could be composed as physical-world tools.Together, they form the low-level toolkit for general-purpose agentic systems that don't just see the world, but act in it.
📷 Blog: https://qwen.ai/blog?id=qwen-robotsuite 📖 Report: Qwen-RobotNav: https://qianwen-res.oss-accelerate.aliyuncs.com/qwenrobot/papers/Qwen_RobotNav.pdf Qwen-RobotManip: https://qianwen-res.oss-accelerate.aliyuncs.com/qwenrobot/papers/Qwen_RobotManip.pdf Qwen-RobotWorld: https://qianwen-res.oss-accelerate.aliyuncs.com/qwenrobot/papers/Qwen_RobotWorld.pdf