# Robotics Progress Is Hard to Track; Independent Benchmarks Are Missing

- Source: Ethan Mollick (@emollick)
- Published: 2026-05-09 21:27
- AIHOT score: 46
- AIHOT link: https://aihot.virxact.com/items/cmoyerttf09b2sllhu263esg1
- Original link: https://x.com/emollick/status/2053104629282378061

## AI Summary

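AI benchmarks are flawed, but tracking AI progress is still far easier than tracking robotics. Robotics lacks clear yardsticks: demo videos of robots running races or doing laundry say little about real progress, so the post asks whether robotics has any equivalent of AI's independent benchmarks, floating the name ARC-AGI-BOT. The quoted tweet notes that, despite excitement about robotics, the timeline for the key leap that would make robots economically useful at scale remains uncertain and could be 1, 3, 5, or 10 years away.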

## Full Text

As much as the state of benchmarks in AI is flawed, it is so much easier to track AI progress than robotics. Not sure what you can make of all the videos of robots running races or doing laundry - are there any equivalents to independent AI benchmarks for robots? ARC-AGI-BOT?

### Quoted Tweet

> prinz：@Miles_Brundage I am actually extremely excited about robotics, but have not been able to figure out whether the major leap that makes robots useful economicall...