AI Summary
OSWorld2.0 evaluates computer-use agents on long-horizon real-world tasks.
OSWorld2.0
Benchmarking Computer Use Agents on Long-Horizon Real-World Tasks