# RobotValues：家庭机器人价值观冲突评估基准

- 来源：HuggingFace Daily Papers（社区热门论文）
- 发布时间：2026-06-02 08:00
- AIHOT 分数：57
- AIHOT 链接：https://aihot.virxact.com/items/cmq0fzz6c05vesltr4mzn3wx4
- 原文链接：https://arxiv.org/abs/2606.03312

## AI 摘要

RobotValues 是一个包含1万个价值观冲突场景的基准，用于评估家庭机器人规划器在任务成功与自主性、效率、社会恰当性等人类价值观冲突时的行为选择。该基准通过LLM辅助场景生成、利益相关者导向的价值提取、图像生成和自动质量控制构建。使用RobotValues评估视觉语言模型（VLM）发现，模型默认偏向安全和包容，较少优先隐私保护。当指令要求优先与自身偏好冲突的特定价值观时，模型在80%的情况下无法覆盖默认动作，选择了错误行为。这表明家庭机器人评估应衡量价值观冲突中选择合理动作的能力。

## 正文

While household robots are often evaluated based on task completion, everyday domestic environments involve value-conflicting situations in which robots are expected to choose actions that prioritize other values than task success, such as human autonomy, efficiency, or social appropriateness. Yet, there are no benchmarks for evaluating robots' value preferences in such scenarios. We introduce RobotValues, a benchmark to evaluate household robot planners in 10K value-conflict scenarios. Each instance consists of a realistic household image with multiple plausible robot actions that prioritize different human values. We construct RobotValues through LLM-assisted scenario generation, stakeholder-grounded value extraction, image generation and automatic quality control. Using RobotValues we evaluate VLMs used in robotics and find that models exhibit default value preferences, including safety and accommodation, while underselecting privacy-prioritizing actions. When the models are instructed to prioritize specific values that conflict with their own preferences, they often fail to override their default actions, choosing incorrect actions for 80% of the time. These findings suggest that household robot evaluation should measure not only task completion or safety compliance, but also whether robots can choose among plausible actions when human values conflict.
