Beyond Alignment: Value Diversity in Multicultural Agent Systems
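Source: arxiv.org
Existing evaluations focus on the value alignment of individual agents and cannot measure the cultural plurality of a system as a whole. This study proposes value diversity as a system-level evaluation axis and, using the World Values Survey, measures response differences across 18 backbone models under 19 cultures. The results show that diversity is largely uncorrelated with alignment and that current systems fall far below human societies in value diversity. Mixed-backbone systems narrow the gap but do not close it, social interaction further erodes diversity by driving agents toward consensus, and a participatory budgeting case study shows that this narrows the breadth of collective decision-making. Code and data are publicly available.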
Multicultural multi-agent systems are increasingly deployed in globally diverse settings, where different agents are grounded in different cultural backgrounds. Existing cultural evaluation focuses on value alignment: how closely a single agent matches a target culture. Yet alignment is a per-agent property and cannot reveal whether a system, taken as a whole, preserves the cultural plurality it is meant to represent. We propose value diversity as a system-level evaluation axis for multicultural agent systems, defined through the dissimilarity between culturally conditioned agents' responses on a shared value survey. Using the World Values Survey, we evaluate 19 cultures and 18 backbone models across a wide range of system configurations. We find that diversity is largely uncorrelated with alignment, indicating that the two capture complementary system properties, and that current multicultural agent systems fall substantially below human societies in value diversity. Mixed-backbone systems narrow this gap but do not close it, and the gap persists across culture compositions and agent scales. Social interaction further erodes diversity by driving agents toward consensus, and a participatory budgeting case study shows that this homogenization narrows the breadth of collective decision-making. Together, our results establish value diversity as a distinct evaluation axis for multicultural multi-agent systems and reveal a persistent homogenization tendency in current LLM-based societies. Our code and data are publicly available at https://github.com/iNLP-Lab/MultiAgent-Diversity.
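To make the system-level definition concrete, the following is a minimal sketch of one way value diversity could be computed: the mean pairwise dissimilarity between agents' answers to the same survey items. The abstract does not specify the dissimilarity function, so the normalized mean absolute difference used here, the 1 to 10 answer scale, and the names `pairwise_dissimilarity` and `value_diversity` are all illustrative assumptions, not the authors' implementation.

```python
from itertools import combinations


def pairwise_dissimilarity(a, b, scale_min=1, scale_max=10):
    """Mean absolute difference between two response vectors, scaled to [0, 1].

    Assumption: every item is answered on the same ordinal scale
    [scale_min, scale_max]; the paper's actual metric may differ.
    """
    if len(a) != len(b):
        raise ValueError("agents must answer the same set of questions")
    span = scale_max - scale_min
    return sum(abs(x - y) for x, y in zip(a, b)) / (span * len(a))


def value_diversity(responses, scale_min=1, scale_max=10):
    """Average pairwise dissimilarity over all agent pairs in one system."""
    agents = list(responses)
    if len(agents) < 2:
        return 0.0  # a single agent has no one to differ from
    pairs = list(combinations(agents, 2))
    total = sum(
        pairwise_dissimilarity(responses[p], responses[q], scale_min, scale_max)
        for p, q in pairs
    )
    return total / len(pairs)


# Hypothetical toy system: three agents answering three shared survey items.
system = {
    "agent_a": [1, 5, 10],
    "agent_b": [1, 5, 10],   # identical to agent_a
    "agent_c": [10, 5, 1],   # opposite on two items
}

print(round(value_diversity(system), 3))
```

Under this sketch, a system whose agents all give the same answers scores 0 regardless of how well each one matches its target culture, which illustrates why diversity is a property of the system rather than of any single agent.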