从技能到人才：将异构智能体组织为现实世界的公司

2026-04-27 12:00·55天前·Zhengxu Yu, Yu Fu, Zhiyuan He, Yuxuan Huang, Lee Ka Yiu, Meng Fang, Weilin Luo, Jun Wang

精选理由

这篇论文把多智能体系统从固定的管道变成了一个自组织、可进化的『AI公司』，对做Agent框架的人来说，这个『组织层』的抽象和动态招聘机制是个值得借鉴的新思路。

AI 摘要

研究提出 OneManCompany (OMC) 框架，将多智能体系统提升至组织层面。该框架将技能、工具与配置封装为可移植的“人才”身份，通过类型化接口协调异构后端，并借助社区驱动的“人才市场”实现按需招募，动态弥补能力缺口。组织决策通过“探索-执行-评审”树搜索实现，将规划、执行与评估统一为分层循环，并提供终止与无死锁的形式化保证。在 PRDBench 上的实验显示，OMC 达到 84.67% 的成功率，较现有最佳技术提升 15.48 个百分点，跨领域案例验证了其通用性与自组织能力。

原文 · 未翻译

Computer Science > Artificial Intelligence

[Submitted on 24 Apr 2026]

Title:From Skills to Talent: Organising Heterogeneous Agents as a Real-World Company

Authors:Zhengxu Yu, Yu Fu, Zhiyuan He, Yuxuan Huang, Lee Ka Yiu, Meng Fang, Weilin Luo, Jun Wang

View PDF HTML (experimental)

Abstract:Individual agent capabilities have advanced rapidly through modular skills and tool integrations, yet multi-agent systems remain constrained by fixed team structures, tightly coupled coordination logic, and session-bound learning. We argue that this reflects a deeper absence: a principled organisational layer that governs how a workforce of agents is assembled, governed, and improved over time, decoupled from what individual agents know. To fill this gap, we introduce \emph{OneManCompany (OMC)}, a framework that elevates multi-agent systems to the organisational level. OMC encapsulates skills, tools, and runtime configurations into portable agent identities called \emph{Talents}, orchestrated through typed organisational interfaces that abstract over heterogeneous backends. A community-driven \emph{Talent Market} enables on-demand recruitment, allowing the organisation to close capability gaps and reconfigure itself dynamically during execution. Organisational decision-making is operationalised through an \emph{Explore-Execute-Review} ($\text{E}^2$R) tree search, which unifies planning, execution, and evaluation in a single hierarchical loop: tasks are decomposed top-down into accountable units and execution outcomes are aggregated bottom-up to drive systematic review and refinement. This loop provides formal guarantees on termination and deadlock freedom while mirroring the feedback mechanisms of human enterprises. Together, these contributions transform multi-agent systems from static, pre-configured pipelines into self-organising and self-improving AI organisations capable of adapting to open-ended tasks across diverse domains. Empirical evaluation on PRDBench shows that OMC achieves an $84.67\%$ success rate, surpassing the state of the art by $15.48$ percentage points, with cross-domain case studies further demonstrating its generality.

Comments:
Subjects:	Artificial Intelligence (cs.AI)
Cite as:	arXiv:2604.22446 [cs.AI]
	(or arXiv:2604.22446v1 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2604.22446 arXiv-issued DOI via DataCite

Submission history

From: Zhengxu Yu [view email]
[v1] Fri, 24 Apr 2026 11:02:44 UTC (8,217 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.AI

< prev | next >

new | recent | 2026-04

Change to browse by:

References & Citations

Bookmark

Bibliographic and Citation Tools

Bibliographic Explorer (What is the Explorer?)

Connected Papers (What is Connected Papers?)

Litmaps (What is Litmaps?)

scite Smart Citations (What are Smart Citations?)

Code, Data and Media Associated with this Article

alphaXiv (What is alphaXiv?)

CatalyzeX Code Finder for Papers (What is CatalyzeX?)

DagsHub (What is DagsHub?)

Gotit.pub (What is GotitPub?)

Hugging Face (What is Huggingface?)

ScienceCast (What is ScienceCast?)

Demos

Replicate (What is Replicate?)

Hugging Face Spaces (What is Spaces?)

TXYZ.AI (What is TXYZ.AI?)

Recommenders and Search Tools

Influence Flower (What are Influence Flowers?)

CORE Recommender (What is CORE?)

Author
Venue
Institution
Topic

arXivLabs: experimental projects with community collaborators

arXivLabs is a framework that allows collaborators to develop and share new arXiv features directly on our website.

Both individuals and organizations that work with arXivLabs have embraced and accepted our values of openness, community, excellence, and user data privacy. arXiv is committed to these values and only works with partners that adhere to them.

Have an idea for a project that will add value for arXiv's community? Learn more about arXivLabs.

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

智能体论文/研究

arXiv：cs.AI（全量分类）

精选75