Adaline 2.0 推出 AI 智能体自我改进层,将生产流量和用户反馈痕迹自动转化为行为聚类,进而生成评估(Evals)、合成边缘场景数据,并基于此产出新的智能体候选版本。开发者只需审核胜出版本即可上线。该工具无需人工逐条检查异常对话,可自动发现人类难以想到的评估用例。
Adaline just launched a self-improvement layer for AI agents that turns messy production traces into fresh evals, synthetic edge cases, and better agent candidates for humans to approve.
I expected it to be a regular trace viewer, but it is reading my production traffic and building evals I would never have considered.
It reads production traffic and user feedback, then clusters the mess into recognizable agent behaviours rather than asking a human to manually inspect every strange conversation.