Rohan Paul@rohanpaul_ai

2026-05-19 05:16

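AI summary

PolyAI's research shows that Raven 3.5, a smaller model designed specifically for customer service, significantly outperforms general-purpose frontier models 100 times its size. The model beats GPT-5 and Claude Sonnet 4.6 on all four customer service benchmarks while keeping response latency under 300 ms. The release also includes ADK, a code-first development kit, and PolyPhone, a tool that generates web voice agents, to help enterprises quickly build production-grade voice agents. The aim is to turn enterprise voice AI from a large project into infrastructure that can be deployed quickly, addressing long wait times and high costs in customer service and improving service efficiency and customer experience.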

Can a smaller model purpose-built for one domain beat a frontier general model that's 100× its size?

A recent paper says yes, and not by a small margin.

Raven 3.5 from PolyAI shows that a smaller specialist model can beat bigger general models on customer service calls.

It beats GPT-5 and Claude Sonnet 4.6 on all 4 customer service benchmarks while staying under 300ms latency.

This is one of the live debates in ML. Every researcher is asking this question. The paper is the empirical answer.

PolyAI's research team published "Raven 3.5: The post-training recipe that beats GPT-5 for customer service".

-- Voice agents are moving from call-center software into everyday product infrastructure.

PolyAI's launch targets the gap between website traffic and real customer conversations.

It makes every website capable of answering out loud.

PolyAI helps enterprises fix slow phone support, long wait times, costly contact centers, robotic IVRs, and missed revenue from abandoned calls. Its voice agents handle customer conversations 24/7 across voice, chat, SMS, and social channels in 45+ languages. The result is faster support, lower operating cost, more consistent answers, and better customer experience at enterprise scale.

📞 PolyAI is launching 2 new voice AI products: ADK, a code-first Agent Development Kit for building production voice agents from your own IDE, and PolyPhone, which turns any website into a live voice AI agent in about 10 minutes.

ADK connects directly into Agent Studio, so developers can build, manage, and deploy agents from the terminal.

PolyPhone reads a website, understands things like FAQs and product details, then creates a voice agent that can be embedded on any webpage without needing telephony setup.

The bigger point: enterprise voice AI is moving from "contact center project" to "something teams can build and ship much faster."

