# Inworld、ElevenLabs 与 MiniMax 继续领跑 TTS 排行榜

- 来源：Artificial Analysis (@ArtificialAnlys)
- 发布时间：2026-03-25 23:32
- AIHOT 链接：https://aihot.virxact.com/items/cmnw1ypby00ohslc3ehjlymvm
- 原文链接：https://x.com/ArtificialAnlys/status/2036828667981234318

## AI 摘要

Inworld、ElevenLabs 与 MiniMax 继续领跑 TTS 排行榜，今年发布的模型包揽前五中的四席。当前领先模型在简单文本上逼真度显著提升，用户偏好差异主要体现在声音风格选择上。评估方法已加强机器人投票过滤，并新增基于95%置信区间的排名范围。具体指标方面，Inworld TTS 1.5 Max 以1,238 Elo分居首，Kokoro 82M v1.0以$0.65/百万字符成为价格最低选项，WaveNet则以每秒419字符领先批处理速度。

## 正文

Inworld， ElevenLabs， and MiniMax continue to lead our Text to Speech leaderboard for most preferred models

Recent checkpoints from each of the labs continue to push the frontier of TTS quality， with 4 out of the top 5 models being released this year. Leading TTS models are increasingly realistic， particularly on relatively straightforward text， with preference differences increasingly coming down to affinity for different voices.

Latest results also reflect stronger bot vote filtering， confirmed via triangulation against third-party evaluators. We've also added rank ranges based on each model's 95% confidence interval， showing where a model could land based on its Elo score range.

Key results：
➤ Most preferred： Current top 5 per our TTS leaderboard： 1. Inworld TTS 1.5 Max （Elo of 1，238）； 2. ElevenLabs Eleven v3 （1，197）； 3. Inworld TTS 1 Max （1，183）； 4. Inworld TTS 1.5 Mini （1，182）； 5. MiniMax Speech 2.8 HD （1，175）
➤ Price： Kokoro 82M v1.0 （Replicate） leads at $0.65 per 1M characters， followed by Inworld TTS 1 and 1.5 Mini at $5， and AsyncFlow V2 at $8.33
➤ Speed： WaveNet leads for batch generation at 419 characters processed per second， followed by Kokoro 82M v1.0 （Replicate） at 235， and Inworld TTS 1.5 Mini at 214

See below for further detail ⬇️
