# Microsoft发布MAI-Transcribe-1.5语音转录模型

- 来源：Artificial Analysis (@ArtificialAnlys)
- 发布时间：2026-06-03 02:31
- AIHOT 分数：64
- AIHOT 链接：https://aihot.virxact.com/items/cmpwzozu9029nsl79bpjkmmp2
- 原文链接：https://x.com/ArtificialAnlys/status/2061878491860324402

## AI 摘要

微软AI发布了MAI-Transcribe-1.5语音转录模型。该模型在AA-WER排行榜上位列第三，词错误率（WER）为2.4%，仅次于阿里巴巴的Fun-Realtime-ASR-preview（1.7%）和ElevenLabs Scribe v2（2.2%）。其主要特点是速度极快，处理速度约为276倍实时，是准确率前十模型中第二快模型速度的两倍以上，因此在准确率-速度帕累托前沿上处于领先地位。模型还支持关键词偏差识别，并涵盖包括英语、法语、阿拉伯语、日语和中文在内的43种语言。

## 正文

Microsoft has released MAI-Transcribe-1.5： an exceptionally fast speech transcription model at a speed factor of ~276x， while still achieving 2.4% on AA-WER （#3）， leading the accuracy-speed Pareto frontier

MAI-Transcribe-1.5 is Microsoft AI （MAI）'s latest speech transcription model， coming in at 3rd overall on the on the Artificial Analysis Word Error Rate （AA-WER） leaderboard， behind Alibaba's Fun-Realtime-ASR-preview （1.7% WER）， and ElevenLabs Scribe v2 （2.2% WER）. The model stands out as the fastest STT model in the top 10 for accuracy， processing audio at ~276x real-time - this is more than double the speed of the second fastest model in the top 10 for accuracy.

The new model supports keyword biasing （improved recognition of rarer vocabulary such as names and medical terminology）， in addition to support for 43 languages including English， French， Arabic， Japanese， and Chinese.

See more details below ⬇️
