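Custom Voices and Voice Library
xAI's "voice cloning plus management" update is genuinely practical. The safety verification is thorough, and content creators and brands will probably like it. For developers it is a plus, but not the kind of release that changes the landscape.
xAI launched Custom Voices and the Voice Library on April 30, 2026. Users can clone a voice from about one minute of recorded speech and use it right away in the Grok Text to Speech and Voice Agent APIs; the whole process takes under two minutes. The Voice Library provides a central place to manage voices, with more than 80 built-in voices covering 28 languages. For safety, the system uses two-stage verification, consisting of real-time transcript matching and a speaker-embedding check, to prevent unauthorized cloning. The features suit brand agents, content creation, accessibility, multilingual teams, and gaming and entertainment, and using a custom voice carries no extra charge.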
Apr 30, 2026
Custom Voices and Voice Library
Your voice. Your brand. Clone a voice from a short recording and manage your entire voice catalog from the xAI console.
Today, we're introducing Custom Voices. Clone your voice from a few seconds of audio and use it instantly across Grok Text to Speech and Voice Agent APIs.
Audio sample: Tyler, SpaceX Broadcast Host (original and cloned versions)
Alongside Custom Voices, the new Voice Library gives your team a single place to browse, preview, and manage all your voices from the xAI console.
Use Cases
Custom Voices unlock a new class of applications.
Brand Voice Agents
Give your customer support agent a consistent, recognizable voice that matches your brand identity, not a generic preset.
Content Creators
Narrate videos, podcasts, and social posts in your own voice at scale, without re-recording every time.
Accessibility
Create personalized voices for individuals who have lost the ability to speak, preserving their vocal identity.
Multilingual Teams
Deliver your CEO's keynote in every major language, sounding natural in English, Spanish, French, German, Chinese, Japanese, and more.
Gaming & Entertainment
Bring characters to life with unique voices without scheduling studio time for every line of dialogue.
Podcasts & Audiobook Narration
Make your narrative engaging. Turn scripts into full audiobooks narrated in your own voice, chapter by chapter, without stepping into a studio.
Custom Voices
Clone your voice in under two minutes. Use it everywhere.
Record about a minute of natural speech in the xAI console. Our pipeline verifies you're the voice owner, processes your recording, and delivers a production-ready voice model, all in under two minutes. Your custom voice inherits every TTS capability: speech tags, multilingual output, and both REST and WebSocket streaming.
Custom voices work everywhere our built-in voices do. Pass the voice_id to any TTS endpoint or use it with the Voice Agent API for real-time conversational agents.
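As a rough illustration of passing a custom `voice_id` to a TTS endpoint, here is a minimal Python sketch that only builds the HTTP request and does not send it. The endpoint URL, the JSON field names (`text`, `voice_id`), and the bearer-token header are assumptions made for illustration; none of them come from this post, so check the Custom Voices docs for the actual request shape.

```python
import json
import urllib.request

# Placeholder endpoint: NOT the real xAI URL. Substitute the one from the docs.
TTS_URL = "https://api.example.invalid/v1/tts"


def build_tts_request(text: str, voice_id: str, api_key: str) -> urllib.request.Request:
    """Build (but do not send) a POST request asking for speech in a given voice.

    The field names "text" and "voice_id" are assumed for illustration only.
    """
    if not text:
        raise ValueError("text must not be empty")
    payload = json.dumps({"text": text, "voice_id": voice_id}).encode("utf-8")
    return urllib.request.Request(
        TTS_URL,
        data=payload,
        method="POST",
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
    )


if __name__ == "__main__":
    # Hypothetical voice ID and key, shown only to demonstrate the call.
    req = build_tts_request("Hello from my cloned voice.", "my-custom-voice-id", "YOUR_API_KEY")
    print(req.get_method(), req.full_url)
    print(req.data.decode("utf-8"))
```

Under these assumptions, switching from a built-in voice to a custom one changes only the `voice_id` value; the rest of the request stays the same.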
Voice Safety
Every custom voice goes through a two-stage verification process before it can be created. First, the speaker reads a verification phrase that our STT engine transcribes and matches in real time, confirming intent and presence. Then we compute speaker embeddings from the verification clip and the full recording to confirm they belong to the same person.
You can't clone a voice from a pre-existing recording, and you can't clone someone else's voice.
Passphrase Check
Read a verification phrase aloud. Our STT engine transcribes and matches it in real time, verifying your consent and presence.
Speaker Similarity
Speaker embeddings from the passphrase and the full recording are compared to confirm they belong to the same person.
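The two checks described above can be sketched roughly as follows. This is not xAI's implementation: the post does not say how the transcript is matched or how embeddings are compared, so the exact-match-after-normalization rule, the use of cosine similarity, and the 0.75 threshold are all assumptions. The embeddings themselves would come from a speaker-encoder model, which is not shown here.

```python
import math
import re
from typing import Sequence


def normalize(text: str) -> str:
    """Lowercase, drop punctuation, and collapse whitespace."""
    return " ".join(re.sub(r"[^\w\s]", "", text.lower()).split())


def passphrase_matches(expected: str, transcript: str) -> bool:
    """Stage 1 (assumed rule): the STT transcript must equal the passphrase."""
    return normalize(expected) == normalize(transcript)


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine of the angle between two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("embeddings must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def same_speaker(
    passphrase_emb: Sequence[float],
    recording_emb: Sequence[float],
    threshold: float = 0.75,  # assumed value, for illustration only
) -> bool:
    """Stage 2 (assumed rule): embeddings must be similar enough."""
    return cosine_similarity(passphrase_emb, recording_emb) >= threshold


def verify(
    expected: str,
    transcript: str,
    passphrase_emb: Sequence[float],
    recording_emb: Sequence[float],
) -> bool:
    """A voice is accepted only if both stages pass."""
    return passphrase_matches(expected, transcript) and same_speaker(
        passphrase_emb, recording_emb
    )
```

Requiring both stages is what blocks the two failure cases the post names: a pre-existing recording would not contain the freshly issued passphrase, and someone else's recording would not match the embedding of the person who read it.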
Voice Library
The Voice Library is a new section in the xAI console that organizes every voice available to your team, with your custom creations alongside our built-in voices. Browse, preview, and manage voices from a single page.
We've expanded our built-in voice catalog to over 80 voices across 28 languages. Listen to any voice across different scenarios before choosing one for your application.
There is no extra charge to use Text to Speech or Voice Agent APIs with custom voices.
Languages shown: Multilingual, Arabic, Danish, German, English, Spanish, Finnish, French, Hindi, Italian, Japanese, Korean, Dutch, Polish, Portuguese, Russian, Swedish, Thai, Turkish, Vietnamese, Chinese