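StepAudio 2.5 Realtime is a real-time voice model that picks up paralinguistic cues in the user's speech, including tone, pace, pauses, and even micro-emotions. It supports custom personas through the API, letting you set personality, backstory, and language style, and it offers more than 10,000 native personas that combine into millions of feature combinations. It also ships with 5 preset personas ready to try, and it is RLHF-tuned to stay in character even under demanding roleplay stress tests. The model supports Chinese and English.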
StepAudio 2.5 Realtime is live!
Real-time voice that picks up what you actually mean - tone, pace, pauses, sighs, even the half-laugh mid-sentence.
⚡ Top-tier paralinguistic perception - reads tone, pace, micro-emotions
⚡ Bring-your-own persona via API - personality, backstory, quirks, language style
⚡ 10,000+ native personas → millions of feature combinations
⚡ 5 preset personas to try out of the box
⚡ ZH/EN
RLHF-tuned to hold character even under roleplay stress tests.
Try it → https://www.stepfun.com/studio/audio?tab=voice-chat
Model card: https://stepaudiollm.github.io/step-audio-2.5-realtime/