StepFun@StepFun_ai

2026-05-24 05:45·40天前

AI 摘要

StepAudio 2.5 Realtime是一款实时语音模型，能够深度理解用户语音中的语气、语速、停顿乃至微表情等副语言特征。它支持通过API接入自定义人格，允许设定个性、背景故事和语言风格，并提供了上万种原生人格选项，可组合出数百万种特征。产品还内置了5个可直接体验的预设人格，并经过RLHF调优，确保在复杂的角色扮演压力测试中也能保持角色一致性。该模型支持中文和英文。

StepAudio 2.5 Realtime is live！

Real-time voice that picks up what you actually mean - tone， pace， pauses， sighs， even the half-laugh mid-sentence.

⚡ Top-tier paralinguistic perception - reads tone， pace， micro-emotions ⚡ Bring-your-own persona via API - personality， backstory， quirks， language style ⚡ 10，000+ native personas → millions of feature combinations ⚡ 5 preset personas to try out of the box ⚡ ZH/EN

RLHF-tuned to hold character even under roleplay stress tests.

Try it → https://www.stepfun.com/studio/audio?tab=voice-chat Model card： https://stepaudiollm.github.io/step-audio-2.5-realtime/