面壁智能(OpenBMB)的扩散式 TTS 模型 VoxCPM-0.5B 已通过 Apple Core AI 完全部署至 iPhone 端侧,无需联网。该模型整合了 MiniCPM4 语言模型、LocDiT flow-matching 和 AudioVAE,每一层均运行于 Neural Engine 和 GPU 上。模型权重和部署代码已开源至 HuggingFace 与 GitHub。
Big thanks to @JackdeS11 for bringing VoxCPM-0.5B fully on-device to iPhone! 🎉❤️ The entire stack (MiniCPM4 + LocDiT flow-matching + AudioVAE) runs on Neural Engine and GPU, with no network required. Great work! 👍👍