Mistral AI 发布首个区域语言模型 Mistral Saba
阅读原文· mistral.aiMistral AI 推出首个区域语言模型 Mistral Saba。这是一个参数量为 24B 的模型,基于来自中东和南亚的精选数据集进行训练。模型在提供比自身参数量大五倍的通用模型更准确的相关响应的同时,具备更快的速度和更低的成本。Mistral Saba 支持阿拉伯语及多种印度语言,在南印度语系如泰米尔语上表现尤为突出。它以 API 形式提供服务,同时也支持在客户的安全环境中进行本地部署。该模型轻量化,可在单 GPU 系统上运行,响应速度超过 150 tokens/秒。
Making AI ubiquitous requires addressing every culture and language. As AI proliferates globally, many of our customers worldwide have expressed a strong desire for models that are not just fluent but native to regional parlance. While larger, general-purpose models are often proficient in several languages, they lack linguistic nuances, cultural background, and in-depth regional knowledge required to serve use cases with strong regional context.
In such scenarios, custom-trained models tailored to regional languages can grasp the unique intricacies and insights for delivering precision and authenticity. To that end, we are proud to introduce Mistral Saba, the first of our specialized regional language models.
Mistral Saba is a 24B parameter model trained on meticulously curated datasets from across the Middle East and South Asia. The model provides more accurate and relevant responses than models that are over 5 times its size, while being significantly faster and lower cost. The model can also serve as a strong base to train highly specific regional adaptations. Mistral Saba is available as an API , but importantly, it is also available to deploy locally within the security premises of customers. Like the recently released Mistral Small 3, the model is lightweight and can be deployed on single-GPU systems, responding at speeds of over 150 tokens per second.