蚂蚁 inclusionAI 发布 Ling-2.6-flash-base 基础模型

2026-06-02 17:55·30天前

AI 摘要

Ling-2.6-flash-base 是蚂蚁 inclusionAI 发布的基础模型，采用闪速规模 MoE 与混合线性注意力架构（7:1 融合 Lightning Attention 与 MLA），总参数量约 104B、激活约 7.4B。模型从 Ling-2.0 检查点改造而来，经约 9.6T token 的迁移预训练、继续预训练和中段训练，上下文窗口从 4K 扩展至 256K。在知识、推理、数学、代码和长上下文基准上相比前代均有提升（如 MMLU 84.13，GSM8K 91.89）。该模型面向研究用途开放，支持继续预训练、微调和蒸馏，未经聊天对齐。

原文 · 未翻译

🤗 Hugging Face | 🤖 ModelScope | Tech Report

Ling-2.6-flash-base

Ling-2.6-flash-base is the base checkpoint behind the Ling-2.6-flash model. It is a flash-scale Mixture-of-Experts language model retrofitted from the Ling-2.0 base checkpoint with a hybrid linear attention design, continued pre-training, and long-context mid-training.

This release is intended for research, continued pre-training, distillation, and supervised or preference-based fine-tuning. It is not a chat-aligned assistant model. If you want an out-of-the-box instruction model, use the corresponding post-trained Ling-2.6-flash checkpoint instead.

Model Overview

Ling-2.6-flash-base is designed for efficient instant-response modeling with stronger long-context efficiency than the previous GQA-based Ling-2.0 generation. The core upgrade is a hybrid attention retrofit that combines Lightning Attention with MLA in a 7:1 ratio, together with a smooth migration pipeline from the original architecture.

Ling-2.6 base models are trained through approximately 9.6T tokens across migration pre-training, continued pre-training, and mid-training, with staged context extension from 4K to 256K. Ling-2.6-flash-base serves as the base checkpoint for the post-trained Ling-2.6-flash instant model.

Key Features

Hybrid linear attention architecture combining Lightning Attention and MLA in a 7:1 ratio

Flash-scale MoE backbone optimized for efficient serving and high token efficiency

Long-context training pipeline extended to 256K context during mid-training

Continued pre-training mixture covering agentic data, long-context data, knowledge-rich web data, math, code, and multilingual corpora

Strong base-model quality across knowledge, math, code, reasoning, and long-context understanding benchmarks

蚂蚁 inclusionAI：HuggingFace 新模型

54导出 Markdown

蚂蚁 inclusionAI 发布 Ling-2.6-flash-base 基础模型

2026-06-02 17:55·30天前

阅读原文· huggingface.co

AI 摘要

原文 · 保持原样，未翻译

🤗 Hugging Face | 🤖 ModelScope | Tech Report

Ling-2.6-flash-base