MiniMax发布开源模型M3,它是首个将前沿编码能力、1M token上下文窗口与原生多模态集成于单一系统的开源模型。M3在SWE-Bench Pro上得分为59.0%,略高于GPT-5.5(58.6%)与Gemini 3.1 Pro(54.2%);在BrowseComp自主浏览任务中以83.5%领先Opus 4.7。此外,模型在Terminal Bench 2.1(66.0%)、MCP Atlas(74.2%)等基准上表现优异。其每token成本约为GPT-5.5的十二分之一,模型权重及技术报告预计在10天后发布。
MiniMax just dropped M3! It hits 59% on SWE-Bench Pro, edging out GPT-5.5 (58.6%) and beating Gemini 3.1 Pro (54.2%).
Trails Opus 4.7 on coding, but leads it on autonomous browsing at 83.5% on BrowseComp. First open model to pack frontier coding, a 1M-token context, and native multimodality into one system.
I mean, let that sink in: Roughly 12x cheaper per token than GPT-5.5, with weights and a full tech report promised in about 10 days.