Chubby♨️@kimmonismus

2026-06-01 23:42 · 31 days ago

AI Summary

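MiniMax has released M3, an open-source model and the first open model to integrate frontier coding ability, a 1M-token context window, and native multimodality in a single system. M3 scores 59.0% on SWE-Bench Pro, slightly ahead of GPT-5.5 (58.6%) and ahead of Gemini 3.1 Pro (54.2%); on the BrowseComp autonomous browsing task it leads Opus 4.7 with 83.5%. The model also performs well on benchmarks such as Terminal Bench 2.1 (66.0%) and MCP Atlas (74.2%). Its per-token cost is roughly one-twelfth that of GPT-5.5, and the model weights and technical report are expected to be released in 10 days.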

MiniMax just dropped M3! It hits 59% on SWE-Bench Pro, edging out GPT-5.5 (58.6%) and beating Gemini 3.1 Pro (54.2%).

Trails Opus 4.7 on coding, but leads it on autonomous browsing at 83.5% on BrowseComp. First open model to pack frontier coding, a 1M-token context, and native multimodality into one system.

I mean, let that sink in: roughly 12x cheaper per token than GPT-5.5, with weights and a full tech report promised in about 10 days.

MiniMax (official): Introducing MiniMax M3: The First Open-Weights Model to Combine Three Frontier Capabilities - Coding & Agentic Frontier: 59.0% SWE-Bench Pro, 66.0% Terminal Ben...

Agents · Multimodal · Open-source ecosystem · Model release
