StepFun (@StepFun_ai) · X

2026-05-29 08:00

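AI summary

StepFun (阶跃星辰) has released Step 3.7 Flash, an open-source large model focused on the efficiency of agent workflows. It ranks first on ClawEval-1.1 (67.1) and SimpleVQA Search (79.2). Its architecture is a 198B-parameter MoE with about 11B active parameters, and it supports a 256K context. The model has multimodal understanding: it can process images and documents, then generate code or call tools to carry out tasks. For tool use it aims at high reliability, scoring above 98% on τ²-bench. Step 3.7 Flash is compatible with toolchains such as Claude Code and the MCP protocol, and can run locally on devices such as the Mac Studio M4 Max. The model weights are open-sourced under the Apache 2.0 license.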

⚡️ Step 3.7 Flash is here: The new frontier is agent efficiency.

#1 ClawEval-1.1 (67.1), #1 SimpleVQA Search (79.2), #2 SWE-PRO (56.3), 95.3 on V* Python. Open weights under Apache 2.0.

Built for agentic, coding, search, and multimodal workflows - balancing speed, cost, and reliable execution.

400 TPS. 198B sparse MoE, ~11B active. 256K context, 3 reasoning levels.
Understands UIs, charts, docs, images - then writes code or calls tools to act on what it sees.
Web + visual search reaches further: more sources, deeper follow-up.
Reliable tool use - less drift, fewer broken tool calls. 98%+ on τ²-bench across all difficulty levels.
Works with Claude Code, KiloCode, Hermes Agent, OpenClaw, and protocols like MCP.
Runs locally on Mac Studio M4 Max, DGX Spark, AMD AI Max+ 395.
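
For a rough sense of what the headline numbers imply, here is a back-of-envelope sketch in Python. It uses only the figures quoted above (198B total, ~11B active, 400 TPS). Everything else is an assumption made for illustration: the bit-widths are generic quantization levels, not formats confirmed for this model; the estimate counts weights only and ignores KV cache and runtime overhead; and 400 TPS is treated as a steady output rate, which the post does not specify.

```python
# Back-of-envelope arithmetic from the figures quoted in the post.
# Assumptions (not from the post): the bit-widths below, weights-only
# memory, and a constant 400 tokens/second output rate.

TOTAL_PARAMS = 198e9      # "198B sparse MoE"
ACTIVE_PARAMS = 11e9      # "~11B active" (approximate in the post)
TOKENS_PER_SECOND = 400   # "400 TPS"


def active_fraction(total: float, active: float) -> float:
    """Share of parameters used per token in a sparse MoE."""
    return active / total


def weight_gb(params: float, bits_per_weight: int) -> float:
    """Weights-only footprint in decimal gigabytes (1 GB = 1e9 bytes)."""
    return params * bits_per_weight / 8 / 1e9


def seconds_to_generate(n_tokens: int, tps: float) -> float:
    """Time to emit n_tokens at a constant tokens-per-second rate."""
    return n_tokens / tps


if __name__ == "__main__":
    frac = active_fraction(TOTAL_PARAMS, ACTIVE_PARAMS)
    print(f"active share per token: {frac:.1%}")
    for bits in (16, 8, 4):  # hypothetical precision levels
        print(f"{bits:>2}-bit weights: {weight_gb(TOTAL_PARAMS, bits):.0f} GB")
    print(f"2,000 tokens at 400 TPS: {seconds_to_generate(2000, TOKENS_PER_SECOND):.0f} s")
```

Under these assumptions, about one parameter in eighteen is active per token, and the weights-only footprint halves with each halving of the bit-width. Real memory use on any of the listed devices would be higher once context and runtime overhead are included.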

GitHub: http://github.com/stepfun-ai/Step-3.7-Flash
HuggingFace: http://huggingface.co/stepfun-ai/Step-3.7-Flash
GGUF: http://huggingface.co/stepfun-ai/Step-3.7-Flash-GGUF
ModelScope: http://modelscope.cn/models/stepfun-ai/Step-3.7-Flash
API: http://platform.stepfun.ai
Blog: http://static.stepfun.com/blog/step-3.7-flash/

Agents · Multimodal · Open-source ecosystem · Inference
