OpenRouter@OpenRouter

2026-06-29 01:22·4天前

AI 摘要

提示：OpenRouter 持续在大多数开源权重模型上运行 GPQA 和 TAU-Bench 评测，并公开发布结果。这些结果用于构建我们的 AutoExacto 元基准，在路由工具调用时默认使用。以下，@Parasail_io 和 @Zai_org 排名第一：https://openrouter.ai/z-ai/glm-5.2#performance

Tip： OpenRouter continuously runs GPQA and TAU-Bench on most open-weight models and publishes the results publicly.

This informs our AutoExacto meta-benchmark， used by default when routing tool calls.

Here， @Parasail_io and @Zai_org rank first： https://openrouter.ai/z-ai/glm-5.2#performance

MCP/工具产品更新推理

在 X 查看原推导出 Markdown

OpenRouter@OpenRouter · X

61导出 Markdown