Chubby♨️@kimmonismus

2026-05-01 14:57·62天前

AI 摘要

xAI发布的Grok 4.3模型在Artificial Analysis Intelligence Index上获得53分，相比Grok 4.20输入成本降低约40%，输出成本降低约60%，性价比突出。其最大亮点是在真实世界代理任务（GDPval-AA）上的ELO评分跃升321点至1500，超越了Gemini 3.1 Pro Preview和Muse Spark等模型，但仍大幅落后于GPT-5.5。该模型在指令遵循和客服任务上表现强劲，同时在Omniscience基准上准确率提升但幻觉率增加。总体而言，Grok 4.3以更低成本实现了更高的智能指数得分，成为同智能层级中成本效益较高的模型之一。

Grok 4.3 is a very good model especially when you think its only 500m parameters！

xAI's Grok 4.3 scores 53 on the Artificial Analysis Intelligence Index with ~40% lower input and ~60% lower output pricing vs Grok 4.20， making it one of the most cost-efficient models at its intelligence tier. Biggest gain： a 321-point Elo jump on real-world agentic tasks （GDPval-AA）， though it still trails GPT-5.5 by a wide margin.

Artificial AnalysisxAI has launched Grok 4.3, achieving 53 on the Artificial Analysis Intelligence Index with improved agentic performance, ~40% lower input price, and ~60% lower ...

xAI 推理模型发布

在 X 查看原推导出 Markdown

Chubby♨️@kimmonismus · X

57导出 Markdown

2026-05-01 14:57·62天前

在 X 看原推· x.com

AI 摘要

Grok 4.3 is a very good model especially when you think its only 500m parameters！