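xAI's Grok 4.3 scores 53 on the Artificial Analysis Intelligence Index, with input costs about 40% lower and output costs about 60% lower than Grok 4.20, giving it strong price-performance. Its biggest highlight is on real-world agentic tasks (GDPval-AA), where its Elo score jumped 321 points to 1500, surpassing models such as Gemini 3.1 Pro Preview and Muse Spark, though it still trails GPT-5.5 by a wide margin. The model performs strongly on instruction following and customer-service tasks; on the Omniscience benchmark, its accuracy improved but its hallucination rate also rose. Overall, Grok 4.3 achieves a higher Intelligence Index score at lower cost, making it one of the more cost-efficient models at its intelligence tier.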
Grok 4.3 is a very good model, especially when you consider that it's only 500M parameters!
xAI's Grok 4.3 scores 53 on the Artificial Analysis Intelligence Index with ~40% lower input and ~60% lower output pricing vs Grok 4.20, making it one of the most cost-efficient models at its intelligence tier. Biggest gain: a 321-point Elo jump on real-world agentic tasks (GDPval-AA), though it still trails GPT-5.5 by a wide margin.