AI 摘要
消息:Grok 刚刚创下有史以来最低的幻觉率,在 AA-Omniscience 基准测试中仅为 17%。 击败了: Claude → 36% Gemini → 50% ChatGPT → 89% https://t.co/jiZlwEDMbv
NEWS: Grok just posted the lowest hallucination rate ever recorded, only 17% on the AA-Omniscience benchmark.
Beating:
Claude → 36% Gemini → 50% ChatGPT → 89%