AI 摘要
Meta Muse Spark 模型在 FrontierMath 基准测试中,Tiers 1-3 得分 39%,Tier 4 得分 15%。该成绩与近期多款前沿模型相当,但仍落后于 GPT-5.4。
We had pre-release access to Meta's new Muse Spark model and evaluated it on FrontierMath. It scored 39% on Tiers 1-3 and 15% on Tier 4. This is competitive with several recent frontier models, though behind GPT-5.4.