更令人印象深刻的是,当成功率从50%提升到80%时,Claude Mythos与Gemini 3.1 Pro之间的差距会变得多么巨大。 Mythos不仅仅是"工作更持久"——最重要的是,它的工作准确率显著更高!这才是真正令人惊叹的部分。
What is even more impressive is just how wide the gap between Claude Mythos and Gemini 3.1 Pro becomes when moving from a 50% success rate to an 80% success rate.
Mythos doesn't just work "longer" - above all, it works significantly more accurately! That is the truly impressive part.