AI 摘要
Anthropic "Mythos" 模型在基准测试中表现极强,证明模型扩展(scaling)尚未触及天花板;但更强性能伴随极高训练与推理成本,其出色表现很大程度上源于昂贵的配置投入。
some important takeaways from anthropic's "mythos" model:
1) the model is extremely strong across benchmarks, so scaling has not hit a wall
2) but better scaling also brings much higher training and inference costs, and its setup was strong partly because it's expensive