Nathan Lambert (@natolambert)

2026-04-19 02:40 · 75 days ago

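AI summary

A major problem here is that we have not clearly defined what Mythos capabilities actually are. A model that matches each benchmark in the launch blog post? Sure. But a model that can be swapped directly into the same use cases with no drop in performance? I doubt it.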

A big problem with this is that we don't really have a clear description of what mythos capabilities are.

A model on each of the benchmarks in the launch blog post, sure.

A model that you can swap right in for the same use-cases and notice no drop in perf? Doubt it.

rohit: Dario seems to think China and open source will hit Mythos capabilities in 6-12 months
