AI summary
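Using the AA-Briefcase evaluation from @ArtificialAnlys (which has AI carry out multi-week consulting tasks), @emollick plotted frontier curves for open and closed models, showing surprisingly rapid progress and a clear gap between open-weight and closed models.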
Even though I made this graph, it is also kind of wrong. Fable is guardrailed Mythos. If we use the Mythos date
I took the new AA-Briefcase scores from @ArtificialAnlys (basically having the AI do multi-week consulting gigs with a lot of complexity) and graphed the fronti...