当前开放权重模型与闭源实验室的性能差距维持动态平衡。在训练范式改变前,开放模型能够持续 fast-follow 闭源模型,尚无证据表明前者会落后。这一均衡取决于基准测试演变、模型实际表现与排名关联度,以及训练制度调整等因素。若闭源模型通过整合用户训练数据形成数据壁垒,或经济力量驱动战略转变,现有格局才可能被打破。
I've been trying to grapple with what the key inputs are to the open-closed performance gap, and how they're changing.
Until the training paradigm changes, open weight models will pretty clearly be able to fast-follow closed labs. There are sources of uncertainty, but that fact of keeping up seems hard to shake. I spent a long time looking for evidence of or arguments supporting open models falling behind, but it's not there at all today.
Things I consider include: - How benchmarks evolve over time, becoming more or less correlated with how people actually use models, - How different models' real-world performance relates to their benchmark rankings, and - How training regimes evolve over time to move said benchmarks.
I roughly conclude that this equilibrium will last until economic forces initiate a change in strategy, or training needs shift.