AI 实验室在发布每个新模型时,除了 model card 还应该发布一份新文档:一种类似 changelog 的东西 我想看到新模型在一系列具体任务上相比早期模型如何以及在哪些方面发生变化、失效或改进。越来越重要!
We need a new document that AI labs should release with each new model, besides the model card: a sort of changelog
I want to see how &; in what way the new model changes, breaks, or improves at a range of individual tasks compared to the earlier models. Increasingly important!