AI 摘要
除了CAISI评估外,如果NIST能作为独立评估者对AI能力进行公开测试将会很有帮助——尽管这些显然不应是预发布测试,且可以在模型公开后进行。 独立测试很重要且成本越来越高。
In addition to the CAISI evaluation, it would be useful if NIST conducted public tests of AI abilities as an independent evaluator - though those obviously should not be pre-release tests &; can be done when models are public.
Independent testing is important &; getting expensive.