谷歌、微软和xAI已同意在美国商务部机构CAISI的测试下,提前评估其前沿AI模型。测试的特殊之处在于,公司将提供降低或移除安全护栏的模型版本,以便评估其在协助网络入侵、恶意软件规划等高危任务上的原始能力与风险。此前,OpenAI和Anthropic已于2024年达成类似协议。此举背景是白宫正考虑建立针对主要AI模型的政府审查流程,审查重点是其网络能力——即发现和利用软件漏洞以改变现实安全风险的水平。政策转向的触发点是Anthropic的Mythos模型,该公司认为该模型在发现安全漏洞方面能力过强,广泛发布风险过高。
Google, Microsoft and xAI just agreed to let the U.S. government test early frontier AI models before the public can use them.
The testing will be run by CAISI, a Commerce Department group that checks what advanced models can do and where they may create security risk.
The unusual part is that the companies will share versions with reduced or removed guardrails, which lets testers see the model's raw ability instead of only its polished public behavior.
Becasue, a national security test asks whether the model can help with cyber intrusion, malware planning, or other high-risk tasks when its filters are weakened.