# 与英国 AISecurityInst 及 turinginst 的新研究：少量恶意文档即可在 LLM 中制造漏洞，数据投毒攻击或比预想更可行

- 来源：Anthropic (@AnthropicAI)
- 发布时间：2025-10-10 00:28
- AIHOT 链接：https://aihot.virxact.com/items/cmnw1yoz200n7slc37prxmbtd
- 原文链接：https://x.com/AnthropicAI/status/1976323781938626905

## AI 摘要

联合研究发现，仅需少量恶意文档就能在 LLM 中植入安全漏洞，且不受模型规模或训练数据量影响。这表明数据投毒攻击的实施门槛可能比此前认为的更低，实际威胁被低估。

## 正文

New research with the UK @AISecurityInst and the @turinginst：

We found that just a few malicious documents can produce vulnerabilities in an LLM-regardless of the size of the model or its training data.

Data-poisoning attacks might be more practical than previously believed.