AI 摘要
单个神经元足以绕过大型语言模型的安全对齐设置
A Single Neuron Is Sufficient to Bypass Safety Alignment in Large Language Models
单个神经元足以绕过大型语言模型的安全对齐设置
A Single Neuron Is Sufficient to Bypass Safety Alignment in Large Language Models
单个神经元足以绕过大型语言模型的安全对齐设置
A Single Neuron Is Sufficient to Bypass Safety Alignment in Large Language Models