# SynCred-Bench：AI生成视觉错误信息的合成可信度基准测试

- 来源：HuggingFace Daily Papers（社区热门论文）
- 发布时间：2026-06-02 08:00
- AIHOT 分数：56
- AIHOT 链接：https://aihot.virxact.com/items/cmpyrtpw102q7sli3zcro6z2b
- 原文链接：https://arxiv.org/abs/2606.03348

## AI 摘要

SynCred-Bench是一个包含600张AI生成错误信息图像的基准测试，覆盖6种可信形式类别和7种细粒度传播风格，并配有FP450真实图像负集。评估显示，在5%假阳性率约束下，现有系统表现不可靠：15个多模态大语言模型仅达10.5%真阳性率，开源AIGC检测器不足5%，商业API达57.6%，人类标注者也仅识别出63%样本。这揭示了合成可信度作为严峻且尚未充分研究的视觉错误信息挑战。

## 正文

Recent generative models can now produce visual artifacts with realistic embedded text and layouts, creating a new misinformation threat: synthetic credibility. We introduce SYNCRED-Bench, a benchmark of 600 AI-generated misinformation images balanced across six credible-form categories and seven fine-grained circulation styles, together with FP450, a real-image negative set for measuring false positives. Extensive evaluation shows that existing systems remain unreliable: under a 5% false-positive-rate constraint, 15 MLLMs achieve only 10.5% true positive rate (TPR), open-source AIGC detectors achieve less than 5%, and commercial APIs reach 57.6%. Human annotators also struggled to identify synthetic credibility, reaching only 63% TPR. These findings establish synthetic credibility as a severe and underexplored visual misinformation challenge, and provide a benchmark for developing detectors that reason beyond superficial credibility cues.
