AI聊天机器人正成为健康咨询的重要入口,但新证据表明,真实人机对话对医疗准确性的破坏远超预期。研究显示,在控制测试中AI准确率可达95%,但面对真实用户混乱、不完整、分心的症状描述时,准确率骤降至35%。医疗建议领域存在极高敏感性,细微的措辞变化可能导致建议从"居家休息"翻转为"立即就医",凸显当前AI医疗应用在实际场景中的重大风险。
BBC Published an article.
AI chatbots are becoming a real front door for health advice, but new evidence says human-AI conversation breaks their medical accuracy far more than most people realize.
The problem is not that these systems always fail when they see a full, neatly written case, because in controlled testing they reached about 95% accuracy.
The problem is that real people give messy, partial, distracted symptom descriptions, and in that setting accuracy dropped to about 35%
In the area of medical advice, a tiny wording change can flip advice from "rest at home" to "go to hospital now,
---
bbc .com/news/articles/clyepyy82kxo