打破概率的枷锁:中立逻辑作为大语言模型认知不确定性建模的新框架
阅读原文· arxiv.org研究团队提出将中立逻辑应用于大语言模型(LLM)以建模认知不确定性。该框架将真、不确定、假视为三个独立维度,允许其值之和大于1。实验在OpenAI GPT模型家族上进行,测试了逻辑悖论等五种语言现象,并对比了三种提示策略。结果发现,中立逻辑方法在35%的评估中自发出现“超真理”状态,能更丰富地表示模型内部状态,有助于识别与量化模型冲突,为构建更透明、可靠的AI系统提供方法。
Large Language Models (LLMs) are predominantly governed by probabilistic frameworks in which the sum of outcome probabilities is constrained to unity. This architectural limitation, often imposed by Softmax layers, leads to a collapse of uncertainty that makes it difficult to differentiate between epistemic uncertainty, paradox, and vagueness. We present an empirical investigation of the application of Neutrosophic Logic, a framework that treats Truth (T), Indeterminacy (I), and Falsity (F) as three independent dimensions, to model epistemic states in LLMs. We conducted experiments on a family of four OpenAI GPT models across five linguistic phenomena: logical paradoxes, epistemic ignorance, vagueness, ethical contradictions, and future contingencies, under three prompting strategies: neutrosophic, probabilistic, and entropy-derived. Our findings reveal that the neutrosophic approach, by allowing T+I+F > 1, a state we term hyper-truth, provides a richer representation of a model's internal state. In 35% of evaluations, hyper-truth emerged spontaneously, predominantly under ethical contradiction and logical paradox. We demonstrate that this approach preserves truth values in fuzzy contexts and offers a robust method for identifying and quantifying internal model conflict. We conclude that the integration of neutrosophic evaluation layers is a critical step toward more transparent, reliable, and ethically aware AI systems.