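Source: the-decoder.com
Summary: Max Spero, CEO of AI text detector Pangram, says the company's deep-learning classifier is a black box that identifies AI-generated text by picking up on the structural patterns language models leave behind when organizing a document. Spero notes that language models may be better than the average person at grammar and logic, but their arguments are highly homogeneous: ask an LLM for 100 arguments on a topic and they cluster in a narrow band, whereas the space of human arguments is very diverse. That sameness is a telltale sign of AI text.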
Pangram CEO says language models give themselves away by making the same arguments
If you want to fool Pangram, you'll need better arguments. That's the takeaway from an interview with Max Spero, CEO of AI text detector Pangram, published on AI Policy Perspectives.
Spero calls Pangram's deep-learning classifier a black box. "We don't have a ton of interpretability into why it makes the predictions that it does," he said. The tool surfaces suspicious phrases as clues, but the model picks up on structural patterns a language model leaves behind when organizing a document. Even Pangram doesn't fully understand those patterns.
Spero also argues that language models "might be" better than average humans at grammar and logic but are far more uniform. Ask an LLM for 100 arguments on a topic and they'll cluster in a narrow band, "whereas the space of human arguments is going to be very diverse."
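To make the "narrow band" idea concrete, here is a minimal sketch of one way to put a number on how uniform a set of arguments is. It is not Pangram's method, which Spero describes as a black-box deep-learning classifier. The function names, the choice of Jaccard word overlap as a similarity measure, and the sample sentences are all assumptions made up for illustration.

```python
import re
from itertools import combinations


def tokens(text):
    """Lowercase a string and return its set of word tokens."""
    return set(re.findall(r"[a-z']+", text.lower()))


def jaccard(a, b):
    """Jaccard similarity of two token sets: shared words / all words."""
    union = a | b
    return len(a & b) / len(union) if union else 1.0


def mean_pairwise_similarity(arguments):
    """Average Jaccard similarity over every pair of arguments.

    Values near 1.0 mean the arguments reuse the same words;
    values near 0.0 mean they share almost no vocabulary.
    """
    sets = [tokens(a) for a in arguments]
    pairs = list(combinations(sets, 2))
    if not pairs:
        raise ValueError("need at least two arguments")
    return sum(jaccard(a, b) for a, b in pairs) / len(pairs)


# Hypothetical toy inputs, invented for this sketch (not real model output).
uniform_args = [
    "remote work improves productivity and focus",
    "remote work improves focus and productivity",
    "remote work improves productivity and morale",
]
diverse_args = [
    "remote work improves productivity and focus",
    "my commute used to cost two hours a day",
    "offices exist because managers distrust people",
]

if __name__ == "__main__":
    print("uniform:", round(mean_pairwise_similarity(uniform_args), 3))
    print("diverse:", round(mean_pairwise_similarity(diverse_args), 3))
```

Word overlap is a crude proxy: it only catches repeated vocabulary, while the structural patterns Spero describes would need a far richer representation. The sketch only shows that "clustering in a narrow band" is a measurable property of a set of texts.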