(评论)
(comments)
原始链接: https://news.ycombinator.com/item?id=39499207
虽然这似乎违反直觉,但他们的实验表明,在应用他们提出的方法后,幻觉显着减少。 然而,“真”和“假”输出之间的区别变得模糊,因为基本事实本身并不完全真实。 尽管如此,如果法学硕士能够准确识别没有可靠答案的情况,那么它就可以通过再培训或校准来提供提高幻觉率的见解。 正如作者指出的,最终目标不是完全消除幻觉,而是显着降低幻觉的发生率。 同样,无幻觉法学硕士对于有用目的既不是必要的,也不是充分的。 最终,最重要的是为特定应用程序或领域实现令人满意的性能。 关于语言和意义的讨论,我们的想法是,当前的法学硕士框架过度简化了语言使用和解释之间的关系。 虽然模型的输出与输入相当吻合,特别是在统计一致性方面,但它无法捕捉关键的细微差别和微妙之处。 这些挑战部分源于建模假设,特别是假设语言纯粹由表面形式或标记上的相关信号组成。 尽管这些模型取得了令人印象深刻的成就,但它们忽略了意义的基本方面,包括语用学、话语和参考解析。 其结果是语言上足够的输出和解释意义之间存在相当大的差距,使得解释在规模上具有挑战性。 因此,旨在开发更丰富的语言和意义表示的努力应该优先考虑与解释意义的一致性,而不是仅仅表面形式或标记。
I think this is a clever point and an interesting philosophical question (about math, computer science, and language), but I think people are mostly trying to apply this using our commonsense notions of "LLM hallucination" rather than the formal notion they use in this paper, and I don't see an obvious connection, since commonsense hallucinations (eg inventing chapters of a novel when asked to produce summaries, inventing specific details when asked) don't seem to be NP-complete problems but rather are hallucinatory for some other interesting reason. (I apologize if I have not captured the paper correctly and would welcome correction on that, I read it quickly)
The statement that the formal world (the world of math and logic and formal grammars) is a subset of the "real" world (or perhaps, the world of natural language) is really interesting to me as well. Most humans can't solve formal logic problems and parse formal grammars but don't suffer from a (strong) hallucination effect, and can work in natural language in great proficiency. Is hallucination inevitable in humans since we also can't solve certain NP-complete problems? We have finite lifespans, after all, so even with the capabilities we might never complete a certain problem.
reply