如果大型语言模型具有人类特征,那么《帝国时代 II》也一样。
If LLMs Have Human-Like Attributes, Then So Does Age of Empires II

原始链接: https://arxiv.org/abs/2605.31514

在《如果大语言模型具有类人属性,那么《帝国时代II》也一样》一文中,Adrian de Wynter 质疑了将道德或真正理解力等人类特质归因于大语言模型(LLM)的倾向。 该论文认为,这些感知到的属性并非大语言模型所独有;通过证明《帝国时代II》是图灵完备的,作者说明了任何足够强大的系统都能产生可能被观察者误读为“类人”的行为。作者指出,如果没有严谨且独立于底层架构的衡量标准,将这些特质归于人工智能只会导致循环论证或毫无意义的结论。 最终,该研究提出,对人工智能行为的解读往往是观察者的主观投射,而非模型固有的属性。为了推动该领域的发展,de Wynter 提出了一个“零假设”框架,主张优先考虑大语言模型的非独特性,而非人类中心主义的偏见。通过摒弃类人标签,研究人员可以开展更具实证基础的实验,并将系统运行的底层架构纳入考量。

相关文章

原文

View a PDF of the paper titled If LLMs Have Human-Like Attributes, Then So Does Age of Empires II, by Adrian de Wynter

View PDF
Abstract:Much research has been carried out on large language models (LLMs) and LLM-powered agentic workflows. However, many works within the field state emergence of, ascribe to, or assume, generalised anthropomorphic attributes to them (e.g., morality or understanding of natural language). Our goal is not to argue in favour or against the existence of these attributes, but to point out that these conclusions could be incorrect. For this we build and train a simple neural network on the videogame Age of Empires II, and note that any entity in a sufficiently-powerful substrate, such as LEGO or the Greater Boston Area, could also present such attributes. Hence, the purported anthropomorphic attributes of LLMs are empirically non-unique: although some properties (e.g., responses to prompts) could remain constant, others, such as the interpretation of their perceived behaviour, might change with the substrate. Thus, any empirically-grounded discussion requires explicit measurement criteria; otherwise the interpretation is left to the representation. We then show that assuming that these attributes exist or not in a system, independent of the substrate and in a generalised way, leads to either circular or uninformative conclusions, regardless of the experimenter's viewpoint on the subject. Finally we propose a 'null' assumption, where one assumes LLM non-uniqueness instead of assuming anthropomorphic attributes to set up an experiment, along with examples of it. We also discuss potential objections to our work, briefly survey the field, and prove that Age of Empires II is functionally- and Turing-complete.
From: Adrian de Wynter [view email]
[v1] Fri, 29 May 2026 16:31:31 UTC (13,704 KB)
[v2] Mon, 1 Jun 2026 21:31:22 UTC (13,705 KB)
联系我们 contact @ memedata.com