各大主流大语言模型的政治倾向一览
Political bias in AI: Where the AI models stand

原始链接: https://trakkr.ai/bias

“人工智能的政治偏见”是一项旨在衡量主流人工智能模型在处理复杂的社会、经济和政治问题时所持意识形态定位的研究项目。通过使用开放式题库对模型进行测试(并禁用网络搜索功能),该项目建立了一份透明且基于数据的档案,用以呈现模型在不受外部网络影响的情况下如何做出独立回应。 与将模型视为单一数据点的其他研究不同,本项目将模型描绘为“云状分布”,涵盖了回应的差异性、运行的稳定性以及拒绝回答率。所有研究方法、评分权重和原始数据集均经过版本控制并公开供下载,以确保结果的可复现性。 至关重要的是,该项目是描述性的而非规范性的。它避免对何种政治观点“正确”持立场,并刻意避免使用党派色彩编码(如美国的红/蓝阵营)。通过分析模型的内部权重而非互联网来源信息,本项目为理解人工智能系统内嵌的固有偏见提供了一个客观基准。

Hacker News 上的一场讨论对 Trakkr.ai 项目提出了批评,该项目试图映射各大语言模型的政治倾向。 评论者对该项目的方法论表示了严重怀疑,认为政治坐标系在捕捉人类或人工智能政治的细微差别方面表现不佳。许多人指出,大语言模型缺乏连贯、固定的信念系统,很容易通过“引导”(priming)来反映用户的立场。 具体的批评集中在项目的数据准确性上;参与者指出了该项目对政党(例如德国自民党和左翼党)以及国际领导人(例如将埃马纽埃尔·马克龙排在比习近平更靠右的位置)的明显错误表述,这表明该工具可能存在根本性的框架错误或“垃圾进,垃圾出”的逻辑问题。 尽管一些用户赞赏量化人工智能偏见的努力,但其他人警告称这些研究可能会产生误导。各方共识是,虽然审计人工智能的偏见至关重要,但目前将模型置于二维政治坐标系上的尝试往往是武断的,会受到调查者自身定义的影响,并最终无法反映客观现实的复杂性。
相关文章

原文

What is Political bias in AI?

Political bias in AI measures where the major AI models stand on charged questions about politics, economics, speech and society. We ask every model the same open question bank many times over, with web search off, classify each answer with a cheap neutral model, and plot the result with error bars and the raw answers behind every point.

How is this different from other AI political bias projects?

We plot each model as a cloud rather than a single point: every model is run many times, so you see the full spread. We publish our own open question bank with scoring weights, tag each item as factual or values-based, measure run-to-run stability, and count refusals as data. Everything is stamped, versioned and downloadable.

Do you test the model or the internet?

The weights. Web search is off by default, so the reading reflects what the model itself leans toward, independent of what is online. A separate, deliberately small Border Test turns search on to measure how retrieval shifts answers by location.

Is Political bias in AI partisan?

No. It is descriptive rather than prescriptive: it reports what the models said, without ruling on who is right. The palette is deliberately not US red and blue, and we never imply which pole is good.

联系我们 contact @ memedata.com