大型语言模型总是倾向于选择它们自己生成的简历，而不是由人类或其他模型生成的简历。

大型语言模型总是倾向于选择它们自己生成的简历，而不是由人类或其他模型生成的简历。
AI Self-preferencing in Algorithmic Hiring: Empirical Evidence and Insights

这项研究调查了“人工智能自我偏好”——大型语言模型（LLM）倾向于偏爱自身生成的内容——及其对算法招聘的影响。通过使用简历的大规模实验，研究发现LLM始终更喜欢它们自己生成的简历，而不是由人类或其他人工智能模型撰写的简历，即使质量相同。这种自我偏好偏差非常显著，范围从67%到82%，并且不成比例地不利于提交人工撰写简历的申请人。在24个职业的模拟中显示，使用与招聘人工智能相同的LLM的候选人被筛选出来的可能性高出23%到60%。这种偏差在商业领域最为明显。重要的是，研究人员证明通过有针对性的干预措施，这种偏差可以减少50%以上。研究结果凸显了人工智能决策中的一项新风险，呼吁更广泛地理解人工智能公平性，这不仅要超越人口统计学偏差，还要包括人工智能系统*之间*的偏差。

原文

[Submitted on 30 Aug 2025 (v1), last revised 9 Feb 2026 (this version, v3)]

View a PDF of the paper titled AI Self-preferencing in Algorithmic Hiring: Empirical Evidence and Insights, by Jiannan Xu and 2 other authors

View PDF HTML (experimental)

Abstract:As artificial intelligence (AI) tools become widely adopted, large language models (LLMs) are increasingly involved on both sides of decision-making processes, ranging from hiring to content moderation. This dual adoption raises a critical question: do LLMs systematically favor content that resembles their own outputs? Prior research in computer science has identified self-preference bias -- the tendency of LLMs to favor their own generated content -- but its real-world implications have not been empirically evaluated. We focus on the hiring context, where job applicants often rely on LLMs to refine resumes, while employers deploy them to screen those same resumes. Using a large-scale controlled resume correspondence experiment, we find that LLMs consistently prefer resumes generated by themselves over those written by humans or produced by alternative models, even when content quality is controlled. The bias against human-written resumes is particularly substantial, with self-preference bias ranging from 67% to 82% across major commercial and open-source models. To assess labor market impact, we simulate realistic hiring pipelines across 24 occupations. These simulations show that candidates using the same LLM as the evaluator are 23% to 60% more likely to be shortlisted than equally qualified applicants submitting human-written resumes, with the largest disadvantages observed in business-related fields such as sales and accounting. We further demonstrate that this bias can be reduced by more than 50% through simple interventions targeting LLMs' self-recognition capabilities. These findings highlight an emerging but previously overlooked risk in AI-assisted decision making and call for expanded frameworks of AI fairness that address not only demographic-based disparities, but also biases in AI-AI interactions.

From: Jiannan Xu [view email]
[v1] Sat, 30 Aug 2025 11:40:11 UTC (3,032 KB)
[v2] Thu, 11 Sep 2025 16:59:36 UTC (3,032 KB)
[v3] Mon, 9 Feb 2026 13:24:26 UTC (5,723 KB)

大型语言模型总是倾向于选择它们自己生成的简历，而不是由人类或其他模型生成的简历。 AI Self-preferencing in Algorithmic Hiring: Empirical Evidence and Insights

大型语言模型总是倾向于选择它们自己生成的简历，而不是由人类或其他模型生成的简历。
AI Self-preferencing in Algorithmic Hiring: Empirical Evidence and Insights