谷歌因其引发争议而解雇蒂姆尼特·格布鲁（Timnit Gebru），如今这些针对大语言模型的警告已全部应验。

谷歌因其引发争议而解雇蒂姆尼特·格布鲁（Timnit Gebru），如今这些针对大语言模型的警告已全部应验。
The LLM warnings Google fired Timnit Gebru over have all come true

原始链接: https://www.tumblr.com/dreaminginthedeepsouth/817865966907228160/darren-oconnor-timnit-gebru-was-fired-from

2020年，谷歌解雇了其人工智能伦理团队的领导者蒂姆尼特·格布鲁（Timnit Gebru）博士，原因在于她拒绝撤回其研究论文《论随机鹦鹉的危险性》（On the Dangers of Stochastic Parrots）。这篇与埃米莉·本德（Emily Bender）等人合著的论文，概述了大型语言模型（LLM）的五大关键风险，而这些风险后来被证明具有惊人的预见性。格布鲁警告称，模型的规模化将导致其成为无法真正理解含义的“随机鹦鹉”，从而产生广泛的“幻觉”现象。她预测，训练数据的偏差将固化招聘、医疗和金融领域的歧视，这一现象如今已得到大量记录。此外，她还强调了巨大的环境成本、大规模数据集无法审计（导致有害内容扩散）的问题，以及可能削弱语言多样性的文化权力集中化风险。谷歌解雇格布鲁并解散其团队的决定，有效地压制了内部异见，在研究人员中制造了一种恐惧文化。尽管科技巨头无视她的警告，执意追求快速发展，但随后的行业危机——从模型崩溃到破纪录的碳排放——都印证了她的研究。格布鲁被解雇一事是一个鲜明的例证，表明企业激励机制如何将利润置于安全与伦理考量之上，而后者对于人工智能的负责任发展至关重要。

近期 Hacker News 上的一场讨论围绕 Timnit Gebru 2021 年发表的“随机鹦鹉”（Stochastic Parrots）论文及其遗产展开，探讨了她对大语言模型（LLM）的警告是否已成现实。该论文最初针对以下风险提出了警告： 1. **规模：** 只有流畅度而无实际理解能力。 2. **偏见：** 系统性地放大社会偏见。 3. **环境：** 训练模型所需的高昂能源成本。 4. **文档记录：** 海量数据集导致审计变得不可能。 5. **中心化：** 技术巨头对权力的垄断。参与讨论者对此意见分歧严重。批评者认为该论文已过时，指出现代大语言模型展现出的推理和解决问题能力，已经超越了“随机鹦鹉”的标签。而另一派则认为，尽管关于什么是“理解”仍有争议，但偏见、幻觉和中心化等负面影响已显而易见。讨论帖还反映出人们对原始争议的定性持怀疑态度，一些用户质疑 Gebru 究竟是“被解雇”还是“辞职”，另一些用户则认为所提供的原始来源是低质量的标题党内容。归根结底，这场讨论反映了科技界内部持续存在的紧张关系：即人工智能安全问题究竟是合理的警示，还是被夸大且流于形式的批判。

原文

"Timnit Gebru was fired from Google in December 2020 for refusing to retract a research paper, and every single warning that paper made about large language models has now happened at a scale the industry spent 4 years trying to make people forget about.

Her name is Timnit Gebru.

She co-led the Ethical AI team at Google. She co-wrote a paper called "On the Dangers of Stochastic Parrots" with Emily Bender at the University of Washington and two other researchers. The paper was 14 pages long. It was submitted to a top AI ethics conference. And it was the reason Google decided that one of the most senior Black women in AI research could no longer work there.

The story Google told publicly was that she resigned. The story she told, confirmed by 2,695 of her colleagues in an open letter, was that she was fired by email while on vacation because she refused to either retract the paper or remove her name from it.

The paper had not even been published yet.

Here is what she actually wrote, and why every prediction inside it has now come true.

The first warning was about scale itself. Bender and Gebru argued that training ever-larger models on ever-larger scrapes of the internet would produce systems that appeared fluent but had no actual understanding of language. They called these systems stochastic parrots because they would repeat patterns from training data with statistical confidence and zero comprehension. The paper predicted that this apparent intelligence would fool both users and developers into trusting outputs that were structurally incapable of being reliable.

This was 2020. GPT-3 had just come out. The paper predicted the hallucination problem before anyone had a word for it.

The second warning was about bias amplification. The paper documented in detail that internet-scale training data contains systematic overrepresentation of dominant viewpoints and underrepresentation of marginalized ones. The models would not just absorb this bias. They would amplify it, because the optimization process rewards confident outputs, and confidence in language patterns tracks frequency in the training set.

The prediction was that hiring tools built on these models would discriminate against women. That healthcare triage tools would underperform on Black patients. That loan approval systems would entrench inequality while presenting their decisions as neutral algorithmic judgment.

Every one of those things has now been documented in deployment.

Amazon's hiring algorithm penalized resumes that contained the word "women" in any context. Healthcare risk scoring algorithms used by major US hospitals were found to systematically underestimate the medical needs of Black patients. Apple Card's credit algorithm gave wives credit lines 10x lower than their husbands for the same financial profile.

The third warning was about environmental cost. The paper calculated that training a single large language model produced emissions equivalent to the lifetime output of 5 cars. The prediction was that the race to scale would create an environmental footprint that would eventually rival entire industries.

In 2024, Google's emissions were up 48% from 2019, and the company explicitly blamed AI infrastructure. Microsoft's were up 29%, same reason. Both companies have now quietly abandoned the climate commitments they were publicly celebrating the year Gebru was fired.

The fourth warning was about documentation. The paper argued that the training datasets being assembled were too large for anyone to actually audit. Nobody at Google, OpenAI, Meta, or any other lab could tell you with confidence what was in the data their models were trained on. This was not a temporary problem to be solved later. It was a permanent feature of the approach.

In 2023, researchers discovered that the LAION-5B dataset, used to train Stable Diffusion and other major image models, contained thousands of images of child sexual abuse material. The companies that had trained on the dataset had no way of knowing. The paper predicted that category of failure 3 years before it was found.

The fifth warning was the one Google cared about most.

Bender and Gebru argued that the deployment of these systems would centralize linguistic and cultural power in the hands of the small number of companies that could afford to train them. The internet would become a place where the dominant voice was a statistical average of dominant voices, presented as a neutral assistant. Languages underrepresented in the training data would degrade over time as more web content was generated by these systems and fed back into the next training run.

This is now happening in real time. A 2024 study found that 57% of new web content in English is AI-generated or AI-assisted. Researchers studying low-resource languages have documented active degradation in translation quality, because the synthetic content fed back into training is itself worse in those languages.

The paper Google fired her for predicted the model collapse problem before model collapse had a name.

The mechanism behind why this all happened is the part of her work that nobody quotes.

Gebru's argument was not that AI is dangerous in some abstract sci-fi sense. Her argument was that AI is dangerous in a very specific structural sense. The technology was being built by a small group of researchers who shared similar backgrounds, worked at similar companies, and were rewarded for shipping products faster than competitors. The incentive structure made it impossible for safety, ethics, and bias concerns to slow anything down. Anyone inside the system who raised those concerns was either ignored, sidelined, or removed.

She was making that argument from inside Google.

Then Google proved her right by removing her.

The team Google had built to make sure their AI was safe was dismantled in 90 days because they did the job they had been hired to do. Margaret Mitchell, the other co-lead of the Ethical AI team, was fired two months after Gebru for searching through her own emails for evidence of how Gebru had been treated.

Gebru did not stop. She founded DAIR, the Distributed AI Research Institute, in 2021. The mission is to do AI research outside the control of the companies that have a financial interest in not hearing the answers.

Every prediction in the Stochastic Parrots paper has now been validated by deployment. Hallucinations are an industry-wide problem the largest labs cannot solve. Bias amplification has been documented in hiring, healthcare, lending, and criminal justice. Environmental costs are larger than entire small countries. Training data audits remain impossible. Model collapse is an active research crisis at every major lab.

The question worth sitting with is the one almost no one in the industry will say out loud.

Every researcher with the technical credibility to call out these problems watched what happened to her in December 2020 and made a calculation about their own career. The number of people willing to speak publicly about safety and ethics issues inside the major AI labs collapsed after that firing and has not recovered.

The researcher Google fired for warning about exactly what is now happening was right.

The company that fired her is now the second-largest deployer of the technology she warned about.

And the people inside that company who agree with her are not allowed to say so."

谷歌因其引发争议而解雇蒂姆尼特·格布鲁（Timnit Gebru），如今这些针对大语言模型的警告已全部应验。 The LLM warnings Google fired Timnit Gebru over have all come true

谷歌因其引发争议而解雇蒂姆尼特·格布鲁（Timnit Gebru），如今这些针对大语言模型的警告已全部应验。
The LLM warnings Google fired Timnit Gebru over have all come true