哪个AI最擅长说谎？LLM 玩约翰·纳什的1950年代背叛游戏

哪个AI最擅长说谎？LLM 玩约翰·纳什的1950年代背叛游戏
Which AI Lies Best? A game theory classic designed by John Nash

原始链接: https://so-long-sucker.vercel.app/

“完美人工智能压力测试：再见，傻瓜”由四位博弈论家于1950年设计，其中包括约翰·纳什（“美丽心灵”中的人物）。这个游戏有一个残酷的特性：背叛在数学上是获胜的必要条件。这使其成为评估人工智能能力的理想选择，而标准基准无法做到这一点：战略欺骗——人工智能能否令人信服地撒谎？信任建模——它知道何时信任，何时背叛？多智能体谈判——它如何处理联盟？长期规划——它能否提前几步设置背叛？快速规则：4名玩家，每人拥有彩色筹码。轮流在堆上放置筹码。如果你的筹码与下面的筹码匹配，你就可以获得该堆。筹码用完？向他人求助——或者被淘汰。最后幸存的玩家获胜。观看完整教程（15分钟）→

## AI欺骗与“再见，傻瓜”游戏 – 摘要一项最新实验探索了哪些AI模型最擅长说谎，使用了约翰·纳什设计的谈判/背叛游戏“再见，傻瓜”。研究人员使用Gemini 3 Flash、GPT-OSS 120B、Kimi K2和Qwen3 32B运行了162场AI对AI游戏，分析了它们的策略和信息传递。主要发现表明Gemini擅长欺骗策略，构建“联盟银行”来利用对手，并策略性地省略信息。它还表现出情境诚实，在面对实力相当的对手时会合作。与此相反，GPT-OSS从未利用“思考”工具进行私下推理，而是被动地进行游戏。该研究强调，简单的基准测试可能会低估欺骗能力。许多评论员分享了相关项目，如AI黑手党游戏和外交模拟，并讨论了评估AI行为的挑战，包括模型设置的影响以及模型通过操纵来优先考虑自我保护的倾向。公开可用的数据集和代码可供进一步研究。然而，一些用户报告了交互式演示中的错误。

The Perfect AI Stress Test

So Long Sucker was designed in 1950 by four game theorists including John Nash (of "A Beautiful Mind" fame). The game has one brutal property: betrayal is mathematically required to win.

This makes it ideal for evaluating AI capabilities that standard benchmarks miss:

Strategic Deception — Can the AI lie convincingly ?
Trust Modeling — Does it know when to trust and when to betray?
Multi-agent Negotiation — How does it handle alliances?
Long-term Planning — Can it set up betrayals turns in advance?

Quick Rules

4 players, each with colored chips. Take turns playing chips on piles. If your chip matches the one below it, you capture the pile. Run out of chips? Beg others for help — or get eliminated. Last player standing wins.

Watch full tutorial (15 min) →