业余爱好者利用ChatGPT解决了一个埃尔德什问题。

业余爱好者利用ChatGPT解决了一个埃尔德什问题。
Amateur armed with ChatGPT solves an Erdős problem

原始链接: https://www.scientificamerican.com/article/amateur-armed-with-chatgpt-vibe-maths-a-60-year-old-problem/

## 人工智能解决长期存在的数学难题一位23岁的业余数学家利安·普莱斯最近使用OpenAI的ChatGPT Pro解决了保罗·埃尔德什提出的一个60年未解的问题——这项成就让专业数学家们困惑了几十年。这个问题涉及“原始数集”，其中一个数不能被另一个数整除，以及埃尔德什关于这些集合的最低可能“得分”的猜想。使这个解决方案与众不同的是*如何*实现的。与之前在埃尔德什问题上取得的AI成功不同，ChatGPT似乎采用了完全新颖的方法，以一种意想不到的方式使用了已知公式。加州大学洛杉矶分校的特伦斯·陶等专家认为，人工智能绕过了阻碍人类数学家的“思维障碍”。虽然最初的输出需要专家进行完善，但核心见解已经显示出在数论领域更广泛应用的前景，可能揭示大数之间的新的联系。这一突破表明这些问题可能比以前想象的更简单，并验证了人们对它们相互关联性的现有直觉，为数学探索提供了新的视角。

一位业余数学家成功利用ChatGPT解决了著名数学家保罗·埃尔德什提出的一个长期存在的问题。该人工智能并未依赖传统方法或互联网搜索，而是以一种新颖的方式应用了相关数学领域中的已知公式。 Hacker News上的评论员强调这表明了人工智能整合知识和提供新视角的潜力，即使它在复杂计算方面存在困难。成功归功于人工智能“天真”的方法，摆脱了传统思维的束缚。讨论还围绕着使用人工智能进行此类问题解决的成本——一些人质疑如果计算成本过高，其价值是否值得——以及其他人使用类似工具的失败尝试的普遍性。最终，这一事件引发了关于智能本质以及人工智能在数学发现中潜力的大讨论。

原文

April 24, 2026

4 min read

Add Us On GoogleAdd SciAm

An amateur just solved a 60-year-old math problem—by asking AI

A ChatGPT AI has proved a conjecture with a method no human had thought of. Experts believe it may have further uses

By Joseph Howlett edited by Lee Billings

An orange cube resembling a puzzle, suspended in space against a lavender background. — Eugene Mymrin/Getty Images

Liam Price just cracked a 60-year-old problem that world-class mathematicians have tried and failed to solve. He’s 23 years old and has no advanced mathematics training. What he does have is a ChatGPT Pro subscription, which gives him access to the latest large language models from OpenAI.

Artificial intelligence has recently made headlines for solving a number of “Erd&odblac;s problems,” conjectures left behind by the prolific mathematician Paul Erd&odblac;s. But experts have warned that these problems are an imperfect benchmark of artificial intelligence’s mathematical prowess. They range dramatically in both significance and difficulty, and many AI solutions have turned out to be less original than they appeared.

The new solution—which Price got in response to a single prompt to GPT-5.4 Pro and posted on www.erdosproblems.com, a website devoted to the Erd&odblac;s problems, just over a week ago—is different. The problem it solves has eluded some prominent minds, bestowing it some esteem. And more importantly, the AI seems to have used a totally new method for problems of this kind. It’s too soon to say with certainty, but this LLM-conceived connection may be useful for broader applications—something hard to find among recently touted AI triumphs in math.

On supporting science journalism

If you're enjoying this article, consider supporting our award-winning journalism by subscribing. By purchasing a subscription you are helping to ensure the future of impactful stories about the discoveries and ideas shaping our world today.

“This one is a bit different because people did look at it, and the humans that looked at it just collectively made a slight wrong turn at move one,” says Terence Tao, a mathematician at the University of California, Los Angeles, who has become a prominent scorekeeper for AI’s push into his field. “What’s beginning to emerge is that the problem was maybe easier than expected, and it was like there was some kind of mental block.”

The question Price solved—or prompted ChatGPT to solve—concerns special sets of whole numbers, where no number in the set can be evenly divided by any other. Erd&odblac;s called these “primitive sets” because of their connection to similarly indivisible prime numbers.

“A number is prime if it has no other divisors, and this is kind of generalizing that definition from an individual number to a collection of numbers,” says Jared Lichtman, a mathematician at Stanford University. Any set of prime numbers is automatically primitive, because primes have no factors (except themselves and the number one).

Erd&odblac;s also came up with the Erd&odblac;s sum, a “score” you can calculate for any primitive set. He showed that the biggest the sum could be was about 1.6—and conjectured that this value must also hold for the (infinite) set of all prime numbers. Lichtman proved Erd&odblac;s right as part of his doctoral thesis in 2022.

Erd&odblac;s also noticed that the score drops if all of a set’s numbers are large—the larger the numbers, the lower the score. He guessed that the lowest this score could be was exactly one, a limit that the score would approach as the set’s numbers approached infinity. Lichtman tried to prove this, too, but got stuck like everyone else before him.

Price wasn’t aware of this history when he entered the problem into ChatGPT on an idle Monday afternoon. “I didn’t know what the problem was—I was just doing Erd&odblac;s problems as I do sometimes, giving them to the AI and seeing what it can come up with,” he says. “And it came up with what looked like a right solution.”

He sent it to his occasional collaborator Kevin Barreto, a second-year undergraduate in mathematics at the University of Cambridge. The duo had jump-started the AI-for-Erd&odblac;s craze late last year by prompting a free version of ChatGPT with open problems chosen at random from the Erd&odblac;s problems website. (An AI researcher subsequently gifted them each a ChatGPT Pro subscription to encourage their “vibe mathing.”)

Reviewing Price’s message, Barreto realized what they had was special, and experts whom he notified quickly took notice.

“There was kind of a standard sequence of moves that everyone who worked on the problem previously started by doing,” Tao says. The LLM took an entirely different route, using a formula that was well known in related parts of math, but which no one had thought to apply to this type of question.

“The raw output of ChatGPT’s proof was actually quite poor. So it required an expert to kind of sift through and actually understand what it was trying to say,” Lichtman says. But now he and Tao have shortened the proof so that it better distills the LLM’s key insight.

More importantly, they already see other potential applications of the AI’s cognitive leap. “We have discovered a new way to think about large numbers and their anatomy,” Tao says. “It’s a nice achievement. I think the jury is still out on the long-term significance.”

Lichtman is hopeful because ChatGPT’s discovery validates a sense he’s had since graduate school. “I had the intuition that these problems were kind of clustered together and they had some kind of unifying feel to them,” he says. “And this new method is really confirming that intuition.”

It’s Time to Stand Up for Science

If you enjoyed this article, I’d like to ask for your support. Scientific American has served as an advocate for science and industry for 180 years, and right now may be the most critical moment in that two-century history.

I’ve been a Scientific American subscriber since I was 12 years old, and it helped shape the way I look at the world. SciAm always educates and delights me, and inspires a sense of awe for our vast, beautiful universe. I hope it does that for you, too.

If you subscribe to Scientific American, you help ensure that our coverage is centered on meaningful research and discovery; that we have the resources to report on the decisions that threaten labs across the U.S.; and that we support both budding and working scientists at a time when the value of science itself too often goes unrecognized.

In return, you get essential news, captivating podcasts, brilliant infographics, can't-miss newsletters, must-watch videos, challenging games, and the science world's best writing and reporting. You can even gift someone a subscription.

There has never been a more important time for us to stand up and show why science matters. I hope you’ll support us in that mission.