In October, OpenAI claimed a model had cracked ten Erdős problems. It had only rediscovered known answers, and the mathematician who runs the Erdős problems site called the post a dramatic misrepresentation. On May 20, a general-purpose reasoning model disproved a conjecture open since 1946, and this time Fields Medalist Tim Gowers and the same skeptics signed off.

Thomas Bloom spends part of his life keeping score for a dead genius. He runs erdosproblems.com, the running ledger of questions the Hungarian mathematician Paul Erdős left unsolved when he died in 1996. So when OpenAI declared a mathematical triumph last October, Bloom was exactly the wrong person to bluff.

He called the company's claim "a dramatic misrepresentation."

This week, Bloom's name is on a paper vouching for OpenAI.

On May 20, OpenAI said one of its general-purpose reasoning models had produced an original proof disproving the planar unit distance conjecture, a question Erdős first posed in 1946. The company called it "the first time AI has autonomously solved a prominent open problem central to a field of mathematics." The difference between this announcement and the last one is not the confidence. It is that the mathematicians who humiliated OpenAI seven months ago checked the work first.

The October Fiasco Set the Bar for Doubt

To understand why this week matters, rewind to the embarrassment.

In October 2025, OpenAI's then vice president Kevin Weil posted on X that GPT-5 had "found solutions to 10 (!) previously unsolved Erdős problems and made progress on 11 others." It was the kind of line that travels fast. It was also wrong.

GPT-5 had not solved anything. It had located solutions that already existed in the mathematical literature and presented them as fresh. Bloom, whose website was the source being misread, said so bluntly. Yann LeCun and Google DeepMind chief Demis Hassabis piled on. Weil deleted the post.

That history is the reason OpenAI's May announcement arrived wrapped in external review rather than a victory lap. The company published companion remarks from working mathematicians alongside the result, instead of asking the public to take its word.

The Problem Is Simple to State and Brutal to Solve

The unit distance problem sounds like a children's puzzle. Place dots on a flat sheet of paper. Count how many pairs of dots sit exactly one unit apart. The question: as you add more points, how fast can that count of unit-distance pairs grow?

For nearly 80 years, the consensus answer pointed at the square grid. Arrange points in a tidy lattice and you get a predictable number of one-unit pairs. Erdős proposed that the count could only grow slightly faster than the number of points itself, never dramatically more. Generations of mathematicians tried and failed to settle whether anything could beat the grid.

OpenAI's model found something that does. According to the company, it discovered an infinite family of point arrangements that produce significantly more unit-distance pairs than the classic grid, breaking the picture mathematicians had carried for decades.

What unsettled experts was the route the model took. Instead of leaning on the usual tricks of combinatorial geometry, it reached into algebraic number theory, the study of exotic number systems that extend the ordinary integers. The proof used machinery rarely seen near this kind of geometry problem, including infinite class field towers and Golod-Shafarevich theory. In plain terms, the model used hidden symmetries buried inside strange number systems to manufacture far more one-unit distances than anyone expected was possible.

Princeton mathematician Will Sawin later refined the result, pinning the improvement to a fixed exponent rather than a vanishing one. The model found the door. A human helped measure how far it opened.

The Verification Chain Is the Real News

A bold claim from an AI lab is cheap. The names attached to this one are not.

The proof went through external review, and the reviewers produced a companion paper explaining the argument and why it matters. The roster reads like a who's who of the field:

Tim Gowers, the Fields Medal winner, who called the achievement "a milestone in AI mathematics."
Noga Alon and Melanie Wood, both leading figures in combinatorics and number theory.
Arul Shankar, a number theorist who said the work shows AI systems can move past assisting mathematicians and start generating genuinely original ideas.
Thomas Bloom, the same researcher who called OpenAI's October post a dramatic misrepresentation.

When the people who exposed your last mistake agree to put their reputations behind your next claim, that is the verification that counts. The October failure, painful as it was, is part of why this result is credible: the skeptics were already watching, and this time they did not flinch.

OpenAI's framing is that the model held together a long, difficult chain of reasoning and connected ideas across fields in ways researchers had not explored. That is the capability practitioners care about, far more than the geometry itself. A model that can sustain a multi-step argument and import tools from an unrelated branch of math is a model that might do the same in biology, physics, or engineering. For a working sense of why long, structured reasoning is the frontier worth watching, see our explainer on reasoning models and how AI learned to think step by step.

What the Proof Does Not Prove

The result is genuine. The hype around it should still be handled with tongs.

This is one problem. Disproving a single conjecture, however famous, is not the same as a machine independently building new mathematical theory at scale. The model found a construction; a human mathematician, Will Sawin, sharpened it into its cleaner final form. The line between autonomous discovery and very fast assistance is blurrier than a press release suggests.

There is also the matter of what "general-purpose" earns OpenAI. The company stresses the proof came from a reasoning model not built for math and not pointed at this problem in particular. That is the impressive part. It is also the unverifiable part, since outside researchers cannot audit how the model was prompted, how many attempts it took, or how much scaffolding sat around it. The mathematics has been checked. The process has not.

Skeptics like LeCun have spent years arguing that today's models pattern-match rather than reason. A clean, reviewed proof of an 80-year-old problem is a real data point against that view. It is not the end of the argument, and it sits alongside benchmarks that still expose how much these systems do not know, like the brutal evaluation we covered in Humanity's Last Exam.

The Bottom Line

Seven months ago, OpenAI mistook a library lookup for a discovery and got caught by the man who keeps the records. This week, that same man helped certify that one of OpenAI's models did something humans had not managed in 80 years: it broke the unit distance conjecture, and it did so by dragging deep number theory into a geometry problem nobody thought needed it.

The lesson for engineers is not that math is solved. It is that the gap between "the model sounds confident" and "the model is correct" now has referees, and on this problem the referees said yes. Whether the same model can do this on demand, on the next problem, without a Princeton professor to tidy up, is the question that actually decides how much this changes the work.

Bloom, for his part, sounded less like a skeptic than a kid who just found a trapdoor. "AI is helping us to more fully explore the cathedral of mathematics we have built over the centuries," he said. "What other unseen wonders are waiting in the wings?"

Sources

An OpenAI model has disproved a central conjecture in discrete geometry — OpenAI, May 20, 2026
OpenAI claims it solved an 80-year-old math problem — for real this time — TechCrunch, May 20, 2026
80-year-old geometry mystery cracked by OpenAI using deep number theory — Interesting Engineering, May 20, 2026
Remarks on the disproof of the unit distance conjecture (companion paper) — arXiv, May 2026
OpenAI's embarrassing math (October claim) — TechCrunch, October 19, 2025
OpenAI Says It Solved an Erdős Puzzle, This Time For Real — Technology.org, May 21, 2026

Practice interview problems based on real data

1,500+ SQL & Python problems across 15 industry datasets — the exact type of data you work with.

Try 250 free problems

Free Career Roadmaps8 PATHS

Step-by-step roadmaps from zero to job-ready — curated courses, salary data, and the exact learning order that gets you hired.

Explore all career paths

Recommended Reading

Curated articles related to this topic

News

9 min

Trump Spent Weeks Drafting an AI Order. Thursday, He Scrapped It at the Last Minute.

President Trump scrapped the signing of a landmark AI executive order on May 21, 2026, telling reporters he did not want to blunt America's lead over China. The order, in development for weeks after Anthropic's Mythos model autonomously found thousands of cyber vulnerabilities, would have created a voluntary framework for labs to share frontier models with the government 90 days before release. Even that compromise proved too much for an administration torn between China competition and AI safety.

May 21, 2026

News

7 min

Google's Flash Model Just Beat Its Own Flagship. The Real Target Is Your Agents.

Google launched Gemini 3.5 Flash at I/O 2026, a fast, cheap model that outperforms its flagship Gemini 3.1 Pro on nearly every benchmark. The release, paired with Antigravity 2.0 and Managed Agents in the Gemini API, marks Google's strategic shift from conversational AI to autonomous agents.

May 20, 2026

News

6 min

A Jury Took Less Than Two Hours to End Elon Musk's War on OpenAI

A nine-person jury in Oakland dismissed Elon Musk's lawsuit against Sam Altman, OpenAI, and Microsoft in under two hours, ruling his claims fell outside the three-year statute of limitations. The court never addressed the merits. Musk plans to appeal, but the verdict clears the largest legal threat to OpenAI's for-profit conversion.

May 20, 2026

News

11 min

The npm Worm That Just Poisoned 317 More Packages in 20 Minutes

A self-replicating npm worm called Mini Shai-Hulud, run by the threat actor group TeamPCP, hit a new wave on May 19, 2026, hijacking the atool maintainer account and pushing 637 malicious versions across 317 packages. The same worm previously breached TanStack, Mistral AI, Guardrails AI, and two OpenAI corporate devices on May 11.

May 19, 2026

News

10 min

Andrej Karpathy Joined Anthropic. His First Job: Use Claude to Build the Next Claude.

Andrej Karpathy, one of OpenAI's original 11 founders and former Tesla AI director, joined Anthropic on May 19, 2026, paused his startup Eureka Labs, and was given the mandate to spin up a team that uses Claude itself to accelerate pre-training research on the next Claude.

May 19, 2026

News

7 min

A Year Ago, 9% of Businesses Paid for Anthropic. In April It Passed OpenAI.

For the first time, more businesses are paying for Anthropic than for OpenAI, according to Ramp's May 2026 AI Index. Anthropic quadrupled its business adoption over twelve months, but the economist who published the data laid out three headwinds that could erase the lead.

May 14, 2026

News

7 min

Amazon Killed Rufus. Its Replacement Will Check Out for You.

Amazon retired Rufus, its 2024 shopping chatbot, and folded it into Alexa for Shopping, an agentic assistant that schedules recurring orders, sets price-triggered purchases, and completes checkouts on third-party retailers' websites. It rolls out to every U.S. shopper within a week, making it one of the largest consumer deployments of agentic AI to date.

May 14, 2026

News

8 min

Demis Hassabis Just Raised $2.1 Billion to Turn AlphaFold Into a Drug Company

Alphabet-owned Isomorphic Labs raised 2.1 billion dollars in Series B funding from Thrive Capital, Alphabet, MGX, Temasek, and the UK Sovereign AI Fund. The round brings the AlphaFold spinout's total capital base above 2.6 billion dollars and funds the first internal pipeline of drugs designed end-to-end by its IsoDDE AI engine, expected to enter human trials in 2026.

May 13, 2026

News

9 min

Anthropic Is Buying the Company That Builds OpenAI's and Google's SDKs

Anthropic is reportedly in advanced talks to buy Stainless, the AI-powered SDK generator used by OpenAI, Google, Meta, and others, for at least 300 million dollars. The acquisition would continue an Anthropic streak that includes Bun, Vercept, and Coefficient Bio, and would hand Anthropic ownership of a quiet but critical layer of the AI developer stack used by every major competitor.

May 13, 2026

News

12 min

Google Found the First AI-Built Zero-Day in the Wild. The Exploit Code Was Still Hallucinating CVSS Scores.

Google's Threat Intelligence Group has documented the first real-world case of criminals using an AI model to discover and weaponize a zero-day vulnerability. The exploit targeted a two-factor authentication bypass in a popular open-source admin tool and was caught before the mass-exploitation campaign could launch.

May 12, 2026