DeepMind proof agent solves 9 Erdős problems
A Google DeepMind paper reports an AI-driven formal proof system that autonomously solved 9 of 353 open Erdős problems and 44 of 492 OEIS conjectures, with inference costs of a few hundred dollars per problem. The result is less about raw headline score and more about showing that LLMs plus Lean can produce machine-checkable math research at meaningful scale.
The important shift here is not “AI beats mathematicians,” but “verification turns probabilistic reasoning into something that can actually survive contact with formal math.”
- –Solving 9 of 353 open Erdős problems is still a small hit rate, but every win is formally checked, which is a much higher bar than most benchmark math claims
- –The paper says a basic LLM-plus-Lean loop replicated the Erdős results, which suggests the search/verification scaffold matters as much as model intelligence
- –44 solved OEIS conjectures broadens the signal beyond one math niche and points to a reusable theorem-search pattern
- –The cost profile matters: a few hundred dollars per problem is the kind of number that makes this usable as a research assistant, not just a demo
- –For developers, the takeaway is broader than math: any domain with a strong validator could benefit from the same agent loop
DISCOVERED
4h ago
2026-05-26
PUBLISHED
1d ago
2026-05-24
RELEVANCE
AUTHOR
Independent-Wind4462