Grant Sanderson (@3blue1brown) – AI and the future of math Summary — Dwarkesh Podcast

This episode explores whether AI’s rapid progress in mathematics—from IMO gold medal performance to potentially solving Millennium Prize problems—represents a path toward AGI, featuring Grant Sanderson discussing how mathematical breakthroughs differ from other forms of intelligence and what this means for the future of human mathematicians.

Solving IMO problems or Millennium Prize problems represents another benchmark rather than an “aha” moment of AGI, as AI capabilities improve generally without sudden qualitative shifts.
AI excels at geometry through brute force methods but struggles with combinatorics, which requires more creativity and playful problem-solving.
The distinction matters because combinatorics problems might represent the kind of creative thinking that remains challenging for AI, unlike the systematic approaches that work for geometry.
Even if AI solves the Riemann hypothesis through connecting random matrix theory and number theory, this differs qualitatively from white-collar work that requires broader contextual understanding.
The Montgomery-Dyson connection—where number theory zeros matched random matrix eigenvalue statistics—illustrates the kind of cross-field insight that AI might excel at without matching general human intelligence.

Mathematical breakthroughs like Galois theory require verification loops that can span centuries, as the value of new abstractions only becomes apparent through applications in cryptography, physics, and other fields.
Galois developed group theory while in prison, but his work wasn’t properly understood until Liouville and Jordan refined it decades later, showing how recognition of valuable ideas can be delayed.
Unlike immediate problem-solving, conceptual innovations need time to prove their worth through downstream applications, making them difficult to train for with current RLVR methods.
The difference between proof and explanation matters: Timothy Chow’s work on forcing shows that even proven results can lack intuitive understanding, highlighting the need for distillation.
Measuring conjecture-generating ability will likely depend on subjective “tone shifts” in how mathematicians discuss AI’s usefulness, rather than objective benchmark scores.

The Langlands program exemplifies mathematics as a “research ethos” focused on mapping connections between seemingly unrelated fields, rather than solving specific problems.
Major breakthroughs often involve connecting different domains—like general relativity linking Riemannian geometry and special relativity—which AI might excel at due to superhuman breadth across fields.
Current autoregressive methods struggle with unlikely connections because they’re optimized for predictable next-token prediction rather than creative leaps.
Systematic approaches to increasing entropy—running multiple agents with different biases or perspectives—might unlock more cross-field insights than simply scaling current architectures.
The advantage of digital minds includes parallelization: unlike individual human geniuses, AI can apply connection-making capabilities universally across all problems at a given capability level.

Math and coding progress quickly because they’re both verifiable and “grindable”—allowing massive parallel rollouts and clear credit assignment for successful approaches.
Real-world tasks like computer use lack grindability due to bot detection, changing environments, and difficulty containerizing problems for repeated experimentation.
Lean formalization provides process-based supervision but isn’t essential for current progress, as natural language verification with meta-verifiers can work effectively.
The potential exists for endless Mathlib extension where AI continuously generates proofs, conjectures, and theories without human intervention, creating an infinite tree of mathematical exploration.
This differs from natural language math because formal systems allow unconstrained exploration without the heuristics that limit human mathematical discourse.

Learning with LLMs works best when humans provide curated artifacts—books, videos, articles—that organize concepts with proper motivation and logical progression.
LLMs function like enhanced search engines, helping identify the right human-created resources rather than replacing them entirely.
Students considering math careers should understand where value comes from: prestige, teaching, grant funding, or entertainment—knowing this helps navigate AI-driven changes.
Teaching remains one of the most stable post-AGI careers because it’s deeply relational and social, extending beyond mere explanation to mentorship and motivation.
If AI generates novel mathematical fields, the role of mathematicians may shift toward curation—helping society navigate and apply these new insights rather than discovering them.

Writing requires theory of mind that AI lacks: understanding how readers process information, what associations they’ll make, and anticipating their mental models.
Good writing involves deliberate unpredictability—knowing when to make novel moves rather than just increasing randomness—which current autoregressive methods struggle with.
LLMs excel at explaining pre-existing ideas but cannot generate the original insights that make books worth reading in the first place.
The social function of writers and educators—providing motivation, curation, and relational engagement—remains irreplaceable even as technical capabilities advance.
Mathematics may face questions about practical relevance if AI progress accelerates pure math without corresponding applications, forcing the field to justify its value to society.