The Future of AI Maths May Be Deeply Weird

4 months ago 16

In fiction there is a popular image of a mathematician as a lone solitary figure, a figure of lost in their own world in a world of intense concentration before a chalkboard dense with arcane symbols. That may have had some truth in the past but the reality is that mathematics these days is a far more social phenomenon and future mathematicians may have a lot more in common with software engineers toiling away on pull requests on Github than they do with the archetype of a Andrew Wiles-like genius toiling away in isolation for a decade to prove Fermat's Last Theorem.

For those that don't follow the automated theorem proving ecosystem (Lean4, Coq and their predecessors Isabelle, Agda, HOL4, etc) are tools that were once confined to the esoteric frontiers of computer science research, are poised to reshape the very practice and philosophy of mathematics. The future of the discipline may be far stranger, and more collaborative, than we can currently imagine. And with the simultaneous advent of language models, may be much more exciting and rapid than we can currently imagine.

Lean is a modern theorem prover and programming language developed by Microsoft Research that allows mathematicians to write formal, machine-checkable proofs. Unlike traditional mathematical proofs written in natural language, Lean proofs must satisfy a rigorous type system where every step is verified by the computer, eliminating the possibility of logical errors or gaps in reasoning. The language combines dependent type theory with a powerful tactic system, enabling users to construct proofs interactively while the system ensures mathematical correctness at every stage.

Below is a formalization of Euclid's theorem on the infinitude of primes. The proof follows Euclid's constructive approach: given any natural number n, it constructs a prime p ≥ n by considering the smallest prime factor of n! + 1. The Lean proof uses several tactics and lemmas: minFac finds the minimal prime factor, factorial_pos establishes that factorials are positive, minFac_prime proves the minimal factor is indeed prime, and dvd_factorial handles divisibility properties. The crucial insight, formalized through Nat.dvd_add_iff_right, shows that if a prime divides both n! and n! + 1, it must divide their difference (which is 1), leading to a contradiction since primes cannot divide 1. This is the proof that every undergraduate mathematics student learns in their first year.

/-- Euclid's theorem on the **infinitude of primes**. Here given in the form: for every `n`, there exists a prime number `p ≥ n`. -/ theorem exists_infinite_primes (n : ℕ) : ∃ p, n ≤ p ∧ Prime p := let p := minFac (n ! + 1) have f1 : n ! + 1 ≠ 1 := ne_of_gt <| succ_lt_succ <| factorial_pos _ have pp : Prime p := minFac_prime f1 have np : n ≤ p := le_of_not_ge fun h => have h₁ : p ∣ n ! := dvd_factorial (minFac_pos _) h have h₂ : p ∣ 1 := (Nat.dvd_add_iff_right h₁).2 (minFac_dvd _) pp.not_dvd_one h₂ ⟨p, np, pp⟩

The formalization of the standard undergraduate mathematics curriculum in a format like this is no longer a distant dream. It is an achievable, near-term goal. Need only have a look at the standard library Mathlib and the amount of progress on the standard set of theorems is now incredibly vast and expressive. Another good example of this progress is the formalization of Terence Tao's textbook, Analysis I which has been meticulously translated into an explorable, crosslinked and machine-checkable set of theorems. A future in which is every textbook has a digitial equivelent and any theorem or lemma can be "double clicked" to expand its context and definitions is hardly science fiction anymore. It's essentially here.

If you want to experiment with Lean, you can get started very quickly with the following steps:

Install Visual Studio Code.
Install Lean + Elan:

curl https://elan.lean-lang.org/elan-init.sh -sSf | sh elan default leanprover/lean4:stable

Install the Lean 4 VS Code Extension: Open VS Code → Extensions → search for Lean 4 (author: leanprover.lean4)
Initialize a new Lean project:

lake init my_project cd my_project lake build

The Lean ecosystem also has a burgeoning flora of open source tools. The Mathlib library is a massive, collaborative repository of formalized mathematics, growing daily. To navigate this vast digital library, a suite of search engines has emerged. Moogle provides semantic search, allowing one to find theorems based on conceptual meaning rather than exact phrasing. Loogle specializes in finding expressions that match a specific logical structure, and LeanSearch offers natural language queries. For those wishing to experiment, the Lean 4 Playground provides an interactive environment to write and check proofs directly in a web browser.

Consider the educational landscape this might enable. Imagine a future where every first-year mathematics student downloads a single, 200-megabyte file containing the entire verified foundation of their subject. They would be guided by an LLM-assisted tutor, a system capable of planning lessons, verbalizing complex proofs, and providing instant, theorem prover checkable feedback on their own attempts at proofs. This is not science fiction; the technology for curriculum planning and natural language explanation is largely within our grasp today. Of course, this vision must be tempered with realism. Science is intrinsically a social phenomenon. A great deal of mathematical research is about developing taste, an aesthetic for what constitutes an elegant proof, and the intuition for which problems are worth pursuing. These are intangible qualities of the craft, born from culture and collaboration, which a machine may never fully capture.

As we move from established knowledge to the open frontiers of research, these tools present new and exciting possibilities. While the body of formalized proofs is growing, there remains a relative scarcity of formalized open conjectures. A project like the Deepmind's index of formal conjectures is valuable for several reasons. First, it provides a clear and challenging set of benchmarks for the development of AI-assisted automated theorem provers. Second, the very act of formalization forces mathematicians to clarify the precise, unambiguous meaning of a conjecture. It is almost certain that we will soon see a powerful, neurosymbolic system, a kind of next generation AlphaGeometry, unleashed upon this list in an attempt to systematically push the boundaries of machine-generated mathematics.

The artifacts produced by such research will themselves be objects of profound study. Picture a future where one can download the twenty-gigabyte weights of a neural network that has been trained through reinforcement learning to mastery over the entirety of undergraduate mathematics, using a theorem prover feedback as its reward function. The existence of such a model would provoke fascinating theoretical and philosophical questions. Is there a smaller, more elegant model that can achieve the same result? By examining its failure modes, what can we learn about the complexity hurdles in mathematics? How would it handle alternative axiom sets, and could it ever develop an "understanding" of Gödel's incompleteness, intuiting when a problem is fundamentally independent of the given axioms? There are many deep meta-mathematical questions we could ask if we had an existence proof of a giant inscrutable matrix that "itself" can do maths. There is no guarantee such a thing can exist (and the usual Gödel, Turing and Hilbert analyses still apply), but if it can, that opens up a whole new world of new questions.

Stretching our imagination further, we can envision a far future where this technology reaches its zenith. A dedicated data center could be tasked with a grand challenge, like the Riemann hypothesis. After weeks of processing, it might return a positive result: a complete, machine-checkable proof. Yet, the proof's internal structure could be so byzantine, so alien in its logic, that no human could ever hope to grasp its intuitive thrust, much like the computer-assisted proof of the Four Color Theorem or the controversy surrounding the Mochizuki proof of the ABC conjecture. This would usher in a disquieting new reality, a world where machines, not unlike the hyper-intelligent Minds from Iain M. Banks' Culture series, venture into intellectual territories forever beyond human comprehension. We would "know" (epistomigical concerns aside) and accept certain truths, but as humans we would never intuit or understand why they are true. And that is a very different world in which science happens.

We're not there yet. Need only watch Terence Tao's recent YouTube streams with Claude and o4 on Lean theorem proving reveal the stark limitations of our current systems when faced with even moderately challenging mathematical problems. The models struggle with basic proof strategies, make elementary logical errors, and often produce syntactically correct but mathematically meaningless Lean code. Yet watching these experiments unfold, it doesn't require a tremendous leap of imagination to envision how rapidly this landscape might change. The combination of increasingly sophisticated language models, improved integration with theorem provers, and better training methodologies specifically designed for formal mathematical reasoning suggests that today's clumsy first attempts may soon give way to genuinely more capable models.

This prospect may seem to diminish the human element of mathematics, but perhaps that fear is unfounded. Computers have decisively solved the games of chess and Go. Stockfish and AlphaGo can defeat any human player on Earth with ease. Yet, chess has never been more popular. These powerful systems have not killed the game; they have become indispensable tools for training, analysis, and appreciation, making the game more accessible to more people than ever before. It is possible the same will hold true for mathematics. As these strange and powerful new tools proliferate, they may not render the human mathematician obsolete. Instead, they might democratize access to mathematical reasoning, allowing more people to participate in the oldest and most abstract of sciences. Perhaps the frontiers of knowledge will widen at an incredible pace, but the path to that frontier will be more accessible for everyone. And maybe that is a very bright future indeed.

Read Entire Article

The Future of AI Maths May Be Deeply Weird

Related

10 Years of NitroKey – A Decade of Digital Security

Brazil Proposes a New Type of Fund to Protect Tropical Fores...

Disassembling Terabytes of Random Data with Zig and Capstone...