Optimization for ML and AI Seminar: Self-play Algorithms for Math Theorem Proving

Start: 2026-05-08T11:00:00-07:00
End: 2026-05-08T12:00:00-07:00
Location: HDSI 123 and Virtual

May 8 @ 11:00 - 12:00 PDT

Tengyu Ma, Stanford University

Abstract: I will discuss RL algorithms for automated theorem proving with LLMs, especially in the possible future regime where we run out of high-quality training data. To keep improving the models with limited data, we draw inspiration from mathematicians, who continuously develop new results, partly by proposing novel conjectures or exercises and attempting to solve them. We design the Self-play Theorem Prover (STP), which simultaneously takes on two roles, conjecturer and prover, each providing training signals to the other. At the end of the talk, I will mention a recent paper on extending the algorithm to include another role, Guide, which helps guide the conjecturer to generate clean and relevant conjectures, and a few other related works on using AI for math.
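
For readers unfamiliar with the idea, the two-role loop described in the abstract can be illustrated with a toy simulation. This is only a sketch under strong assumptions and not the STP implementation: there is no LLM and no proof checker, the prover is reduced to a hypothetical scalar `skill`, each conjecture to a hypothetical scalar `difficulty`, and the logistic success model, the update rules, and all constants are invented for illustration.

```python
import math
import random


def success_probability(skill: float, difficulty: float) -> float:
    """Toy assumption: logistic chance that the prover proves the conjecture."""
    return 1.0 / (1.0 + math.exp(difficulty - skill))


def self_play(rounds: int = 200, attempts: int = 8, seed: int = 0):
    """Run a toy conjecturer/prover loop; every quantity here is hypothetical."""
    rng = random.Random(seed)
    skill = 0.0    # stand-in for the prover's ability
    target = 0.0   # difficulty the conjecturer currently aims for
    history = []
    for _ in range(rounds):
        # Conjecturer role: propose a conjecture near the current target.
        difficulty = target + rng.gauss(0.0, 0.5)

        # Prover role: several independent proof attempts.
        p = success_probability(skill, difficulty)
        solved = sum(rng.random() < p for _ in range(attempts))
        pass_rate = solved / attempts

        # Signal to the prover: successful proofs act as training data,
        # and rarely solved conjectures teach more than easy ones.
        if solved > 0:
            skill += 0.05 * (1.0 - pass_rate)

        # Signal to the conjecturer: prefer conjectures that are solved
        # sometimes but not always.
        if 0 < solved < attempts:
            target += 0.2 * (difficulty - target)
        elif solved == attempts:
            target += 0.1   # too easy, aim harder
        else:
            target -= 0.1   # never solved, aim easier

        history.append((difficulty, pass_rate, skill))
    return skill, target, history


if __name__ == "__main__":
    final_skill, final_target, _ = self_play()
    print(f"final skill: {final_skill:.2f}, conjecture target: {final_target:.2f}")
```

In this toy, the conjecturer's target tends to track the prover's skill, so the prover keeps receiving conjectures near the edge of what it can prove. The actual method presented in the talk may define and combine these signals quite differently.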

Tengyu Ma is an assistant professor of computer science at Stanford University. His research interests broadly include topics in machine learning, algorithms and their theory, such as deep learning, (deep) reinforcement learning, pre-training / foundation models, robustness, non-convex optimization, distributed optimization, and high-dimensional statistics.

Details

Date: May 8, 2026
Time: 11:00 - 12:00 PDT
Event Categories: TILOS Seminar Series, TILOS Sponsored Event

Organizers

Halicioglu Data Science Institute
TILOS

Venue

HDSI 123 and Virtual
3234 Matthews Ln
La Jolla, CA 92093 United States