• Optimization for ML and AI Seminar: Self-play Algorithms for Math Theorem Proving

    HDSI 123 and Virtual 3234 Matthews Ln, La Jolla, CA, United States

    Tengyu Ma, Stanford University Abstract: I will discuss RL algorithms for automated theorem proving with LLMs, especially in the possible future regime where we run out of high-quality training data. To keep improving the models with limited data, we draw inspiration from mathematicians, who continuously develop new results, partly by proposing novel conjectures or exercises […]

  • TILOS-HDSI Seminar: ComPO: Preference Alignment via Comparison Oracles

    HDSI 123 and Virtual 3234 Matthews Ln, La Jolla, CA, United States

    Tianyi Lin, Columbia University Direct alignment methods are increasingly used for aligning large language models (LLMs) with human preferences. However, these methods suffer from the likelihood displacement, which can be driven by noisy preference pairs that induce similar likelihood for preferred and dis-preferred responses. To address this issue, we consider doing derivative-free optimization based on […]

  • Optimization for ML and AI Seminar: A non-equilibrium phase transition with broken ergodicity leads to double descent and benign overfitting in machine learning

    HDSI 123 and Virtual 3234 Matthews Ln, La Jolla, CA, United States

    Nigel Goldenfeld, UC San Diego Department of Physics and HDSI Abstract: The remarkable ability of modern neural networks to generalize improves with increasing network capacity, even when the number of model parameters or effective degrees of freedom exceeds the number of training data points. This phenomenon is all the more surprising given that generalization error […]

  • TILOS-HDSI Seminar: Inference-Time Algorithms: A Theoretical Lens on Tractability and Error Propagation

    HDSI 123 and Virtual 3234 Matthews Ln, La Jolla, CA, United States

    Andrej Risteski, Carnegie Mellon University Abstract: Modern AI systems are increasingly built by placing trained models inside larger computational loops. Inference-time algorithms are a basic instance of this idea: they use one or more trained models at test time to incorporate new information, exploit pretrained models as priors, and trade computational effort for accuracy, sample […]