By NSF TILOS6 March 2025

Tutorial on AI Alignment (part 2 of 2): Methodologies for AI Alignment

Ahmad Beirami, Google DeepMind
Hamed Hassani, University of Pennsylvania

The second part of the tutorial focuses on AI alignment techniques and is structured as three segments: In the first segment, we examine black-box techniques aimed at aligning models towards various goals (e.g., safety), such as controlled decoding and the best-of-N algorithm. In the second segment, we will also consider efficiency, where we examine information-theoretic techniques designed to improve inference latency, such as model compression or speculative decoding. If time permits, in the final segment, we discuss inference-aware alignment, which is a framework to align models to work better with inference-time compute algorithms.

5views

Workshops and Tutorials

You may also like

TILOS HOT-AI Workshop: Flat Minima and Generalization with Maryam Fazel (University of Washington)

TILOS HOT-AI Workshop: Flat Minima and Generalization with Maryam Fazel (University of Washington)

9views

Machine Learning,

Workshops and Tutorials

TILOS HOT-AI Workshop: Hunting the Hessian with Madeleine Udell (Stanford University)

TILOS HOT-AI Workshop: Hunting the Hessian with Madeleine Udell (Stanford University)

8views

Machine Learning,

Workshops and Tutorials

TILOS HOT-AI Workshop: From Test-Time Tweaks to Global Guarantees with Mahdi Soltanolkotabi (USC)

TILOS HOT-AI Workshop: From Test-Time Tweaks to Global Guarantees with Mahdi Soltanolkotabi (USC)

3views

Machine Learning,

Workshops and Tutorials

TILOS HOT-AI Workshop: The Wisdom of the Body Revisited with Benjamin Recht (UC Berkeley)

TILOS HOT-AI Workshop: The Wisdom of the Body Revisited with Benjamin Recht (UC Berkeley)

7views

Workshops and Tutorials

TILOS HOT-AI Workshop: Accelerating Nonconvex Optimization via Online Learning with Aryan Mokhtari (UT Austin)

TILOS HOT-AI Workshop: Accelerating Nonconvex Optimization via Online Learning with Aryan Mokhtari (UT Austin)

7views

Workshops and Tutorials

TILOS HOT-AI Workshop: The Binary Iterative Hard Thresholding Algorithm with Arya Mazumdar (TILOS & UC San Diego)

TILOS HOT-AI Workshop: The Binary Iterative Hard Thresholding Algorithm with Arya Mazumdar (TILOS & UC San Diego)

3views

Workshops and Tutorials

TILOS HOT-AI Workshop: Reverse diffusion Monte Carlo with Yian Ma (TILOS & UC San Diego)

TILOS HOT-AI Workshop: Reverse diffusion Monte Carlo with Yian Ma (TILOS & UC San Diego)

8views

Workshops and Tutorials

TILOS HOT-AI Workshop: Linear Bregman Divergence Control with Babak Hassibi (Caltech)

TILOS HOT-AI Workshop: Linear Bregman Divergence Control with Babak Hassibi (Caltech)

8views

Workshops and Tutorials

TILOS HOT-AI Workshop: Unleashing the Power of Variance Reduction for Training Large Models with Quanquan Gu (UCLA)

TILOS HOT-AI Workshop: Unleashing the Power of Variance Reduction for Training Large Models with Quanquan Gu (UCLA)

3views

Machine Learning,

Workshops and Tutorials

TILOS HOT-AI Workshop: Optimization and Reasoning with Sean Gao (TILOS & UC San Diego)

TILOS HOT-AI Workshop: Optimization and Reasoning with Sean Gao (TILOS & UC San Diego)

6views

Workshops and Tutorials

Page 1 of 2

Leave A Reply Cancel reply