8 videos found 00:56:52 A survey of the mixing times of the Proximal Sampler algorithm 5views Foundations,Optimization for ML & AI Seminar Series 00:56:00 Transformers Learn Generalizable Chain-of-Thought Reasoning via Gradient Descent 10views Foundations,Machine Learning,Optimization for ML & AI Seminar Series 01:00:42 (De)regularized Wasserstein Gradient Flows via Reproducing Kernels 8views Foundations,Machine Learning,Optimization for ML & AI Seminar Series 00:55:51 Extended Convex Lifting for Policy Optimization in Control 6views Optimization for ML & AI Seminar Series 00:58:24 Randomized Linear Algebra with Subspace Injections 11views Foundations,Optimization for ML & AI Seminar Series 00:55:54 Stochastic-Gradient and Diagonal-Scaling Algorithms for Constrained Optimization and Learning 14views Foundations,Machine Learning,Optimization for ML & AI Seminar Series 01:00:09 Training Neural Networks at Any Scale 10views Machine Learning,Optimization for ML & AI Seminar Series 00:54:16 High-dimensional Optimization with Applications to Compute-Optimal Neural Scaling Laws 17views Foundations,Machine Learning,Optimization for ML & AI Seminar Series,TILOS Seminar Series