7 videos found

00:56:00 | Transformers Learn Generalizable Chain-of-Thought Reasoning via Gradient Descent | 3 views | Foundations, Machine Learning, Optimization for ML & AI Seminar Series
01:00:42 | (De)regularized Wasserstein Gradient Flows via Reproducing Kernels | 2 views | Foundations, Machine Learning, Optimization for ML & AI Seminar Series
00:55:51 | Extended Convex Lifting for Policy Optimization in Control | 2 views | Optimization for ML & AI Seminar Series
00:58:24 | Randomized Linear Algebra with Subspace Injections | 4 views | Foundations, Optimization for ML & AI Seminar Series
00:55:54 | Stochastic-Gradient and Diagonal-Scaling Algorithms for Constrained Optimization and Learning | 7 views | Foundations, Machine Learning, Optimization for ML & AI Seminar Series
01:00:09 | Training Neural Networks at Any Scale | 1 view | Machine Learning, Optimization for ML & AI Seminar Series
00:54:16 | High-dimensional Optimization with Applications to Compute-Optimal Neural Scaling Laws | 9 views | Foundations, Machine Learning, Optimization for ML & AI Seminar Series, TILOS Seminar Series