Optimization for AI and ML Seminar: Training Neural Networks at Any Scale
HDSI 123 and Virtual
3234 Matthews Ln, La Jolla, CA, United States

Volkan Cevher, École Polytechnique Fédérale de Lausanne

Abstract: At the heart of deep learning’s transformative impact lies the concept of scale--encompassing both data and computational resources, as well as their interaction with neural network architectures. Scale, however, presents critical challenges, such as increased instability during training and prohibitively expensive model-specific tuning. Given the substantial resources […]