
TILOS Seminar: The Dissimilarity Dimension: Sharper Bounds for Optimistic Algorithms
HDSI 123 and Virtual 3234 Matthews Ln, La Jolla, CA, United StatesAldo Pacchiano, Assistant Professor, Boston University Center for Computing and Data Sciences Abstract: The principle of Optimism in the Face of Uncertainty (OFU) is one of the foundational algorithmic design choices in Reinforcement Learning and Bandits. Optimistic algorithms balance exploration and exploitation by deploying data collection strategies that maximize expected rewards in plausible models. This […]