Resource Aggregator for Online Learning
A collection of resources on Online Learning, Multi-Armed Bandits, and related areas.
Books
Online Learning
-
Introduction to Online Convex Optimization by Elad Hazan
-
Introduction to Online Optimization by Sebastien Bubeck
-
A Modern Introduction to Online Learning by Francesco Orabona
-
Online Learning and Online Convex Optimization by Shai Shalev-Shwartz
-
Prediction, Learning, and Games by Nicolo Cesa-Bianchi and Gabor Lugosi
-
Statistical Learning and Sequential Prediction by Alexander Rakhlin and Karthik Sridharan
-
Online learning Lecture Notes by Gábr Bartók, Dávid Pál, Csaba Szepesvári and István Szita
Bandit
-
Regret Analysis of Stochastic and Nonstochastic Multi-armed Bandit Problems by Sebastien Bubeck and Nicolo Cesa-Bianchi
-
Introduction to Multi-Armed Bandits by Aleksandrs Slivkins
-
Bandit Algorithms by Tor Lattimore and Csaba Szepesvári
-
Bandit Convex Optimisation by Tor Lattimore
Online Algorithms
-
Online Computation and Competitive Analysis by Allan Borodin and Ran El-Yaniv
-
The Design of Competitive Online Algorithms via a Primal-Dual Approach by Niv Buchbinder and Joseph (Seffi) Naor
-
An Introduction to Online Computation by Dennis Komm
-
Prophets and Secretaries by Anupam Gupta
Reinforcement Learning Theory
-
Reinforcement Learning: Theory and Algorithms by Alekh Agarwal, Nan Jiang, Sham M. Kakade and Wen Sun
-
Algorithms for Reinforcement Learning by Csaba Szepesvári
-
Statistical Reinforcement Learning and Decision Making: Course Notes by Dylan J. Foster and Alexander Rakhlin
-
Reinforcement Learning: Foundations by Shie Mannor, Yishay Mansour, and Aviv Tamar
Control Theory
- Online Nonstochastic Control by Elad Hazan and Karan Singh
Surveys
-
Online Learning Algorithms by Nicolo Cesa-Bianchi and Francesco Orabona
-
The Multiplicative Weights Update Method: a Meta-Algorithm and Applications by Sanjeev Arora, Elad Hazan, and Satyen Kale
-
Potential-Function Proofs for Gradient Methods by Nikhil Bansal and Anupam Gupta
-
Handbook of Convergence Theorems for (Stochastic) Gradient Methods by Guillaume Garrigos and Robert M. Gower
-
No-Regret Dynamics in the Fenchel Game: A Unified Framework for Algorithmic Convex Optimization by Jun-Kun Wang, Jacob Abernethy and Kfir Y. Levy
Blogs
-
Parameter-free Learning and Optimization Algorithms by Francesco Orabona
-
Bandit Algorithms by Tor Lattimore and Csaba Szepesvári
-
I’m a bandit by Sebastien Bubeck
-
StephenTu’s blog by Stephen Tu - great content on control theory related topics
-
Tim van Erven’s blog by Tim van Erven
-
Adversarial Intelligence by Wouter Koolen
Courses
- Haipeng Luo
- Akshay Krishnamurthy
- Spring 2022: COMS 6998-11: Bandits and Reinforcement Learning
- Fall 2017: Machine Learning Theory
- Kevin Jamieson
- Winter 2022: CSE 541 Interactive Machine Learning
- Spring 2021: CSE 599 Interactive Machine Learning in Non-stochastic Environments
- Winter 2021: CSE 599 Interactive Machine Learning in Stochastic Environments
- Winter 2020: CSE 599 Interactive Machine Learning
- Winter 2018: CSE 599 Online and Adaptive Methods for Machine Learning
- Chi Jin
- Spring 2022: ECE524: Foundations of Reinforcement Learning
- Spring 2023: ELE539/COS512: Optimization for Machine Learning
- Fall 2021: ECE434/COS434: Machine Learning Theory
Video Lectures
-
Nicolo Cesa Bianchi at Summer Graduate School on Mathematics of Machine Learning 2022
-
Kevin Jamieson at Summer Graduate School on Mathematics of Machine Learning 2019
-
Five Miracles of Mirror Descent by Sebastien Bubeck
-
Bandit Convex Optimization by Sebastien Bubeck
-
Bandit Algorithm (Online Machine Learning) by Prof. Manjesh Hanawal
Simons Programs
-
Interactive Learning within the Foundations of Machine Learning program
-
Optimization, Statistics and Uncertainty within the Bridging Continuous and Discrete Optimization program