Prediction, learning, and games Nicolò Cesa-Bianchi and Gábor Lugosi Cambridge University Press, 2006
Regret Analysis of Stochastic and Nonstochastic Multi-armed Bandit Problems S. Bubeck and N. Cesa-Bianchi, . In Foundations and Trends in Machine Learning, Vol 5: No 1, 1-122, 2012.
Approachability, Regret and Calibration: Implications and equivalences. V. Perchet, Journal of Dynamics and Games, 1:181-254, 2014
Lattimore, T., & Szepesvári, C. (2020). Bandit algorithms. Cambridge University Press.