Regret Analysis of Stochastic and Nonstochastic Multi-Armed Bandit Problems

Regret Analysis of Stochastic and Nonstochastic Multi-Armed Bandit Problems
Author :
Publisher :
Total Pages : 137
Release :
ISBN-10 : 1601986270
ISBN-13 : 9781601986276
Rating : 4/5 (276 Downloads)

Book Synopsis Regret Analysis of Stochastic and Nonstochastic Multi-Armed Bandit Problems by : Sébastien Bubeck

Download or read book Regret Analysis of Stochastic and Nonstochastic Multi-Armed Bandit Problems written by Sébastien Bubeck and published by . This book was released on 2012 with total page 137 pages. Available in PDF, EPUB and Kindle. Book excerpt: Multi-armed bandit problems are the most basic examples of sequential decision problems with an exploration-exploitation trade-off. This is the balance between staying with the option that gave highest payoffs in the past and exploring new options that might give higher payoffs in the future. In this monograph, the focus is on two extreme cases in which the analysis of regret is particularly simple and elegant: independent and identically distributed payoffs and adversarial payoffs. Besides the basic setting of finitely many actions, it also analyzes some of the most important variants and extensions, such as the contextual bandit model.


Regret Analysis of Stochastic and Nonstochastic Multi-Armed Bandit Problems Related Books