Bandit Algorithms for Website Optimization
Developing, Deploying, and Debugging
Publisher: O'Reilly Media
Release Date: December 2012
Pages: 88
Read on Safari with a 10-day trial
Start your free trial now Buy on AmazonWhere’s the cart? Now you can get everything on Safari. To purchase books, visit Amazon or your favorite retailer. Questions? See our FAQ or contact customer service:
1-800-889-8969 / 707-827-7019
support@oreilly.com
When looking for ways to improve your website, how do you decide which changes to make? And which changes to keep? This concise book shows you how to use Multiarmed Bandit algorithms to measure the real-world value of any modifications you make to your site. Author John Myles White shows you how this powerful class of algorithms can help you boost website traffic, convert visitors to customers, and increase many other measures of success.
This is the first developer-focused book on bandit algorithms, which were previously described only in research papers. You’ll quickly learn the benefits of several simple algorithms—including the epsilon-Greedy, Softmax, and Upper Confidence Bound (UCB) algorithms—by working through code examples written in Python, which you can easily adapt for deployment on your own website.
- Learn the basics of A/B testing—and recognize when it’s better to use bandit algorithms
- Develop a unit testing framework for debugging bandit algorithms
- Get additional code examples written in Julia, Ruby, and JavaScript with supplemental online materials
Table of Contents
-
Chapter 1 Two Characters: Exploration and Exploitation
-
The Scientist and the Businessman
-
The Explore-Exploit Dilemma
-
-
Chapter 2 Why Use Multiarmed Bandit Algorithms?
-
What Are We Trying to Do?
-
The Business Scientist: Web-Scale A/B Testing
-
-
Chapter 3 The epsilon-Greedy Algorithm
-
Introducing the epsilon-Greedy Algorithm
-
Describing Our Logo-Choosing Problem Abstractly
-
Implementing the epsilon-Greedy Algorithm
-
Thinking Critically about the epsilon-Greedy Algorithm
-
-
Chapter 4 Debugging Bandit Algorithms
-
Monte Carlo Simulations Are Like Unit Tests for Bandit Algorithms
-
Simulating the Arms of a Bandit Problem
-
Analyzing Results from a Monte Carlo Study
-
Exercises
-
-
Chapter 5 The Softmax Algorithm
-
Introducing the Softmax Algorithm
-
Implementing the Softmax Algorithm
-
Measuring the Performance of the Softmax Algorithm
-
The Annealing Softmax Algorithm
-
Exercises
-
-
Chapter 6 UCB – The Upper Confidence Bound Algorithm
-
Introducing the UCB Algorithm
-
Implementing UCB
-
Comparing Bandit Algorithms Side-by-Side
-
Exercises
-
-
Chapter 7 Bandits in the Real World: Complexity and Complications
-
A/A Testing
-
Running Concurrent Experiments
-
Continuous Experimentation vs. Periodic Testing
-
Bad Metrics of Success
-
Scaling Problems with Good Metrics of Success
-
Intelligent Initialization of Values
-
Running Better Simulations
-
Moving Worlds
-
Correlated Bandits
-
Contextual Bandits
-
Implementing Bandit Algorithms at Scale
-
-
Chapter 8 Conclusion
-
Learning Life Lessons from Bandit Algorithms
-
A Taxonomy of Bandit Algorithms
-
Learning More and Other Topics
-