Thompson Sampling

Thompson Sampling is a Bayesian approach to multi-armed bandits. This notebook reviews the theory, then walks through my implementation and some experiments. The experiments should give you a good understanding of how Thompson Sampling behaves in comparison to epsilon-greedy and UCB. To run the notebook online, click this link and open with Colab.

For a more extensive review of the theory, check out A Tutorial on Thompson Sampling by Russo et al., 2017.
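To give a flavour of the idea before opening the notebook, here is a minimal sketch of Thompson Sampling on a Bernoulli bandit. It is not the code from `thompson.ipynb`; it assumes binary rewards, a uniform Beta(1, 1) prior on each arm, and the function name `thompson_bernoulli` and its parameters are made up for illustration. At each step it draws one sample from every arm's posterior, plays the arm with the largest sample, and updates that arm's posterior with the observed reward.

```python
import random


def thompson_bernoulli(true_probs, n_steps, seed=0):
    """Beta-Bernoulli Thompson Sampling (illustrative sketch, hypothetical API).

    true_probs: success probability of each arm (unknown to the agent).
    Returns (pulls per arm, total reward, alpha params, beta params).
    """
    rng = random.Random(seed)
    n_arms = len(true_probs)
    # Assumption: Beta(1, 1) prior, i.e. uniform over each arm's success rate.
    alpha = [1] * n_arms
    beta = [1] * n_arms
    pulls = [0] * n_arms
    total_reward = 0

    for _ in range(n_steps):
        # Draw one plausible success rate per arm from its posterior.
        samples = [rng.betavariate(alpha[a], beta[a]) for a in range(n_arms)]
        # Act greedily with respect to the sampled rates.
        arm = max(range(n_arms), key=lambda a: samples[a])
        # Simulate a Bernoulli reward from the chosen arm.
        reward = 1 if rng.random() < true_probs[arm] else 0
        # Conjugate update: successes go to alpha, failures to beta.
        alpha[arm] += reward
        beta[arm] += 1 - reward
        pulls[arm] += 1
        total_reward += reward

    return pulls, total_reward, alpha, beta


if __name__ == "__main__":
    pulls, total, alpha, beta = thompson_bernoulli([0.1, 0.5, 0.9], 2000)
    print("pulls per arm:", pulls)
    print("total reward:", total)
```

Exploration here comes from the posterior sampling itself: arms with few pulls have wide posteriors and occasionally produce the largest sample, so there is no separate exploration parameter as in epsilon-greedy.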

andrecianflone/thompson
