This project provides a simulation of multi-armed bandit problems. The implementation is based on the following paper: https://arxiv.org/abs/2308.14350.
Updated Nov 8, 2024 - Python
This project is an empirical comparative analysis of the following stochastic multi-armed bandit algorithms: Epsilon-Greedy, SoftMax, UCB-1, UCB-V, UCB-KL, UCB-MOSS, Bayes-UCB, and Thompson Sampling. Their performance is evaluated under several metrics across multiple simulation runs. A peer-to-peer setting is also analysed.
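To illustrate the kind of comparison described above, here is a minimal, self-contained sketch of a Bernoulli bandit simulation pitting Epsilon-Greedy against UCB-1, measured by cumulative regret. It is not taken from the repository; the function names, arm probabilities, and horizon are illustrative assumptions.

```python
import math
import random

def run_bandit(policy, probs, horizon, seed=0):
    """Simulate a Bernoulli bandit; return total reward and cumulative regret."""
    rng = random.Random(seed)
    k = len(probs)
    counts = [0] * k      # number of pulls per arm
    values = [0.0] * k    # empirical mean reward per arm
    total = 0.0
    for t in range(1, horizon + 1):
        arm = policy(t, counts, values, rng)
        reward = 1.0 if rng.random() < probs[arm] else 0.0
        counts[arm] += 1
        values[arm] += (reward - values[arm]) / counts[arm]  # incremental mean
        total += reward
    # Regret relative to always pulling the best arm in expectation.
    regret = horizon * max(probs) - total
    return total, regret

def epsilon_greedy(eps):
    """Explore uniformly with probability eps, otherwise exploit the best mean."""
    def policy(t, counts, values, rng):
        if rng.random() < eps:
            return rng.randrange(len(counts))
        return max(range(len(values)), key=values.__getitem__)
    return policy

def ucb1(t, counts, values, rng):
    """Pull each arm once, then maximize mean + sqrt(2 ln t / n)."""
    for arm, n in enumerate(counts):
        if n == 0:
            return arm
    return max(range(len(counts)),
               key=lambda a: values[a] + math.sqrt(2.0 * math.log(t) / counts[a]))

probs = [0.2, 0.5, 0.8]   # hypothetical arm success probabilities
horizon = 5000
_, regret_eg = run_bandit(epsilon_greedy(0.1), probs, horizon)
_, regret_ucb = run_bandit(ucb1, probs, horizon)
print(f"Epsilon-Greedy regret: {regret_eg:.1f}, UCB-1 regret: {regret_ucb:.1f}")
```

In a fuller study like the one described, each algorithm would be run over many independent seeds and the regret curves averaged, since a single run is noisy.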