Reinforcement learning methods for the multi-armed bandit problem

The RL_bandit.Rmd file contains R code demonstrating two reinforcement learning methods for solving the multi-armed bandit problem. The RL_bandit.url file shows what the R Markdown file looks like after being knitted.

Methods used include:

Upper Confidence Bound (UCB)
Thompson sampling using conjugate priors
Thompson sampling using Markov chain Monte Carlo (MCMC)
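
To make the first two methods concrete, here is a minimal sketch in Python. It is an illustration only and is not taken from the repository's R code. It assumes Bernoulli (0/1) rewards, the standard UCB1 exploration bonus, and a uniform Beta(1, 1) prior for the conjugate Thompson sampler. The function names (ucb1_select, thompson_select, run_bandit) are hypothetical. The MCMC variant is not sketched here.

```python
import math
import random


def ucb1_select(counts, sums, t):
    """Pick an arm by the UCB1 rule: empirical mean plus an exploration bonus."""
    # Play every arm once before trusting the confidence bounds.
    for arm, n in enumerate(counts):
        if n == 0:
            return arm
    scores = [
        sums[a] / counts[a] + math.sqrt(2.0 * math.log(t) / counts[a])
        for a in range(len(counts))
    ]
    return max(range(len(counts)), key=scores.__getitem__)


def thompson_select(successes, failures, rng):
    """Pick an arm by drawing once from each arm's Beta posterior.

    Assumes a Beta(1, 1) prior, so the posterior is Beta(1 + s, 1 + f).
    """
    draws = [rng.betavariate(1 + s, 1 + f) for s, f in zip(successes, failures)]
    return max(range(len(draws)), key=draws.__getitem__)


def run_bandit(true_probs, n_rounds, method, seed=0):
    """Simulate a Bernoulli bandit and return the number of pulls per arm."""
    rng = random.Random(seed)
    k = len(true_probs)
    counts = [0] * k      # pulls per arm
    successes = [0] * k   # rewards equal to 1, per arm
    for t in range(1, n_rounds + 1):
        if method == "ucb":
            arm = ucb1_select(counts, successes, t)
        elif method == "thompson":
            failures = [counts[a] - successes[a] for a in range(k)]
            arm = thompson_select(successes, failures, rng)
        else:
            raise ValueError(f"unknown method: {method}")
        # Hypothetical environment: arm pays 1 with probability true_probs[arm].
        reward = 1 if rng.random() < true_probs[arm] else 0
        counts[arm] += 1
        successes[arm] += reward
    return counts
```

Calling run_bandit([0.2, 0.5, 0.8], 5000, "ucb") or the same with "thompson" returns the pull counts per arm. Under these assumptions both strategies should end up pulling the arm with the highest success probability most often.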

Eric-Su-2718/Reinforcement-learning-methods-for-the-multi-armed-bandit-problem
