Skip to content

Implementation of the Upper confidence bounds and Thompson sampling algorithms in R for the multi armed bandit problem

Notifications You must be signed in to change notification settings

Eric-Su-2718/Reinforcement-learning-methods-for-the-multi-armed-bandit-problem

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Reinforcement learning methods for the multi armed bandit problem

The .Rmd file contains R code for demonstration of two reinforcement learning methods to solve the multi-armed bandit problem. The .url file shows how the RMarkdown file looks like after being knitted. Click here to view the project.

Methods used include:

  • Upper Confidence Bound (UCB)
  • Thompson sampling using conjugate priors
  • Thompson sampling using Markov chain Monte Carlo (MCMC)

About

Implementation of the Upper confidence bounds and Thompson sampling algorithms in R for the multi armed bandit problem

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages