MRAS Bandits

Project Proposal and Report

Links:

TODO

Fix MRAS-Categorical-Subset reproducibility
Tune MRAS-Dirichlet-Subset

Install and Run

Use Python 3.6. Install the dependencies with: pip install -r requirements.txt

To run:

export PYTHONPATH=.
python sim.py

Benchmark Algorithms

UCB
Thompson Sampling
Asym-UCB
KL-UCB (Needs to be sped up)
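As a point of reference for the first benchmark, here is a minimal sketch of the textbook UCB1 rule on Bernoulli arms. It is not the code in this repository: the function names ucb1_index and run_ucb1, the Bernoulli reward model, and the sqrt(2 ln t / n) bonus are assumptions chosen for illustration.

```python
import math
import random


def ucb1_index(mean, pulls, t):
    """UCB1 index: empirical mean plus the bonus sqrt(2 ln t / pulls)."""
    return mean + math.sqrt(2.0 * math.log(t) / pulls)


def run_ucb1(arm_means, horizon, seed=0):
    """Play Bernoulli arms with UCB1 and return the pull count of each arm."""
    rng = random.Random(seed)
    n_arms = len(arm_means)
    pulls = [0] * n_arms
    sums = [0.0] * n_arms
    for t in range(1, horizon + 1):
        if t <= n_arms:
            # Pull every arm once so each empirical mean is defined.
            arm = t - 1
        else:
            # Pick the arm with the largest upper confidence bound.
            arm = max(
                range(n_arms),
                key=lambda a: ucb1_index(sums[a] / pulls[a], pulls[a], t),
            )
        reward = 1.0 if rng.random() < arm_means[arm] else 0.0
        pulls[arm] += 1
        sums[arm] += reward
    return pulls


if __name__ == "__main__":
    print(run_ucb1([0.1, 0.9], horizon=2000))
```

With a large gap between the arms, most pulls should go to the better arm, which is a quick sanity check for any UCB variant.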

Simulation Experiments

We try the following parameter distributions:

Categorical
Dirichlet
Gaussian

We also experiment with the following:

Increasing function H.
Exploitation param lambda.
Simulation allocation M_k.
Population size N_o

Adding a new algorithm

Add it under the bandits folder.
Add a unit test to verify that it works.
Import it in sim.py.
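The steps above could look roughly like the sketch below. The interface the simulator actually expects is not documented here, so the class name EpsilonGreedy and the methods select_arm and update are hypothetical; adapt them to match the existing classes in the bandits folder.

```python
import random
import unittest


class EpsilonGreedy:
    """Hypothetical example algorithm; the interface is an assumption."""

    def __init__(self, n_arms, epsilon=0.1, seed=0):
        self.n_arms = n_arms
        self.epsilon = epsilon
        self.rng = random.Random(seed)
        self.pulls = [0] * n_arms
        self.means = [0.0] * n_arms

    def select_arm(self):
        # Explore uniformly with probability epsilon, otherwise exploit.
        if self.rng.random() < self.epsilon:
            return self.rng.randrange(self.n_arms)
        return max(range(self.n_arms), key=lambda a: self.means[a])

    def update(self, arm, reward):
        # Incremental update of the empirical mean of the pulled arm.
        self.pulls[arm] += 1
        self.means[arm] += (reward - self.means[arm]) / self.pulls[arm]


class TestEpsilonGreedy(unittest.TestCase):
    def test_update_tracks_mean(self):
        algo = EpsilonGreedy(n_arms=2)
        algo.update(0, 1.0)
        algo.update(0, 0.0)
        self.assertEqual(algo.pulls, [2, 0])
        self.assertAlmostEqual(algo.means[0], 0.5)

    def test_greedy_picks_best_arm(self):
        algo = EpsilonGreedy(n_arms=2, epsilon=0.0)
        algo.update(1, 1.0)
        self.assertEqual(algo.select_arm(), 1)


if __name__ == "__main__":
    unittest.main()
```

Fixing the seed and testing the deterministic parts (the mean update, the greedy choice with epsilon set to 0) keeps the unit test reproducible.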

Working with configs

We use sacred for configuration and for capturing arguments. Read more here. Modify base-config.yaml to add more games.

If you need very different args, create a new config file and run as: python sim.py with configs/new-config.yaml

Critical Checks

Is regret being computed correctly? Right now we accumulate (best_mean - reward) at every step. Because the reward is noisy, an individual term (and even the running sum in a single run) can be negative, but the average over experiments is positive.
Is the UCB implementation correct?
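One way to check the regret question is to compare the accumulated (best_mean - reward) against the pseudo-regret (best_mean - mean of the pulled arm), which is never negative. The sketch below does this for a uniformly random policy with unit-variance Gaussian rewards; the function name simulate_regret, the policy, and the noise model are assumptions made for illustration and are not taken from sim.py.

```python
import random


def simulate_regret(arm_means, horizon, seed=0):
    """Pull arms uniformly at random; return (realized, pseudo) cumulative regret."""
    rng = random.Random(seed)
    best_mean = max(arm_means)
    realized = 0.0
    pseudo = 0.0
    for _ in range(horizon):
        arm = rng.randrange(len(arm_means))
        reward = rng.gauss(arm_means[arm], 1.0)
        # Realized regret uses the noisy reward, so a single term can be negative.
        realized += best_mean - reward
        # Pseudo-regret uses the true mean of the pulled arm, so it is never negative.
        pseudo += best_mean - arm_means[arm]
    return realized, pseudo


if __name__ == "__main__":
    runs = [simulate_regret([0.0, 0.5], horizon=100, seed=s) for s in range(200)]
    mean_realized = sum(r for r, _ in runs) / len(runs)
    mean_pseudo = sum(p for _, p in runs) / len(runs)
    print(mean_realized, mean_pseudo)
```

The two quantities have the same expectation, so their averages over many experiments should be close. If they differ systematically, the regret bookkeeping is worth a second look.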

varun19299/CS6046-MRAS-Bandits

Folders and files

Latest commit

History

Repository files navigation

MRAS Bandits

Project Proposal and Report

TODO

Install and Run

Benchmark Algorithms

Simulation Experiments

Adding a new algorithm

Working with configs

Critical Checks

About

Resources

Stars

Watchers

Forks

Languages