Parallel Programming Project

Playing Taiko by parallel reinforcement learning

The participants

0856071 謝秉瑾
0856108 謝宗祐
0856160 洪鈺恆

Introduction

In this project we will parallelly train multiple reinforcement learning agents to play taiko.

Statement of the problem

Most reinforcement learning environment uses OpenAi atari gym environment. Because OpenAi gym had already parallelized well for each environment, all we have to do is parallelly collect data from each environment. However, in the real case, we have to parallelly control each environment and prevent data race and so on. It is more complex than using OpenAi atari gym environment.

Proposed approaches

Language selection

python threadind
tensorflow

Related work

use deep q-learning playing atari policy proximal optimization

Statement of expected results

We can parallelly train and play taiko.

A timetable

10/29~11/13 Do research on taiko web and reinforcement methods.
11/13~11/20 Choose which reinforcement method to use and implement single thread version for this project.
11/20~11/27 Find which part of code can be parallelized.
11/27~12/10 Implement parallel training of taiko.
12/10~ Train agents.

References

https://github.com/bui/taiko-web https://github.com/openai/baselines

Requirements

pip3 install -r requirements.txt

TO DO

offline GAME
single thread running(4500 episodes, converge time => 15:31~17:32)
multi-thread(3500 episodes, converge time => 14:29~15:25)
play on taiko-web
Final report

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
UnitySDK		UnitySDK
multi_thread_Model		multi_thread_Model
single_thread_Model		single_thread_Model
test.app/Contents		test.app/Contents
.gitignore		.gitignore
README.md		README.md
ppo.py		ppo.py
ppo_singleThread.py		ppo_singleThread.py
rec.py		rec.py
requirements.txt		requirements.txt
testauto.py		testauto.py
testppo.py		testppo.py
testppo2.py		testppo2.py
winEnv.zip		winEnv.zip

Bing-Jing/PPFinal

Folders and files

Latest commit

History

Repository files navigation

Parallel Programming Project

Playing Taiko by parallel reinforcement learning

The participants

Introduction

Statement of the problem

Proposed approaches

Language selection

Related work

Statement of expected results

A timetable

References

Requirements

TO DO

About

Resources

Stars

Watchers

Forks

Languages