micro.ai

Creating the world's first autonomous micro-robot!

Introduction

The goal of this project is to investigate different Reinforcement Learning (RL) architectures and techniques against scenarios that have relevance to the field of micro robotics.

This project is specifically concerned with the control of magnetic micro and nano-robots. These robots consist of magnetic material that are controlled by surrounding the workspace with energized coils of wire to generate a magnetic field that acts on the robot. This project makes the simplification of assuming that the robot only moves in 2 dimensions.

Scenarios

Sorting Task

A magnetic micro-robot must sort particles to the left and right sides of the workspace depending on each particle's class.

Solved with DDPG and Experience Replay.

Assembly Task

In progress.

Brains

Duelling DQN

Double Duelling DQN [1] features two estimators: the action advantage function and the satte value function.

DDPG

Deep Deterministic Policy Gradient (DDPG) [2] uses a model-free, actor-critic algorithm that can successfully learn control policies operating over a continuous action space.

Memories

Experience Replay

Experience Replay stores past agent experiences and randomly samples from them to perform network updates. First introduced in [3].

Prioritized Experience Replay

In progress. (improving efficiency)

A3C

In progress.

References:

[1] Wang, Ziyu, et al. "Dueling network architectures for deep reinforcement learning." arXiv preprint arXiv:1511.06581 (2015).

[2] Lillicrap, Timothy P., et al. "Continuous control with deep reinforcement learning." arXiv preprint arXiv:1509.02971 (2015).

[3] Lin, Long-Ji. Reinforcement learning for robots using neural networks. No. CMU-CS-93-103. Carnegie-Mellon Univ Pittsburgh PA School of Computer Science, 1993.

[4] Diller, Eric, and Metin Sitti. "Micro-scale mobile robotics." Foundations and Trends® in Robotics 2.3 (2013): 143-259.

Reinforcement Learning Resources:

https://medium.com/@awjuliani/simple-reinforcement-learning-with-tensorflow-part-4-deep-q-networks-and-beyond-8438a3e2b8df#.yit72xseu

http://cs.stanford.edu/people/karpathy/reinforcejs/waterworld.html

http://pemami4911.github.io/blog/2016/08/21/ddpg-rl.html

https://jaromiru.com/2016/10/03/lets-make-a-dqn-implementation/

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
brains		brains
checkpoints		checkpoints
custom_env		custom_env
media/images		media/images
memories		memories
.gitignore		.gitignore
README.md		README.md
agent.py		agent.py
run.py		run.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

brains

brains

checkpoints

checkpoints

custom_env

custom_env

media/images

media/images

memories

memories

.gitignore

.gitignore

README.md

README.md

agent.py

agent.py

run.py

run.py

Repository files navigation

micro.ai

Introduction

Scenarios

Sorting Task

Assembly Task

Brains

Duelling DQN

DDPG

Memories

Experience Replay

Prioritized Experience Replay

A3C

References:

Reinforcement Learning Resources:

About

Releases

Packages

Languages

jflafleche/micro.ai

Folders and files

Latest commit

History

Repository files navigation

micro.ai

Introduction

Scenarios

Sorting Task

Assembly Task

Brains

Duelling DQN

DDPG

Memories

Experience Replay

Prioritized Experience Replay

A3C

References:

Reinforcement Learning Resources:

About

Resources

Stars

Watchers

Forks

Languages