Skip to content

shincling/Deep-Rein4cement

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Deep-Rein4cement

Deep Reinforcement Learning related works.

##深度强化学习导引: http://mp.weixin.qq.com/s?__biz=MzI1NTE4NTUwOQ==&mid=2650324914&idx=1&sn=0baaf404b3d8132243d08b55310de210&scene=2&srcid=062732p5u33RRNIKUeDSlvXN&from=timeline&isappinstalled=0#wechat_redirect

##详解深度强化学习,搭建DQN详细指南(基于Neon框架): https://mp.weixin.qq.com/s?__biz=MzA3MzI4MjgzMw==&mid=2650716425&idx=1&sn=bf52c653b7cd054ce721ce5be928c623

##《Multiagent Cooperation and Competition with Deep Reinforcement Learning》Ardi Tampuu, Tambet Matiisen 15年11月份,是在deepMind Q-learning的基础上的一个扩展 http://arxiv.org/abs/1511.08779

##深度强化学习导引: https://mp.weixin.qq.com/s?__biz=MzA3MzI4MjgzMw==&mid=2650716246&idx=2&sn=2c328097a95839871c8c91c5c5af9de5

##《Learning to Optimize》 强化学习的一个应用,把学习优化的过程加入了某些奖惩策略,利用强化学习的方式学习优化的方式,可参考 http://arxiv.org/abs/1606.01885 解读文章: http://weibo.com/ttarticle/p/show?id=2309403985644224393104

##《Deep Reinforcement Learning 深度增强学习资源 https://zhuanlan.zhihu.com/p/20885568

##《Dueling Network Architectures for Deep Reinforcement Learning》 Google DeepMind; University of Oxford; 15年11月 ,被引用10次以上 http://arxiv.org/abs/1511.06581

##Yoshua Bengio 最新论文:用于序列预测的actor-critic算法 http://t.cn/RtV9tL6 原文:http://arxiv.org/abs/1607.07086 摘要 提出了一种训练神经网络的方法以使用来自强化学习的 actor-critic 方法来生成序列。

##另外:ICML16强化学习相关论文24篇 http://weibo.com/p/1001603975123651678749

##更新:David Silver 的课程 http://www0.cs.ucl.ac.uk/staff/d.silver/web/Teaching.html 中文翻译:http://chenrudan.github.io/archives/

##Github的一个rl库推荐: https://github.com/matthiasplappert/keras-rl

Some Papers in NLP

##Language Understanding for Text-based Games Using Deep Reinforcement Learning http://arxiv.org/pdf/1506.08941

##Deep Reinforcement Learning with an Action Space Defined by Natural Language http://arxiv.org/abs/1511.04636

About

Deep Reinforcement Learning related works.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages