Deep Q Network

Deep Q Network

Implementing Mnih et al. 2015

Alexandre Beaulne ID 260234941
Amina Madzhun ID 260844950
Apr 26 2018

Introduction

"Human-level control through deep reinforcement learning", Mnih et al., Nature 2015
Get RL agent to play Atari games with no knowledge beside screen action and rewards

Motivation

Learn about applying RL to more complex problems from an engineering perspective
Learn how deep learning and RL tie up in practice

Results

Before training:	After training:

Results

Before training	After training

Results

Conclusion

Deep RL is hard!
Debugging is difficult
Convergence of learning very sensitive to hyperparameters (notably ε)
Even when good hyperparameters are found, training takes a lot of time

Future Work

Learn what to look for when training: how to debug and measure progress
How to craft more sensible action space
Use ConvNet with more capacity
Use RNN (e.g. LSTM) instead of stacked frames to learn causation