https://wagonhelm.github.io/posts/deep-policy-gradients-w-tensorflow/
Deep Policy Gradients w/ Tensorflow - Justin's Blog