https://www.pylessons.com/PPO-reinforcement-learning
New tutorial: