https://gopikrishtummala.github.io/posts/reinforcement-learning-intuition-to-algorithms/