https://bnet339.com/blog/q-learning-algorithm/