https://123dok.net/document/q0575k2v-analysis-of-classification-based-policy-iteration-algorithms.html