https://5dok.net/document/oz1mpnez-stopping-policy-iteration-algorithm-average-markov-decision-processes.html