https://123dok.net/document/zx568g4q-direct-policy-iteration-with-demonstrations.html