https://123dok.net/document/qo5l427y-messi-maximum-entropy-semi-supervised-inverse-reinforcement-learning.html