https://123dok.net/document/q5m331pg-reinforcement-learning-for-optimization-of-covid-mitigation-policies.html