https://123dok.net/document/yevnx5o0-finite-time-bounds-sampling-based-fitted-value-iteration.html