https://1library.net/document/q747npnq-iterative-amortized-policy-optimization.html