https://mysteriousl2019.github.io/posts/LLM-RLHF-inference/
LLM Reasoning Models comparison - Li Fangzheng