Transformer Reinforcement Learning
Leandro von Werra, GitHub: https://github.com/lvwerra/trl
#ProximalPolicyOptimization #Transformer #ReinforcementLearning
GitHub - huggingface/trl: Train transformer language models with reinforcement learning.