denisyarats/drq
DrQ: Data regularized Q
Language: Python
#actor_critic #control #deep_learning #deep_reinforcement_learning #dm_control #drq #gym #mujoco #pixel #python #pytorch #reinforcement_learning #rl #sac #soft_actor_crit
Stars: 122 Issues: 0 Forks: 8
https://github.com/denisyarats/drq
DrQ: Data regularized Q
Language: Python
#actor_critic #control #deep_learning #deep_reinforcement_learning #dm_control #drq #gym #mujoco #pixel #python #pytorch #reinforcement_learning #rl #sac #soft_actor_crit
Stars: 122 Issues: 0 Forks: 8
https://github.com/denisyarats/drq
GitHub
GitHub - denisyarats/drq: DrQ: Data regularized Q
DrQ: Data regularized Q. Contribute to denisyarats/drq development by creating an account on GitHub.
ugurkanates/awesome-real-world-rl
Great resources for making Reinforcement Learning work in Real Life situations. Papers,projects and more.
#awesome_list #gans #imitation_learning #meta_learning #reinforcement_learning #robotics #sim2real #simulation
Stars: 107 Issues: 0 Forks: 9
https://github.com/ugurkanates/awesome-real-world-rl
Great resources for making Reinforcement Learning work in Real Life situations. Papers,projects and more.
#awesome_list #gans #imitation_learning #meta_learning #reinforcement_learning #robotics #sim2real #simulation
Stars: 107 Issues: 0 Forks: 9
https://github.com/ugurkanates/awesome-real-world-rl
GitHub
GitHub - ugurkanates/awesome-real-world-rl: Great resources for making Reinforcement Learning work in Real Life situations. Papers…
Great resources for making Reinforcement Learning work in Real Life situations. Papers,projects and more. - GitHub - ugurkanates/awesome-real-world-rl: Great resources for making Reinforcement Lea...
cool-RR/grid_royale
A life simulation for exploring social dynamics
Language: Python
#ai #hacktoberfest #keras #machine_learning #python #q_learning #reinforcement_learning
Stars: 164 Issues: 11 Forks: 18
https://github.com/cool-RR/grid_royale
A life simulation for exploring social dynamics
Language: Python
#ai #hacktoberfest #keras #machine_learning #python #q_learning #reinforcement_learning
Stars: 164 Issues: 11 Forks: 18
https://github.com/cool-RR/grid_royale
GitHub
GitHub - cool-RR/marley: A framework for multi-agent reinforcement learning.
A framework for multi-agent reinforcement learning. - GitHub - cool-RR/marley: A framework for multi-agent reinforcement learning.
huggingface/deep-rl-class
This repo contain the syllabus of the Hugging Face Deep Reinforcement Learning Class.
#deep_reinforcement_learning #reinforcement_learning #reinforcement_learning_excercises
Stars: 307 Issues: 1 Forks: 16
https://github.com/huggingface/deep-rl-class
This repo contain the syllabus of the Hugging Face Deep Reinforcement Learning Class.
#deep_reinforcement_learning #reinforcement_learning #reinforcement_learning_excercises
Stars: 307 Issues: 1 Forks: 16
https://github.com/huggingface/deep-rl-class
GitHub
GitHub - huggingface/deep-rl-class: This repo contains the Hugging Face Deep Reinforcement Learning Course.
This repo contains the Hugging Face Deep Reinforcement Learning Course. - huggingface/deep-rl-class
zubair-irshad/Awesome-Implicit-NeRF-Robotics
A comprehensive list of Implicit Representations and NeRF papers relating to Robotics/RL domain, including papers, codes, and related websites
#computer_vision #dynamics #implicit_representations #manipulation #navigation #nerf #planning #pose_estimation #reinforcement_learning #robotics #slam
Stars: 177 Issues: 1 Forks: 5
https://github.com/zubair-irshad/Awesome-Implicit-NeRF-Robotics
A comprehensive list of Implicit Representations and NeRF papers relating to Robotics/RL domain, including papers, codes, and related websites
#computer_vision #dynamics #implicit_representations #manipulation #navigation #nerf #planning #pose_estimation #reinforcement_learning #robotics #slam
Stars: 177 Issues: 1 Forks: 5
https://github.com/zubair-irshad/Awesome-Implicit-NeRF-Robotics
GitHub
GitHub - zubair-irshad/Awesome-Implicit-NeRF-Robotics: A comprehensive list of Implicit Representations and NeRF papers relating…
A comprehensive list of Implicit Representations and NeRF papers relating to Robotics/RL domain, including papers, codes, and related websites - zubair-irshad/Awesome-Implicit-NeRF-Robotics
PKU-Alignment/safe-rlhf
Safe-RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback
Language: Python
#ai_safety #alpaca #datasets #deepspeed #large_language_models #llama #llm #llms #reinforcement_learning #reinforcement_learning_from_human_feedback #rlhf #safe_reinforcement_learning #safe_reinforcement_learning_from_human_feedback #safe_rlhf #safety #transformers #vicuna
Stars: 279 Issues: 0 Forks: 14
https://github.com/PKU-Alignment/safe-rlhf
Safe-RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback
Language: Python
#ai_safety #alpaca #datasets #deepspeed #large_language_models #llama #llm #llms #reinforcement_learning #reinforcement_learning_from_human_feedback #rlhf #safe_reinforcement_learning #safe_reinforcement_learning_from_human_feedback #safe_rlhf #safety #transformers #vicuna
Stars: 279 Issues: 0 Forks: 14
https://github.com/PKU-Alignment/safe-rlhf
GitHub
GitHub - PKU-Alignment/safe-rlhf: Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback
Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback - PKU-Alignment/safe-rlhf
mihirp1998/AlignProp
AlignProp uses direct reward backpropogation for the alignment of large-scale text-to-image diffusion models. Our method is 25x more sample and compute efficient than reinforcement learning methods (PPO) for finetuning Stable Diffusion
Language: Python
#alignment #diffusion_models #reinforcement_learning #stable_diffusion #text_to_image
Stars: 104 Issues: 4 Forks: 1
https://github.com/mihirp1998/AlignProp
AlignProp uses direct reward backpropogation for the alignment of large-scale text-to-image diffusion models. Our method is 25x more sample and compute efficient than reinforcement learning methods (PPO) for finetuning Stable Diffusion
Language: Python
#alignment #diffusion_models #reinforcement_learning #stable_diffusion #text_to_image
Stars: 104 Issues: 4 Forks: 1
https://github.com/mihirp1998/AlignProp
GitHub
GitHub - mihirp1998/AlignProp: AlignProp uses direct reward backpropogation for the alignment of large-scale text-to-image diffusion…
AlignProp uses direct reward backpropogation for the alignment of large-scale text-to-image diffusion models. Our method is 25x more sample and compute efficient than reinforcement learning methods...
AgibotTech/agibot_x1_train
The reinforcement learning training code for AgiBot X1.
Language: Python
#open_source #reinforcement_learning #robotics
Stars: 763 Issues: 2 Forks: 235
https://github.com/AgibotTech/agibot_x1_train
The reinforcement learning training code for AgiBot X1.
Language: Python
#open_source #reinforcement_learning #robotics
Stars: 763 Issues: 2 Forks: 235
https://github.com/AgibotTech/agibot_x1_train
GitHub
GitHub - AgibotTech/agibot_x1_train: The reinforcement learning training code for AgiBot X1.
The reinforcement learning training code for AgiBot X1. - AgibotTech/agibot_x1_train
Gen-Verse/ReasonFlux
ReasonFlux-32B beats o1-preview and DeepSeek-V3 with only 500 thought templates
Language: Python
#chain_of_thought #deepseek_r1 #deepseek_v3 #llm_rlhf #o1_mini #o1_preview #reinforcement_learning #sft_data
Stars: 194 Issues: 2 Forks: 10
https://github.com/Gen-Verse/ReasonFlux
ReasonFlux-32B beats o1-preview and DeepSeek-V3 with only 500 thought templates
Language: Python
#chain_of_thought #deepseek_r1 #deepseek_v3 #llm_rlhf #o1_mini #o1_preview #reinforcement_learning #sft_data
Stars: 194 Issues: 2 Forks: 10
https://github.com/Gen-Verse/ReasonFlux
GitHub
GitHub - Gen-Verse/ReasonFlux: ReasonFlux - Open-Sourced Strong Reasoning Model Series
ReasonFlux - Open-Sourced Strong Reasoning Model Series - Gen-Verse/ReasonFlux
FareedKhan-dev/all-rl-algorithms
Implementation of all RL algorithms in a simpler way
Language: Jupyter Notebook
#agent #llm #openai #python #reinforcement_learning #rl
Stars: 240 Issues: 0 Forks: 19
https://github.com/FareedKhan-dev/all-rl-algorithms
Implementation of all RL algorithms in a simpler way
Language: Jupyter Notebook
#agent #llm #openai #python #reinforcement_learning #rl
Stars: 240 Issues: 0 Forks: 19
https://github.com/FareedKhan-dev/all-rl-algorithms
GitHub
GitHub - FareedKhan-dev/all-rl-algorithms: Implementation of all RL algorithms in a simpler way
Implementation of all RL algorithms in a simpler way - FareedKhan-dev/all-rl-algorithms