udacity/deep-reinforcement-learning
Repo for the Deep Reinforcement Learning Nanodegree program
Language: Jupyter Notebook
#cross_entropy #ddpg #deep_reinforcement_learning #dqn #dynamic_programming #hill_climbing #ml_agents #neural_networks #openai_gym #openai_gym_solutions #ppo #pytorch #pytorch_rl #reinforcement_learning #reinforcement_learning_algorithms #rl_algorithms
Stars: 160 Issues: 2 Forks: 36
https://github.com/udacity/deep-reinforcement-learning
  
  Repo for the Deep Reinforcement Learning Nanodegree program
Language: Jupyter Notebook
#cross_entropy #ddpg #deep_reinforcement_learning #dqn #dynamic_programming #hill_climbing #ml_agents #neural_networks #openai_gym #openai_gym_solutions #ppo #pytorch #pytorch_rl #reinforcement_learning #reinforcement_learning_algorithms #rl_algorithms
Stars: 160 Issues: 2 Forks: 36
https://github.com/udacity/deep-reinforcement-learning
GitHub
  
  GitHub - udacity/deep-reinforcement-learning: Repo for the Deep Reinforcement Learning Nanodegree program
  Repo for the Deep Reinforcement Learning Nanodegree program - udacity/deep-reinforcement-learning
  astorfi/Deep-Learning-World
:satellite: Organized Resources for Deep Learning Researchers and Developers
Language: Python
#deep_learning #reinforcement_learning
Stars: 586 Issues: 0 Forks: 44
https://github.com/astorfi/Deep-Learning-World
  
  :satellite: Organized Resources for Deep Learning Researchers and Developers
Language: Python
#deep_learning #reinforcement_learning
Stars: 586 Issues: 0 Forks: 44
https://github.com/astorfi/Deep-Learning-World
GitHub
  
  GitHub - astorfi/Deep-Learning-Roadmap: :satellite: Organized Resources for Deep Learning Researchers and Developers
  :satellite: Organized Resources for Deep Learning Researchers and Developers - astorfi/Deep-Learning-Roadmap
  andri27-ts/60_Days_RL_Challenge
Learn Deep Reinforcement Learning in depth in 60 days
#artificial_intelligence #challenge #machine_learning #reinforcement_learning
Stars: 720 Issues: 0 Forks: 38
https://github.com/andri27-ts/60_Days_RL_Challenge
  
  Learn Deep Reinforcement Learning in depth in 60 days
#artificial_intelligence #challenge #machine_learning #reinforcement_learning
Stars: 720 Issues: 0 Forks: 38
https://github.com/andri27-ts/60_Days_RL_Challenge
GitHub
  
  GitHub - andri27-ts/Reinforcement-Learning: Learn Deep Reinforcement Learning in 60 days! Lectures & Code in Python. Reinforcement…
  Learn Deep Reinforcement Learning in 60 days! Lectures & Code in Python. Reinforcement Learning + Deep Learning - GitHub - andri27-ts/Reinforcement-Learning: Learn Deep Reinforcement Learni...
  danaugrs/huskarl
Parallel Deep Reinforcement Learning Framework
Language: Python
#algorithms #artificial_intelligence #deep_learning #python #reinforcement_learning #tensorflow
Stars: 84 Issues: 0 Forks: 5
https://github.com/danaugrs/huskarl
  
  Parallel Deep Reinforcement Learning Framework
Language: Python
#algorithms #artificial_intelligence #deep_learning #python #reinforcement_learning #tensorflow
Stars: 84 Issues: 0 Forks: 5
https://github.com/danaugrs/huskarl
GitHub
  
  GitHub - danaugrs/huskarl: Deep Reinforcement Learning Framework + Algorithms
  Deep Reinforcement Learning Framework + Algorithms - danaugrs/huskarl
  vietnguyen91/Super-mario-bros-A3C-pytorch
Asynchronous Advantage Actor-Critic (A3C) algorithm for Super Mario Bros
Language: Python
#a3c #deep_learning #gym #python #pytorch #reinforcement_learning
Stars: 152 Issues: 0 Forks: 30
https://github.com/vietnguyen91/Super-mario-bros-A3C-pytorch
  
  Asynchronous Advantage Actor-Critic (A3C) algorithm for Super Mario Bros
Language: Python
#a3c #deep_learning #gym #python #pytorch #reinforcement_learning
Stars: 152 Issues: 0 Forks: 30
https://github.com/vietnguyen91/Super-mario-bros-A3C-pytorch
GitHub
  
  uvipen/Super-mario-bros-A3C-pytorch
  Asynchronous Advantage Actor-Critic (A3C) algorithm for Super Mario Bros - uvipen/Super-mario-bros-A3C-pytorch
  uvipen/Tetris-deep-Q-learning-pytorch
Deep Q-learning for playing tetris game
Language: Python
#cv2 #deep_q_learning #deep_q_network #pytorch #reinforcement_learning
Stars: 202 Issues: 2 Forks: 31
https://github.com/uvipen/Tetris-deep-Q-learning-pytorch
  
  Deep Q-learning for playing tetris game
Language: Python
#cv2 #deep_q_learning #deep_q_network #pytorch #reinforcement_learning
Stars: 202 Issues: 2 Forks: 31
https://github.com/uvipen/Tetris-deep-Q-learning-pytorch
GitHub
  
  GitHub - uvipen/Tetris-deep-Q-learning-pytorch: Deep Q-learning for playing tetris game
  Deep Q-learning for playing tetris game. Contribute to uvipen/Tetris-deep-Q-learning-pytorch development by creating an account on GitHub.
  denisyarats/drq
DrQ: Data regularized Q
Language: Python
#actor_critic #control #deep_learning #deep_reinforcement_learning #dm_control #drq #gym #mujoco #pixel #python #pytorch #reinforcement_learning #rl #sac #soft_actor_crit
Stars: 122 Issues: 0 Forks: 8
https://github.com/denisyarats/drq
  
  DrQ: Data regularized Q
Language: Python
#actor_critic #control #deep_learning #deep_reinforcement_learning #dm_control #drq #gym #mujoco #pixel #python #pytorch #reinforcement_learning #rl #sac #soft_actor_crit
Stars: 122 Issues: 0 Forks: 8
https://github.com/denisyarats/drq
GitHub
  
  GitHub - denisyarats/drq: DrQ: Data regularized Q
  DrQ: Data regularized Q. Contribute to denisyarats/drq development by creating an account on GitHub.
  ugurkanates/awesome-real-world-rl
Great resources for making Reinforcement Learning work in Real Life situations. Papers,projects and more.
#awesome_list #gans #imitation_learning #meta_learning #reinforcement_learning #robotics #sim2real #simulation
Stars: 107 Issues: 0 Forks: 9
https://github.com/ugurkanates/awesome-real-world-rl
  
  Great resources for making Reinforcement Learning work in Real Life situations. Papers,projects and more.
#awesome_list #gans #imitation_learning #meta_learning #reinforcement_learning #robotics #sim2real #simulation
Stars: 107 Issues: 0 Forks: 9
https://github.com/ugurkanates/awesome-real-world-rl
GitHub
  
  GitHub - ugurkanates/awesome-real-world-rl: Great resources for making Reinforcement Learning work in Real Life situations. Papers…
  Great resources for making Reinforcement Learning work in Real Life situations. Papers,projects and more.  - GitHub - ugurkanates/awesome-real-world-rl: Great resources for making Reinforcement Lea...
  cool-RR/grid_royale
A life simulation for exploring social dynamics
Language: Python
#ai #hacktoberfest #keras #machine_learning #python #q_learning #reinforcement_learning
Stars: 164 Issues: 11 Forks: 18
https://github.com/cool-RR/grid_royale
  
  A life simulation for exploring social dynamics
Language: Python
#ai #hacktoberfest #keras #machine_learning #python #q_learning #reinforcement_learning
Stars: 164 Issues: 11 Forks: 18
https://github.com/cool-RR/grid_royale
GitHub
  
  GitHub - cool-RR/marley: A framework for multi-agent reinforcement learning.
  A framework for multi-agent reinforcement learning. - GitHub - cool-RR/marley: A framework for multi-agent reinforcement learning.
  huggingface/deep-rl-class
This repo contain the syllabus of the Hugging Face Deep Reinforcement Learning Class.
#deep_reinforcement_learning #reinforcement_learning #reinforcement_learning_excercises
Stars: 307 Issues: 1 Forks: 16
https://github.com/huggingface/deep-rl-class
  
  This repo contain the syllabus of the Hugging Face Deep Reinforcement Learning Class.
#deep_reinforcement_learning #reinforcement_learning #reinforcement_learning_excercises
Stars: 307 Issues: 1 Forks: 16
https://github.com/huggingface/deep-rl-class
GitHub
  
  GitHub - huggingface/deep-rl-class: This repo contains the Hugging Face Deep Reinforcement Learning Course.
  This repo contains the Hugging Face Deep Reinforcement Learning Course. - huggingface/deep-rl-class
❤3
  zubair-irshad/Awesome-Implicit-NeRF-Robotics
A comprehensive list of Implicit Representations and NeRF papers relating to Robotics/RL domain, including papers, codes, and related websites
#computer_vision #dynamics #implicit_representations #manipulation #navigation #nerf #planning #pose_estimation #reinforcement_learning #robotics #slam
Stars: 177 Issues: 1 Forks: 5
https://github.com/zubair-irshad/Awesome-Implicit-NeRF-Robotics
  
  A comprehensive list of Implicit Representations and NeRF papers relating to Robotics/RL domain, including papers, codes, and related websites
#computer_vision #dynamics #implicit_representations #manipulation #navigation #nerf #planning #pose_estimation #reinforcement_learning #robotics #slam
Stars: 177 Issues: 1 Forks: 5
https://github.com/zubair-irshad/Awesome-Implicit-NeRF-Robotics
GitHub
  
  GitHub - zubair-irshad/Awesome-Implicit-NeRF-Robotics: A comprehensive list of Implicit Representations and NeRF papers relating…
  A comprehensive list of Implicit Representations and NeRF papers relating to Robotics/RL domain, including papers, codes, and related websites - zubair-irshad/Awesome-Implicit-NeRF-Robotics
  PKU-Alignment/safe-rlhf
Safe-RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback
Language: Python
#ai_safety #alpaca #datasets #deepspeed #large_language_models #llama #llm #llms #reinforcement_learning #reinforcement_learning_from_human_feedback #rlhf #safe_reinforcement_learning #safe_reinforcement_learning_from_human_feedback #safe_rlhf #safety #transformers #vicuna
Stars: 279 Issues: 0 Forks: 14
https://github.com/PKU-Alignment/safe-rlhf
  
  Safe-RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback
Language: Python
#ai_safety #alpaca #datasets #deepspeed #large_language_models #llama #llm #llms #reinforcement_learning #reinforcement_learning_from_human_feedback #rlhf #safe_reinforcement_learning #safe_reinforcement_learning_from_human_feedback #safe_rlhf #safety #transformers #vicuna
Stars: 279 Issues: 0 Forks: 14
https://github.com/PKU-Alignment/safe-rlhf
GitHub
  
  GitHub - PKU-Alignment/safe-rlhf: Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback
  Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback - PKU-Alignment/safe-rlhf
👍2👏1
  mihirp1998/AlignProp
AlignProp uses direct reward backpropogation for the alignment of large-scale text-to-image diffusion models. Our method is 25x more sample and compute efficient than reinforcement learning methods (PPO) for finetuning Stable Diffusion
Language: Python
#alignment #diffusion_models #reinforcement_learning #stable_diffusion #text_to_image
Stars: 104 Issues: 4 Forks: 1
https://github.com/mihirp1998/AlignProp
  
  AlignProp uses direct reward backpropogation for the alignment of large-scale text-to-image diffusion models. Our method is 25x more sample and compute efficient than reinforcement learning methods (PPO) for finetuning Stable Diffusion
Language: Python
#alignment #diffusion_models #reinforcement_learning #stable_diffusion #text_to_image
Stars: 104 Issues: 4 Forks: 1
https://github.com/mihirp1998/AlignProp
GitHub
  
  GitHub - mihirp1998/AlignProp: AlignProp uses direct reward backpropogation for the alignment of large-scale text-to-image diffusion…
  AlignProp uses direct reward backpropogation for the alignment of large-scale text-to-image diffusion models. Our method is 25x more sample and compute efficient than reinforcement learning methods...
👍2
  AgibotTech/agibot_x1_train
The reinforcement learning training code for AgiBot X1.
Language: Python
#open_source #reinforcement_learning #robotics
Stars: 763 Issues: 2 Forks: 235
https://github.com/AgibotTech/agibot_x1_train
  
  The reinforcement learning training code for AgiBot X1.
Language: Python
#open_source #reinforcement_learning #robotics
Stars: 763 Issues: 2 Forks: 235
https://github.com/AgibotTech/agibot_x1_train
GitHub
  
  GitHub - AgibotTech/agibot_x1_train: The reinforcement learning training code for AgiBot X1.
  The reinforcement learning training code for AgiBot X1. - AgibotTech/agibot_x1_train
  Gen-Verse/ReasonFlux
ReasonFlux-32B beats o1-preview and DeepSeek-V3 with only 500 thought templates
Language: Python
#chain_of_thought #deepseek_r1 #deepseek_v3 #llm_rlhf #o1_mini #o1_preview #reinforcement_learning #sft_data
Stars: 194 Issues: 2 Forks: 10
https://github.com/Gen-Verse/ReasonFlux
  
  ReasonFlux-32B beats o1-preview and DeepSeek-V3 with only 500 thought templates
Language: Python
#chain_of_thought #deepseek_r1 #deepseek_v3 #llm_rlhf #o1_mini #o1_preview #reinforcement_learning #sft_data
Stars: 194 Issues: 2 Forks: 10
https://github.com/Gen-Verse/ReasonFlux
GitHub
  
  GitHub - Gen-Verse/ReasonFlux: ReasonFlux Series - A family of LLM post-training algorithms focusing on data selection, reinforcement…
  ReasonFlux Series - A family of LLM post-training algorithms focusing on data selection, reinforcement learning, and inference scaling - Gen-Verse/ReasonFlux
👍1
  