GitHub repos – Telegram

GitHub repos

26K subscribers

18 photos

2 videos

11.4K links

Welcome to GitHub repos. Here you'll find valuable information on the latest trending projects. Subscribe to stay informed and gain insights from the thriving GitHub community.

denisyarats/drq
DrQ: Data regularized Q
Language: Python
#actor_critic #control #deep_learning #deep_reinforcement_learning #dm_control #drq #gym #mujoco #pixel #python #pytorch #reinforcement_learning #rl #sac #soft_actor_crit
Stars: 122 Issues: 0 Forks: 8
https://github.com/denisyarats/drq

1.84K views · 21:54

ugurkanates/awesome-real-world-rl
Great resources for making Reinforcement Learning work in real-life situations. Papers, projects, and more.
#awesome_list #gans #imitation_learning #meta_learning #reinforcement_learning #robotics #sim2real #simulation
Stars: 107 Issues: 0 Forks: 9
https://github.com/ugurkanates/awesome-real-world-rl

2.06K views · 03:54

cool-RR/grid_royale
A life simulation for exploring social dynamics
Language: Python
#ai #hacktoberfest #keras #machine_learning #python #q_learning #reinforcement_learning
Stars: 164 Issues: 11 Forks: 18
https://github.com/cool-RR/grid_royale

GitHub - cool-RR/marley: A framework for multi-agent reinforcement learning.

2.43K views · 15:53

huggingface/deep-rl-class
This repo contains the syllabus of the Hugging Face Deep Reinforcement Learning Class.
#deep_reinforcement_learning #reinforcement_learning #reinforcement_learning_excercises
Stars: 307 Issues: 1 Forks: 16
https://github.com/huggingface/deep-rl-class

GitHub - huggingface/deep-rl-class: This repo contains the Hugging Face Deep Reinforcement Learning Course.

❤3

2.15K views · 10:13

zubair-irshad/Awesome-Implicit-NeRF-Robotics
A comprehensive list of Implicit Representations and NeRF papers relating to the Robotics/RL domain, including papers, code, and related websites.
#computer_vision #dynamics #implicit_representations #manipulation #navigation #nerf #planning #pose_estimation #reinforcement_learning #robotics #slam
Stars: 177 Issues: 1 Forks: 5
https://github.com/zubair-irshad/Awesome-Implicit-NeRF-Robotics

2.15K views · 10:16

PKU-Alignment/safe-rlhf
Safe-RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback
Language: Python
#ai_safety #alpaca #datasets #deepspeed #large_language_models #llama #llm #llms #reinforcement_learning #reinforcement_learning_from_human_feedback #rlhf #safe_reinforcement_learning #safe_reinforcement_learning_from_human_feedback #safe_rlhf #safety #transformers #vicuna
Stars: 279 Issues: 0 Forks: 14
https://github.com/PKU-Alignment/safe-rlhf

👍2👏1

2.32K views · 16:11

mihirp1998/AlignProp
AlignProp uses direct reward backpropagation for the alignment of large-scale text-to-image diffusion models. Our method is 25x more sample- and compute-efficient than reinforcement learning methods (PPO) for finetuning Stable Diffusion.
Language: Python
#alignment #diffusion_models #reinforcement_learning #stable_diffusion #text_to_image
Stars: 104 Issues: 4 Forks: 1
https://github.com/mihirp1998/AlignProp

👍2

2.07K views · 04:19

AgibotTech/agibot_x1_train
The reinforcement learning training code for AgiBot X1.
Language: Python
#open_source #reinforcement_learning #robotics
Stars: 763 Issues: 2 Forks: 235
https://github.com/AgibotTech/agibot_x1_train

1.8K views · 10:00

Gen-Verse/ReasonFlux
ReasonFlux-32B beats o1-preview and DeepSeek-V3 with only 500 thought templates
Language: Python
#chain_of_thought #deepseek_r1 #deepseek_v3 #llm_rlhf #o1_mini #o1_preview #reinforcement_learning #sft_data
Stars: 194 Issues: 2 Forks: 10
https://github.com/Gen-Verse/ReasonFlux

GitHub - Gen-Verse/ReasonFlux: ReasonFlux Series - A family of LLM post-training algorithms focusing on data selection, reinforcement learning, and inference scaling

👍1

1.68K views · 23:00

FareedKhan-dev/all-rl-algorithms
Implementation of all RL algorithms in a simpler way
Language: Jupyter Notebook
#agent #llm #openai #python #reinforcement_learning #rl
Stars: 240 Issues: 0 Forks: 19
https://github.com/FareedKhan-dev/all-rl-algorithms

1.71K views · 10:00

NVlabs/Long-RL
Long-RL: Scaling RL to Long Sequences
Language: Python
#efficient_ai #large_language_models #long_sequence #multi_modality #reinforcement_learning #sequence_parallelism
Stars: 301 Issues: 2 Forks: 3
https://github.com/NVlabs/Long-RL

1.39K views · 10:00