#python #atari #deepmind_lab #gcp #google_research_football #impala #r2d2 #rl #tf2
https://github.com/google-research/seed_rl
GitHub - google-research/seed_rl: SEED RL: Scalable and Efficient Deep-RL with Accelerated Central Inference. Implements IMPALA…
SEED RL: Scalable and Efficient Deep-RL with Accelerated Central Inference. Implements IMPALA and R2D2 algorithms in TF2 with SEED's architecture. - google-research/seed_rl
#python #alphago #alphazero #deep_learning #deep_reinforcement_learning #gym #machine_learning #mcts #model_based_rl #monte_carlo_tree_search #muzero #muzero_general #neural_network #python3 #pytorch #reinforcement_learning #residual_network #rl #self_learning #tensorboard
https://github.com/werner-duvaud/muzero-general
GitHub - werner-duvaud/muzero-general: MuZero
MuZero. Contribute to werner-duvaud/muzero-general development by creating an account on GitHub.
#python #deep_reinforcement_learning #gym #hyperparameter_optimization #hyperparameter_search #hyperparameter_tuning #lab #openai #optimization #pybullet #pybullet_environments #pytorch #reinforcement_learning #rl #robotics #sde #stable_baselines #tuning_hyperparameters
https://github.com/DLR-RM/rl-baselines3-zoo
GitHub - DLR-RM/rl-baselines3-zoo: A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter…
A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included. - DLR-RM/rl-baselines3-zoo
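For context, the zoo wraps the kind of training loop you would otherwise write by hand with Stable-Baselines3, adding tuned per-environment hyperparameters, experiment management, and Optuna-based hyperparameter search. A minimal hand-rolled SB3 run is shown below for comparison (the zoo's own CLI and flags are documented in its README):

```python
# Minimal Stable-Baselines3 training run, for comparison with what the zoo
# automates (tuned hyperparameters, logging, evaluation, hyperparameter search).
from stable_baselines3 import PPO

model = PPO("MlpPolicy", "CartPole-v1", verbose=1)  # env can be given by id
model.learn(total_timesteps=10_000)
model.save("ppo_cartpole")  # the zoo also ships pre-trained agents like this
```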
#python #ai #control #decision_making #distributed_computing #machine_learning #marl #model_based_reinforcement_learning #multi_agent_reinforcement_learning #pytorch #reinforcement_learning #rl #robotics #torch
https://github.com/pytorch/rl
GitHub - pytorch/rl: A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.
A modular, primitive-first, python-first PyTorch library for Reinforcement Learning. - pytorch/rl
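To give a flavor of the primitive-first design: environments produce TensorDicts, and a short rollout is a single call. A minimal sketch, assuming gymnasium is installed alongside torchrl (not a training loop):

```python
# Minimal TorchRL sketch: wrap a Gym environment and collect a short rollout
# as a TensorDict with nested keys such as "observation" and ("next", "reward").
from torchrl.envs import GymEnv

env = GymEnv("CartPole-v1")          # requires the gymnasium backend
reset_td = env.reset()               # TensorDict holding the initial observation
rollout = env.rollout(max_steps=10)  # uses random actions when no policy is given
print(rollout)                       # prints the rollout's batch size and keys
```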
#python #agent #agentic_ai #grpo #kimi_ai #llms #lora #qwen #qwen3 #reinforcement_learning #rl
ART is a tool that helps you train agents for real-world tasks with reinforcement learning, specifically the GRPO method. The standout feature is RULER, which lets you skip the hard work of designing reward functions: you describe the task in plain language, and a large language model scores how well your agent is doing. RULER works for any task and often performs as well as or better than hand-crafted rewards, which makes building and improving agents much faster. You can install ART with a single command and start training right away, on your own machine or with cloud resources. A generic sketch of the LLM-as-judge idea follows this entry.
https://github.com/OpenPipe/ART
GitHub - OpenPipe/ART: Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on…
Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen2.5, Qwen3, Llama, and more! - OpenPipe/ART
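A minimal sketch of the LLM-as-judge idea behind RULER, written against the plain OpenAI client; the task string, prompt, judge model, and function are illustrative assumptions, not ART's actual API (ART packages this together with a GRPO trainer):

```python
# Illustrative sketch only (not ART's API): an LLM judge scores a group of
# candidate agent trajectories against a plain-language task description,
# replacing a hand-written reward function.
import json
from openai import OpenAI

client = OpenAI()  # any OpenAI-compatible endpoint works here

TASK = "Find and summarize the three most recent emails about invoices."

def judge_trajectories(trajectories: list[str]) -> list[float]:
    """Return one 0-1 score per trajectory, as judged by the LLM."""
    prompt = (
        f"Task: {TASK}\n\n"
        "Score each agent trajectory from 0 (failed) to 1 (solved perfectly). "
        "Reply with a JSON list of numbers, one per trajectory.\n\n"
        + "\n\n".join(f"Trajectory {i}:\n{t}" for i, t in enumerate(trajectories))
    )
    resp = client.chat.completions.create(
        model="gpt-4o-mini",  # judge model is an assumption
        messages=[{"role": "user", "content": prompt}],
    )
    return json.loads(resp.choices[0].message.content)

# GRPO compares trajectories sampled for the same prompt against each other,
# so these relative scores can stand in for a reward signal.
```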
#python #gym #gym_environment #reinforcement_learning #reinforcement_learning_agent #reinforcement_learning_environments #rl_environment #rl_training
NeMo Gym helps you build and run reinforcement-learning training environments for large language models. You develop, test, and collect verified rollouts separately from the training loop, then integrate with your preferred RL framework and model endpoints (OpenAI, vLLM, etc.). It ships ready-made resource servers, datasets, and patterns for multi-step, multi-turn, and tool-using scenarios, runs on a typical dev machine (no GPU required), and is early-stage with evolving APIs and docs. The benefit: you can generate high-quality, verifiable training data faster and plug it into existing training pipelines to improve model behavior. A generic sketch of rollout collection follows this entry.
https://github.com/NVIDIA-NeMo/Gym
GitHub - NVIDIA-NeMo/Gym: Build RL environments for LLM training
Build RL environments for LLM training. Contribute to NVIDIA-NeMo/Gym development by creating an account on GitHub.
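As a generic illustration of the rollout-collection side (not NeMo Gym's actual API), the sketch below queries an OpenAI-compatible endpoint such as a local vLLM server, applies a stand-in verifier, and writes verified rollouts to JSONL for a downstream training pipeline; the URL, model name, and verifier are assumptions:

```python
# Illustrative sketch only (not NeMo Gym's API): collect rollouts from an
# OpenAI-compatible endpoint, keep only the ones a verifier accepts, and
# store them for later RL training.
import json
from openai import OpenAI

# A local vLLM server typically exposes an OpenAI-compatible API at this URL.
client = OpenAI(base_url="http://localhost:8000/v1", api_key="EMPTY")

def collect_rollout(prompt: str, model: str = "Qwen/Qwen2.5-7B-Instruct") -> dict:
    resp = client.chat.completions.create(
        model=model,
        messages=[{"role": "user", "content": prompt}],
    )
    return {"prompt": prompt, "completion": resp.choices[0].message.content}

def is_verified(rollout: dict) -> bool:
    # Stand-in check; a real resource server would verify tool results,
    # unit tests, exact answers, etc.
    return "42" in rollout["completion"]

rollouts = [collect_rollout("What is 6 * 7? Answer with just the number.")]
with open("verified_rollouts.jsonl", "w") as f:
    for r in filter(is_verified, rollouts):
        f.write(json.dumps(r) + "\n")
```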