Notice: This resource is provided by a third-party author. Please review the code with AI tools or manually before use to ensure security and compatibility.
Python | rllm-org/rllm

rllm

Democratizing Reinforcement Learning for LLMs

80.4/100
5.4K | Forks: 546

Similar Projects

metaflow

88

Build, Manage and Deploy AI/ML Systems

Python | 10.1K

verl-agent

65

verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. It is also the official code for the paper "Group-in-Group Policy Optimization for LLM Agent Training".

Python | 1.8K

ray

92

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python | 42.3K

stable-baselines3

91

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

Python | 13.1K