Back to List
Notice:This resource is provided by a third-party author. Please review the code with AI tools or manually before use to ensure security and compatibility.
Pythonrllm-org/rllm

rllm

Democratizing Reinforcement Learning for LLMs

80.5/100
5.2KForks: 512
View on GitHubHomepage →
Loading report...

Similar Projects

metaflow

89

Build, Manage and Deploy AI/ML Systems

Python9.9K

verl-agent

74

verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in-Group Policy Optimization for LLM Agent Training"

Python1.6K

ray

92

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python41.6K

stable-baselines3

87

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

Python12.8K
Back to List