Democratizing Reinforcement Learning for LLMs
Build, Manage and Deploy AI/ML Systems
verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in-Group Policy Optimization for LLM Agent Training"
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.