Build RL environments for LLM training
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)
An API standard for multi-agent reinforcement learning environments, with popular reference environments and related utilities
A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.