⚠

Notice:This resource is provided by a third-party author. Please review the code with AI tools or manually before use to ensure security and compatibility.

Pythonmbzuai-oryx/Awesome-LLM-Post-training

Awesome-LLM-Post-training

Awesome Reasoning LLM Tutorial/Survey/Guide

42.5/100

★ 2.5KForks: 164

View on GitHub →

Loading report...

Similar Projects

OpenRLHF

An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & VLM & TIS & vLLM & Ray & Async RL)

Python★ 9.8K

verl-agent

verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in-Group Policy Optimization for LLM Agent Training"

Python★ 2.2K

QeRL

[ICLR 2026]QeRL enables RL for 32B LLMs on a single H100 GPU.

Python★ 511

langflow

Langflow is a powerful tool for building and deploying AI-powered agents and workflows.

Python★ 152.3K

← Back to List