⚠

Notice:This resource is provided by a third-party author. Please review the code with AI tools or manually before use to ensure security and compatibility.

Pythonwalkinglabs/hands-on-modern-rl

hands-on-modern-rl

🚀 An open-source, hands-on curriculum bridging the gap from basic RL concepts to LLM alignment, RLVR, and advanced Agentic systems.

82.2/100

★ 3.0KForks: 198

View on GitHub →Homepage →

Loading report...

Similar Projects

DeepAnalyze

DeepAnalyze is the first agentic LLM for autonomous data science. 🎈你的AI数据分析师，自动分析大量数据，一键生成专业分析报告！

Python★ 4.3K

elephant-agent

Personal-Model First Self Evolving AI Agent 🐘

Python★ 565

deer-flow

An open-source long-horizon SuperAgent harness that researches, codes, and creates. With the help of sandboxes, memories, tools, skill, subagents and message gateway, it handles different levels of tasks that could take minutes to hours.

Python★ 71.7K

agent-lightning

The absolute trainer to light up AI agents.

Python★ 17.3K

← Back to List