Back to List
Notice:This resource is provided by a third-party author. Please review the code with AI tools or manually before use to ensure security and compatibility.
Pythonwalkinglabs/hands-on-modern-rl

hands-on-modern-rl

🚀 An open-source, hands-on curriculum bridging the gap from basic RL concepts to LLM alignment, RLVR, and advanced Agentic systems.

82.2/100
3.0KForks: 198
View on GitHubHomepage →
Loading report...

Similar Projects

DeepAnalyze

61

DeepAnalyze is the first agentic LLM for autonomous data science. 🎈你的AI数据分析师,自动分析大量数据,一键生成专业分析报告!

Python4.3K

elephant-agent

53

Personal-Model First Self Evolving AI Agent 🐘

Python565

deer-flow

84

An open-source long-horizon SuperAgent harness that researches, codes, and creates. With the help of sandboxes, memories, tools, skill, subagents and message gateway, it handles different levels of tasks that could take minutes to hours.

Python71.7K

agent-lightning

74

The absolute trainer to light up AI agents.

Python17.3K
Back to List