Back to List
Notice:This resource is provided by a third-party author. Please review the code with AI tools or manually before use to ensure security and compatibility.
Pythonsail-sg/oat

oat

🌾 OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc.

65.3/100
650Forks: 63
View on GitHub
Loading report...

Similar Projects

OpenJudge

75

OpenJudge: A Unified Framework for Holistic Evaluation and Quality Rewards

Python570

LlamaFactory

92

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python70.5K

InternLM

67

Official release of InternLM series (InternLM, InternLM2, InternLM2.5, InternLM3).

Python7.2K

alignment-handbook

71

Robust recipes to align language models with human and AI preferences

Python5.6K
Back to List