Notice: This resource is provided by a third-party author. Review the code manually or with AI tools before use to ensure security and compatibility.
Python · sail-sg/oat
oat
🌾 OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc.