Back to List
Notice:This resource is provided by a third-party author. Please review the code with AI tools or manually before use to ensure security and compatibility.
PythonContextualAI/HALOs

HALOs

A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).

56.1/100
907Forks: 50
View on GitHubHomepage →
Loading report...

Similar Projects

oat

71

🌾 OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc.

Python637

align-anything

56

Align Anything: Training All-modality Model with Feedback

Python4.6K

rlhf-book

80

Textbook on reinforcement learning from human feedback

Python1.7K

LlamaFactory

92

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python68.0K
Back to List