⚠ Notice: This resource is provided by a third-party author. Please review the code with AI tools or manually before use to ensure security and compatibility.

Python | ContextualAI/HALOs

HALOs

A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).

56.1/100

★ 907 | Forks: 50


Similar Projects

oat

🌾 OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc.

Python | ★ 637

align-anything

Align Anything: Training All-modality Model with Feedback

Python | ★ 4.6K

rlhf-book

Textbook on reinforcement learning from human feedback

Python | ★ 1.7K

LlamaFactory

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python | ★ 68.0K
