Python · allenai/reward-bench

reward-bench

RewardBench: the first evaluation tool for reward models.

55.2/100

★ 727 · Forks: 97

Similar Projects

LlamaFactory

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python · ★ 73.5K

alignment-handbook

Robust recipes to align language models with human and AI preferences

Python · ★ 5.6K

OpenClaw-RL

OpenClaw-RL: Train any agent simply by talking

Python · ★ 5.6K

transformerlab-app

The open source research environment for AI researchers to seamlessly train, evaluate, and scale models from local hardware to GPU clusters.

Python · ★ 5.2K