Back to List
Notice:This resource is provided by a third-party author. Please review the code with AI tools or manually before use to ensure security and compatibility.
Pythonhuggingface/lighteval

lighteval

Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends

82.3/100
2.3KForks: 435
View on GitHubHomepage →
Loading report...

Similar Projects

deepeval

87

The LLM Evaluation Framework

Python14.0K

mlflow

91

The open source developer platform to build AI agents and models with confidence. Enhance your AI applications with end-to-end tracking, observability, and evaluations, all in one integrated platform.

Python24.6K

ragas

82

Supercharge Your LLM Application Evaluations 🚀

Python12.8K

oumi

89

Easily fine-tune, evaluate and deploy gpt-oss, Qwen3, DeepSeek-R1, or any open source LLM / VLM!

Python8.9K
Back to List