Notice: This resource is provided by a third-party author. Please review the code with AI tools or manually before use to ensure security and compatibility.
Python · vectara/hallucination-leaderboard

hallucination-leaderboard

Leaderboard Comparing LLM Performance at Producing Hallucinations when Summarizing Short Documents

75.1/100
3.1K · Forks: 96

Similar Projects

strix

89

Open-source AI hackers to find and fix your app’s vulnerabilities.

Python · 20.8K

cai

73

Cybersecurity AI (CAI), the framework for AI Security

Python · 7.3K

AdalFlow

78

AdalFlow: The library to build & auto-optimize LLM applications.

Python · 4.1K

synthetic-data-generator

80

SDG is a specialized framework designed to generate high-quality structured tabular data.

Python · 2.4K