Notice: This resource is provided by a third-party author. Please review the code with AI tools or manually before use to ensure security and compatibility.
Python vectara/hallucination-leaderboard

hallucination-leaderboard

Leaderboard Comparing LLM Performance at Producing Hallucinations when Summarizing Short Documents

75.0/100
3.2K Forks: 101

Similar Projects

strix

89

Open-source AI hackers to find and fix your app’s vulnerabilities.

Python 24.4K

cai

83

Cybersecurity AI (CAI), the framework for AI Security

Python 8.2K

ai-engineering-from-scratch

75

Learn it. Build it. Ship it for others.

Python 4.8K

AdalFlow

75

AdalFlow: The library to build & auto-optimize LLM applications.

Python 4.1K