Back to List
Notice:This resource is provided by a third-party author. Please review the code with AI tools or manually before use to ensure security and compatibility.
benchflow-ai/awesome-evals

awesome-evals

A curated, non-BS library of the best resources for building and evaluating AI agents — papers, blogs, talks, tools, benchmarks. Maintained by BenchFlow.

66.4/100
536Forks: 39
View on GitHub
Loading report...

Similar Projects

awesome-hermes-agent

73

A curated list of skills, plugins, tools, integrations, and resources for Hermes Agent by Nous Research

1.8K

awesome-ai-agent-papers

65

A curated collection of AI agent research papers released in 2026, covering agent engineering, memory, evaluation, workflows, and autonomous systems.

1.5K

awesome-openclaw

57

A curated list of the best OpenClaw resources: official projects, skills, plugins, dashboards, deployment tooling, memory systems, and guides.

700

awesome-generative-ai

75

A curated list of modern Generative Artificial Intelligence projects and services

12.2K
Back to List