Back to List
Notice:This resource is provided by a third-party author. Please review the code with AI tools or manually before use to ensure security and compatibility.
PythonVibeBench/VibeSearchBench

VibeSearchBench

🔍 The hardest search benchmark in the wild — vague, multi-turn, proactive. 200 long-horizon tasks with persona-driven progressive disclosure, scored by verifiable schema-free knowledge-graph evaluation. No vibes, just triplet F1.

63.9/100
828Forks: 11
View on GitHubHomepage →
Loading report...

Similar Projects

AutoGPT

96

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

Python184.9K

PageIndex

80

📑 PageIndex: Document Index for Vectorless, Reasoning-based RAG

Python32.8K

agenticSeek

80

Fully Local Manus AI. No APIs, No $200 monthly bills. Enjoy an autonomous agent that thinks, browses the web, and code for the sole cost of electricity. 🔔 Official updates only via twitter @Martin993886460 (Beware of fake account)

Python26.5K

agent-lightning

75

The absolute trainer to light up AI agents.

Python17.3K
Back to List