Back to List
Notice:This resource is provided by a third-party author. Please review the code with AI tools or manually before use to ensure security and compatibility.
TeXPaperGuru-AI/PaperGuru-Benchmark

PaperGuru-Benchmark

Lifecycle-Aware Memory for long-horizon LLM agents — 66.05% on PaperBench, 94.66% on SurveyBench, 10 peer-reviewed acceptances at FSE/ICML/TOSEM/AEI/ICoGB

57.7/100
1.1KForks: 174
View on GitHub
Loading report...

Similar Projects

AI-Research-SKILLs

87

Comprehensive open-source library of AI research and engineering skills for any AI model. Package the skills and your claude code/codex/gemini agent will be an AI research agent with full horsepower. Maintained by Orchestra Research.

TeX9.9K

awesome-multi-agent-papers

70

A compilation of the best multi-agent papers

TeX1.6K

SurveyX

38

Academic Survey Paper Generation.

TeX980

training-materials

65

Bootlin embedded Linux and kernel training materials

TeX786
Back to List