Notice:This resource is provided by a third-party author. Please review the code with AI tools or manually before use to ensure security and compatibility.
Pythonclaw-eval/claw-eval
claw-eval
Claw-Eval is an evaluation harness for evaluating LLM as agents. All tasks verified by humans.