harbor-framework/terminal-bench (Python)
terminal-bench
A benchmark for LLMs on complicated tasks in the terminal