openai/mle-bench (Python)
mle-bench
MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering.