Back to List
Notice:This resource is provided by a third-party author. Please review the code with AI tools or manually before use to ensure security and compatibility.
Pythonbigcode-project/bigcodebench

bigcodebench

[ICLR'25] BigCodeBench: Benchmarking Code Generation Towards AGI

58.8/100
511Forks: 74
View on GitHubHomepage →
Loading report...

Similar Projects

kani

86

kani (カニ) is a highly hackable microframework for tool-calling language models. (NLP-OSS @ EMNLP 2023)

Python604

promptbench

55

A unified evaluation framework for large language models

Python2.8K

vim-ai

54

AI-powered code assistant for Vim. OpenAI and ChatGPT plugin for Vim and Neovim.

Python1.2K

langflow

95

Langflow is a powerful tool for building and deploying AI-powered agents and workflows.

Python150.2K
Back to List