Back to List
Notice:This resource is provided by a third-party author. Please review the code with AI tools or manually before use to ensure security and compatibility.
Pythonbigcode-project/bigcodebench

bigcodebench

[ICLR'25] BigCodeBench: Benchmarking Code Generation Towards AGI

60.2/100
500Forks: 72
View on GitHubHomepage →
Loading report...

Similar Projects

kani

73

kani (カニ) is a highly hackable microframework for tool-calling language models. (NLP-OSS @ EMNLP 2023)

Python600

promptbench

58

A unified evaluation framework for large language models

Python2.8K

vim-ai

58

AI-powered code assistant for Vim. OpenAI and ChatGPT plugin for Vim and Neovim.

Python1.2K

langflow

95

Langflow is a powerful tool for building and deploying AI-powered agents and workflows.

Python148.1K
Back to List