Home
NL2Code
Cancel

LiveCodeBench

🗂️Benchmark Name: LiveCodeBench 📚Publisher: Arxiv 🏠Author Affiliation: UC Berkeley; MIT; Cornell 🔗URL: https://livecodebench.github.io Scenario: Holistic and Contamination-Free Evaluatio...

Devin

A promising product! 📙Product: Devin 🏠Author Affiliation: Cognition Contribution: Meet Devin, the world’s first fully autonomous AI software engineer. Devin is a tireless, skilled teammate...

StarCoder 2

📙Paper: StarCoder 2 and The Stack v2 The Next Generation 📚Publisher: Arxiv 🏠Author Affiliation: BigCode 🔑Public: √

OpenCodeInterpreter

📙Paper: OpenCodeInterpreter Integrating Code Generation with Execution and Refinement 📚Publisher: Arxiv 🔑Public: √ Link: https://opencodeinterpreter.github.io

StepCoder

📙Paper: StepCoder Improve Code Generation with Reinforcement Learning from Compiler Feedback 📚Publisher: Arxiv 🏠Author Affiliation: Fudan University 🔑Public: ❌

DeepSeek-Coder

📙Paper: DeepSeek-Coder: When the Large Language Model Meets Programming – The Rise of Code Intelligence 📚Publisher: arxiv 🏠Author Affiliation: DeepSeek-AI 🔑Public: ✅ Github: https://gith...

AlphaCodium

📙Paper: [Code Generation with AlphaCodium From Prompt Engineering to Flow Engineering](https://arxiv.org/abs/2401.08500) 📚Publ...

JumpCoder

📙Paper: JumpCoder 📚Publisher: arxiv 🏠Author Affiliation: Zhejiang University 🔑Public: ✅

CodeAgentBench

🗂️Benchmark Name: CodeAgentBench 📚Publisher: Arxiv 🏠Author Affiliation: Peking University 🔗URL: https://github.com/zkcpku/CodeAgent Number of Instances: 101 Problem Description’s Natur...

OOP

🗂️Benchmark Name: OOP 📚Publisher: Arxiv 🏠Author Affiliation: Wuhan University; The University of Sydney; JD Explore Academy 🔗URL: https://github.com/alphadl/OOP-eval Number of Instances:...