Shaun
drgitt
codegen_eval
- nuprl/MultiPL-E
  Viewer • Updated • 12.7k • 24.8k • 58
- openai/openai_humaneval
  Viewer • Updated • 164 • 102k • 350
- Big Code Models Leaderboard
  Running • 📈 1.46k • Submit code models for evaluation and view leaderboard
- Copilot Evaluation Harness: Evaluating LLM-Guided Software Programming
  Paper • 2402.14261 • Published • 11
Interesting LLMs