Terry Yue Zhuo

terryyz

https://terryyz.github.io/

AI & ML interests

code intelligence, now computer intelligence

Recent Activity

liked a Space 16 days ago

bigcode/arena

upvoted a paper 16 days ago

BigCodeArena: Unveiling More Reliable Human Preferences in Code Generation via Execution

liked a Space 16 days ago

akhaliq/veo3.1-fast

upvoted a paper 16 days ago

BigCodeArena: Unveiling More Reliable Human Preferences in Code Generation via Execution

Paper • 2510.08697 • Published 24 days ago • 34

upvoted an article 28 days ago

BigCodeArena: Judging code generations end to end with code executions

Article • 27 days ago • 16

upvoted a collection about 1 month ago

⚔️ BigCodeArena

Collection

Unveiling More Reliable Human Preferences in Code Generation via Execution • 8 items • Updated 20 days ago • 4

upvoted a paper 2 months ago

Training Language Model Agents to Find Vulnerabilities with CTF-Dojo

Paper • 2508.18370 • Published Aug 25 • 3

upvoted a paper 3 months ago

Cyber-Zero: Training Cybersecurity Agents without Runtime

Paper • 2508.00910 • Published Jul 29 • 8

upvoted an article 9 months ago

Blazing-Fast Code Editing via Multi-Layer Speculation

Article • and 3 others • Feb 15 • 17

upvoted a paper about 1 year ago

Horizon-Length Prediction: Advancing Fill-in-the-Middle Capabilities for Code Generation with Lookahead Planning

Paper • 2410.03103 • Published Oct 4, 2024 • 9

upvoted an article over 1 year ago

Announcing BigCodeBench-Hard, and More

Article • Jul 24, 2024 • 14

upvoted 2 papers over 1 year ago

RegMix: Data Mixture as Regression for Language Model Pre-training

Paper • 2407.01492 • Published Jul 1, 2024 • 40

BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions

Paper • 2406.15877 • Published Jun 22, 2024 • 48

upvoted a collection over 1 year ago

🌸BigCodeBench

Collection

Benchmarking Code Generation with Diverse Function Calls and Complex Instructions https://bigcode-bench.github.io/ • 8 items • Updated Nov 12, 2024 • 4

upvoted an article over 1 year ago

BigCodeBench: Benchmarking Large Language Models on Solving Practical and Challenging Programming Tasks

Article • Jun 18, 2024 • 52

upvoted 2 papers over 1 year ago

Aurora-M: The First Open Source Multilingual Language Model Red-teamed according to the U.S. Executive Order

Paper • 2404.00399 • Published Mar 30, 2024 • 42

StarCoder 2 and The Stack v2: The Next Generation

Paper • 2402.19173 • Published Feb 29, 2024 • 149

upvoted 4 collections almost 2 years ago

upvoted 2 papers almost 2 years ago

Astraios: Parameter-Efficient Instruction Tuning Code Large Language Models

Paper • 2401.00788 • Published Jan 1, 2024 • 23

Pop Quiz! Do Pre-trained Code Models Possess Knowledge of Correct API Names?

Paper • 2309.07804 • Published Sep 14, 2023 • 2
