📊 CodeForces - an open-r1 Collection

updated May 14

Datasets with FULLY VERIFIABLE competitive programming problems, reasoning traces, and human-created solutions

open-r1/codeforces

Viewer • Updated May 19 • 34.8k • 10.4k • 83

Note Over 10k problems scraped from the CodeForces platform (almost all the available problems). Includes:
- Text rendering of LaTeX equations (using Qwen-VL)
- Problem metadata (tags, difficulty, etc.)
- Editorials when available
- Special model-generated checkers to validate problems with multiple correct answers
- Model-generated additional test cases for problems where not all test cases are public

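To illustrate why problems with multiple correct answers need a checker rather than an exact comparison against one reference output, here is a minimal sketch. The toy problem (given n, print any two positive integers that sum to n) and the function name `check_pair_sum` are hypothetical and are not taken from the dataset; the dataset's actual model-generated checkers may be structured quite differently.

```python
def check_pair_sum(input_text: str, contestant_output: str) -> bool:
    """Hypothetical checker for a toy problem: given n, print any two
    positive integers a and b such that a + b == n.

    Many outputs are valid, so the checker validates the property
    instead of comparing against a single reference answer.
    """
    n = int(input_text.strip())
    tokens = contestant_output.split()
    if len(tokens) != 2:
        return False  # wrong number of values printed
    try:
        a, b = (int(t) for t in tokens)
    except ValueError:
        return False  # output is not made of integers
    return a >= 1 and b >= 1 and a + b == n


if __name__ == "__main__":
    # Two different outputs are both accepted for the same input.
    print(check_pair_sum("10", "3 7"))
    print(check_pair_sum("10", "1 9"))
    # An output that violates the "positive" constraint is rejected.
    print(check_pair_sum("10", "0 10"))
```

A plain string comparison would wrongly reject "1 9" if the reference answer were "3 7", which is the situation such checkers are meant to handle.
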
open-r1/codeforces-cots

Viewer • Updated Mar 28 • 254k • 4.88k • 198

Note CodeForces-CoTs is a large-scale dataset for training reasoning models on competitive programming tasks. It consists of 10k CodeForces problems, each with up to five reasoning traces generated by DeepSeek R1.

open-r1/codeforces-submissions

Viewer • Updated May 14 • 12.7M • 691 • 7

Note Dataset containing over 12 million real human submissions to the CodeForces platform.