OctoThinker

community

https://github.com/GAIR-NLP/OctoThinker

GAIR-NLP

Recent Activity

Pengfei authored a paper about 1 month ago

daVinci-Env: Open SWE Environment Synthesis at Scale

Pengfei authored a paper 3 months ago

One Sample to Rule Them All: Extreme Data Efficiency in RL Scaling

SinclairWang authored a paper 9 months ago

MegaScience: Pushing the Frontiers of Post-Training Datasets for Science Reasoning

Organization Card

🐙 OctoThinker is led by GAIR

🎯 Our Goal: To reshape the pre-training trajectory so models scale better under RL.

Check our technical report for more details!

Collections 4

models 26

OctoThinker/Llama_32_3B_megamath_web_pro_open_r1_longcot_general_ins_89_10_1_bs4M_seq8k_20B

Text Generation • Updated Jul 7, 2025

OctoThinker/Llama_32_3B_megamath_web_pro_open_r1_longcot_91_bs4M_seq8k_20B

Text Generation • Updated Jul 7, 2025

OctoThinker/Llama_32_3B_megamath_web_pro_megamath_synth_qa_general_ins_89_10_1_bs4M_seq8k_20B

Text Generation • Updated Jul 7, 2025

OctoThinker/Llama_32_3B_megamath_web_pro_megamath_synth_qa_91_bs4M_seq8k_20B

Text Generation • Updated Jul 7, 2025

OctoThinker/Llama_32_3B_megamath_web_pro_max_bs4M_seq8k_20B

Text Generation • Updated Jul 7, 2025

datasets 1

OctoThinker/MegaMath-Web-Pro-Max

Viewer • Updated Jul 6, 2025 • 69.2M • 5.79k • 38

Collections 4

OctoThinker/Llama_32_3B_finemath_4p_bs4M_seq8k_20B

OctoThinker/Llama_32_3B_megamath_web_pro_bs4M_seq8k_20B

OctoThinker/Llama_32_3B_megamath_web_pro_max_bs4M_seq8k_20B

OctoThinker/Llama_32_3B_megamath_web_pro_megamath_synth_qa_31_bs4M_seq8k_20B

OctoThinker/OctoThinker-8B-Long-Base

OctoThinker/OctoThinker-8B-Hybrid-Base

OctoThinker/OctoThinker-8B-Short-Base

models 26

OctoThinker/OctoThinker-3B-Hybrid-Zero

OctoThinker/OctoThinker-3B-Hybrid-Base

OctoThinker/OctoThinker-3B-Short-Zero

OctoThinker/OctoThinker-3B-Short-Base

OctoThinker/Llama_32_3B_megamath_web_pro_max_bs4M_seq8k_100B

Team members 4