sparklereasoning

non-profit

https://sparkle-reasoning.github.io/

sparkle-reasoning

Activity Feed

AI & ML interests

None defined yet.

Recent Activity

MilaWang updated a model 16 days ago

sparkle-reasoning/SparkleRL-7B-Stage1

MilaWang updated a model 16 days ago

sparkle-reasoning/SparkleRL-7B-Stage2-aug

MilaWang authored a paper about 1 month ago

Beyond Accuracy: Dissecting Mathematical Reasoning for LLMs Under Reinforcement Learning

MilaWang

updated 2 models 16 days ago

sparkle-reasoning/SparkleRL-7B-Stage1

Reinforcement Learning • 8B • Updated 16 days ago • 32 • 2

sparkle-reasoning/SparkleRL-7B-Stage2-aug

Reinforcement Learning • 8B • Updated 16 days ago • 15 • 3

MilaWang

authored 3 papers about 1 month ago

Beyond Accuracy: Dissecting Mathematical Reasoning for LLMs Under Reinforcement Learning

Paper • 2506.04723 • Published Jun 5 • 1

LiveResearchBench: A Live Benchmark for User-Centric Deep Research in the Wild

Paper • 2510.14240 • Published Oct 16 • 11

Synthesizing Agentic Data for Web Agents with Progressive Difficulty Enhancement Mechanisms

Paper • 2510.13913 • Published Oct 15 • 3

MilaWang

updated a dataset 5 months ago

sparkle-reasoning/olympiad_bench

Viewer • Updated Jul 4 • 675 • 17

MilaWang

published a dataset 5 months ago

sparkle-reasoning/olympiad_bench

Viewer • Updated Jul 4 • 675 • 17

MilaWang

updated a dataset 5 months ago

sparkle-reasoning/math500

Viewer • Updated Jul 4 • 500 • 16

MilaWang

published a dataset 5 months ago

sparkle-reasoning/math500

Viewer • Updated Jul 4 • 500 • 16

MilaWang

updated a dataset 5 months ago

sparkle-reasoning/gsm8k

Viewer • Updated Jul 4 • 1.32k • 11

MilaWang

published a dataset 5 months ago

sparkle-reasoning/gsm8k

Viewer • Updated Jul 4 • 1.32k • 11

MilaWang

updated a dataset 6 months ago

sparkle-reasoning/hardmath

Viewer • Updated Jun 13 • 6.5k • 425

MilaWang

published a dataset 6 months ago

sparkle-reasoning/hardmath

Viewer • Updated Jun 13 • 6.5k • 425

MilaWang

updated a dataset 6 months ago

sparkle-reasoning/dsr40k

Viewer • Updated Jun 13 • 40.3k • 22

MilaWang

published a dataset 6 months ago

sparkle-reasoning/dsr40k

Viewer • Updated Jun 13 • 40.3k • 22

MilaWang

updated a dataset 6 months ago

sparkle-reasoning/aime2024

Viewer • Updated Jun 13 • 30 • 30

MilaWang

published a dataset 6 months ago

sparkle-reasoning/aime2024

Viewer • Updated Jun 13 • 30 • 30

MilaWang

authored a paper 6 months ago

Helpful Agent Meets Deceptive Judge: Understanding Vulnerabilities in Agentic Workflows

Paper • 2506.03332 • Published Jun 3 • 2

MilaWang

authored a paper 7 months ago

COSMOS: Predictable and Cost-Effective Adaptation of LLMs

Paper • 2505.01449 • Published Apr 30 • 3

alvinming

authored a paper 11 months ago

Demystifying Domain-adaptive Post-training for Financial LLMs

Paper • 2501.04961 • Published Jan 9 • 11

Team members 2
