Zhicheng YANG's picture

7 3

Zhicheng YANG

yangzhch6

https://yangzhch6.github.io/

yangzhch6

AI & ML interests

reasoning with LLMs

Recent Activity

updated a dataset 18 days ago

yangzhch6/DeepInformal-DeepTheorem-Synthetic

updated a dataset 18 days ago

yangzhch6/DeepInformal-Openr1-Math-46K-Synthetic

updated a dataset 18 days ago

yangzhch6/compare-openr1

View all activity

Organizations

None yet

upvoted a collection 18 days ago

DeepInformal

6 items • Updated 18 days ago • 1

upvoted a paper about 1 month ago

Low-probability Tokens Sustain Exploration in Reinforcement Learning with Verifiable Reward

Paper • 2510.03222 • Published Oct 3 • 74

upvoted 3 papers about 2 months ago

rStar2-Agent: Agentic Reasoning Technical Report

Paper • 2508.20722 • Published Aug 28 • 115

Depth-Breadth Synergy in RLVR: Unlocking LLM Reasoning Gains with Adaptive Exploration

Paper • 2508.13755 • Published Aug 19 • 14

Reinforcing Diffusion Models by Direct Group Preference Optimization

Paper • 2510.08425 • Published Oct 9 • 11

upvoted a collection 3 months ago

DARS

Dataset & Model of [Depth-Breadth Synergy in RLVR: Unlocking LLM Reasoning Gains with Adaptive Exploration](https://arxiv.org/abs/2508.13755v1) • 14 items • Updated Sep 26 • 1

upvoted a paper 3 months ago

Beyond Pass@1: Self-Play with Variational Problem Synthesis Sustains RLVR

Paper • 2508.14029 • Published Aug 19 • 118