4 3

haoran he

haoranhe

tinnerhrhe

AI & ML interests

None yet

Recent Activity

new activity about 2 months ago

haoranhe/ROVER-Qwen3-8B:Improve model card: Add metadata, paper, and GitHub links

new activity about 2 months ago

haoranhe/ROVER-Qwen3-4B:Improve model card: Add metadata, paper, project page, and GitHub links

new activity about 2 months ago

haoranhe/ROVER-countdown-3B:Improve model card: Add pipeline tag, library name, paper, and GitHub links

View all activity

Organizations

None yet

New activity in haoranhe/ROVER-Qwen3-8B about 2 months ago

Improve model card: Add metadata, paper, and GitHub links

#1 opened about 2 months ago by

nielsr

New activity in haoranhe/ROVER-Qwen3-4B about 2 months ago

Improve model card: Add metadata, paper, project page, and GitHub links

#1 opened about 2 months ago by

nielsr

New activity in haoranhe/ROVER-countdown-3B about 2 months ago

Improve model card: Add pipeline tag, library name, paper, and GitHub links

#1 opened about 2 months ago by

nielsr

upvoted a paper about 2 months ago

Random Policy Valuation is Enough for LLM Reasoning with Verifiable Rewards

Paper • 2509.24981 • Published Sep 29 • 29

commented a paper about 2 months ago

Random Policy Valuation is Enough for LLM Reasoning with Verifiable Rewards

Paper • 2509.24981 • Published Sep 29 • 29 •

updated 3 models about 2 months ago

published 3 models about 2 months ago

haoranhe/ROVER-countdown-3B

Text Generation • 3B • Updated Oct 1 • 5

haoranhe/ROVER-Qwen3-8B

Text Generation • 8B • Updated Oct 1 • 6 • 2

haoranhe/ROVER-Qwen3-4B

Text Generation • 4B • Updated Oct 1 • 7

updated a model 3 months ago

haoranhe/rpe-deepseek-1.5b

2B • Updated Aug 21

published a model 4 months ago

haoranhe/rpe-deepseek-1.5b

2B • Updated Aug 21

authored 4 papers 6 months ago

Large-Scale Actionless Video Pre-Training via Discrete Diffusion for Efficient Policy Learning

Paper • 2402.14407 • Published Feb 22, 2024 • 1

Diffusion Model is an Effective Planner and Data Synthesizer for Multi-Task Reinforcement Learning

Paper • 2305.18459 • Published May 29, 2023

Scaling Image and Video Generation via Test-Time Evolutionary Search

Paper • 2505.17618 • Published May 23 • 41

Bridging the Sim-to-Real Gap from the Information Bottleneck Perspective

Paper • 2305.18464 • Published May 29, 2023

upvoted a paper 6 months ago

Scaling Image and Video Generation via Test-Time Evolutionary Search

Paper • 2505.17618 • Published May 23 • 41

upvoted a paper 9 months ago

Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling

Paper • 2502.06703 • Published Feb 10 • 153

updated a model 12 months ago

haoranhe/VPDD-pretrain

Updated Nov 28, 2024 • 1

haoran he

AI & ML interests

Recent Activity

Organizations

haoranhe's activity

Improve model card: Add metadata, paper, and GitHub links

Improve model card: Add metadata, paper, project page, and GitHub links

Improve model card: Add pipeline tag, library name, paper, and GitHub links