Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
3
3
Sephen Chung
stephenchungmh
Follow
0 followers
·
4 following
https://www.stephen-c.com
stephen-chung-mh
AI & ML interests
Reinforcement learning
Recent Activity
authored
a paper
14 days ago
Interpreting Emergent Planning in Model-Free Reinforcement Learning
authored
a paper
14 days ago
Learning from Peers in Reasoning Models
authored
a paper
14 days ago
Thinker: Learning to Think Fast and Slow
View all activity
Organizations
None yet
Papers
4
arxiv:
2505.21097
arxiv:
2505.07787
arxiv:
2504.01871
arxiv:
2503.04808
models
3
Sort: Recently updated
stephenchungmh/thinker_r7b
8B
•
Updated
14 days ago
•
10
•
1
stephenchungmh/thinker_q1_5b
2B
•
Updated
14 days ago
•
10
•
1
stephenchungmh/thinker_r1_5b
2B
•
Updated
14 days ago
•
18
•
1
datasets
0
None public yet