h zhao
n1cck
AI & ML interests
None yet
Recent Activity
commented on
a paper
about 2 months ago
Sharing is Caring: Efficient LM Post-Training with Collective RL
Experience Sharing
commented on
a paper
2 months ago
Beyond Pass@1: Self-Play with Variational Problem Synthesis Sustains
RLVR
commented on
a paper
2 months ago
Beyond Pass@1: Self-Play with Variational Problem Synthesis Sustains
RLVR
Organizations
None yet