weixun's picture

9

weixun

weixun

AI & ML interests

None yet

Recent Activity

upvoted a paper about 2 months ago

Attention Illuminates LLM Reasoning: The Preplan-and-Anchor Rhythm Enables Fine-Grained Policy Optimization

upvoted a paper about 2 months ago

Part II: ROLL Flash -- Accelerating RLVR and Agentic Training with Asynchrony

upvoted a paper 2 months ago

GEM: A Gym for Agentic LLMs

View all activity

Organizations

None yet

Papers 1

arxiv:2405.11143

models 0

None public yet

datasets 0

None public yet