Fan Yuan
Leoyfan
AI & ML interests
None yet
Recent Activity
upvoted a paper 7 days ago
Self-Distilled Agentic Reinforcement Learning
upvoted a paper 27 days ago
Pause or Fabricate? Training Language Models for Grounded Reasoning
upvoted a paper about 1 month ago
GFT: From Imitation to Reward Fine-Tuning with Unbiased Group Advantages and Dynamic Coefficient Rectification