huangyundu's picture

5 1

huangyundu

yundu

·

AI & ML interests

None yet

Recent Activity

liked a model 19 days ago

moonshotai/Kimi-K2-Thinking

upvoted a collection 26 days ago

upvoted a paper 26 days ago

Supervised Reinforcement Learning: From Expert Trajectories to Step-wise Reasoning

View all activity

Organizations

None yet

upvoted a collection 26 days ago

post-train

1 item • Updated 26 days ago • 1

upvoted a paper 26 days ago

Supervised Reinforcement Learning: From Expert Trajectories to Step-wise Reasoning

Paper • 2510.25992 • Published 27 days ago • 44

upvoted 2 papers 28 days ago

Reasoning with Sampling: Your Base Model is Smarter Than You Think

Paper • 2510.14901 • Published Oct 16 • 47

Tongyi DeepResearch Technical Report

Paper • 2510.24701 • Published 29 days ago • 95

upvoted a paper 29 days ago

Agent Learning via Early Experience

Paper • 2510.08558 • Published Oct 9 • 265