On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification Paper • 2508.05629 • Published Aug 7 • 178 • 21
view post Post 4219 why did 36 people unfollow me 😭we are back in the hundreds.if you become my 500th follower and have proof I'll give you 5 dollars worth of openrouter credits as an API key See translation 3 replies · 😔 4 4 😎 4 4 👀 1 1 + Reply