Large Reasoning Models Learn Better Alignment from Flawed Thinking Paper • 2510.00938 • Published 26 days ago • 57
What Characterizes Effective Reasoning? Revisiting Length, Review, and Structure of CoT Paper • 2509.19284 • Published Sep 23 • 22
Learning to Reason as Action Abstractions with Scalable Mid-Training RL Paper • 2509.25810 • Published 28 days ago • 5