-
Logic-RL: Unleashing LLM Reasoning with Rule-Based Reinforcement Learning
Paper • 2502.14768 • Published • 47 -
S^2R: Teaching LLMs to Self-verify and Self-correct via Reinforcement Learning
Paper • 2502.12853 • Published • 29 -
Diverse Inference and Verification for Advanced Reasoning
Paper • 2502.09955 • Published • 18 -
Distillation Scaling Laws
Paper • 2502.08606 • Published • 48
shanshan wang
cooleel
AI & ML interests
None yet
Recent Activity
updated
a dataset
2 days ago
tensorlake/OCRBenchV2-DocParsing-UpdatedGT
published
a dataset
2 days ago
tensorlake/OCRBenchV2-DocParsing-UpdatedGT
updated
a collection
19 days ago
DocAI