Demystifying Reinforcement Learning in Agentic Reasoning Paper • 2510.11701 • Published 19 days ago • 31
— UI is a good thing 💅 — Collection cool spaces with a cool UI, what could be better? • 5 items • Updated May 5 • 25
[NeurIPS 2025] RPC Resources Collection Sampled Reasoning Paths for NeurIPS 2025 Paper: A Theoretical Study on Bridging Internal Probability and Self-Consistency for LLM Reasoning • 6 items • Updated 9 days ago • 8
Pushing on Multilingual Reasoning Models with Language-Mixed Chain-of-Thought Paper • 2510.04230 • Published 27 days ago • 26
MobileLLM Collection Optimizing Sub-billion Parameter Language Models for On-Device Use Cases (ICML 2024) https://arxiv.org/abs/2402.14905 • 46 items • Updated Sep 10 • 131
New Trends for Modern Machine Translation with Large Reasoning Models Paper • 2503.10351 • Published Mar 13 • 25
AceReason Collection Math and Code reasoning model trained through reinforcement learning (RL) • 7 items • Updated 10 days ago • 18
Tool Use Reasoning Collection A collection of tool use reasoning dataset in Hermes format • 5 items • Updated Jul 23 • 8
Don't Overthink It: A Survey of Efficient R1-style Large Reasoning Models Paper • 2508.02120 • Published Aug 4 • 19
[New] AI Technologies & Services Collection 'OpenFree AI' 커뮤니티: https://discord.gg/openfreeai • 182 items • Updated 7 days ago • 21
NEMOTRON-CROSSTHINK: Scaling Self-Learning beyond Math Reasoning Paper • 2504.13941 • Published Apr 15 • 11
Eagle 2: Building Post-Training Data Strategies from Scratch for Frontier Vision-Language Models Paper • 2501.14818 • Published Jan 20 • 9