-
THOR: Tool-Integrated Hierarchical Optimization via RL for Mathematical Reasoning
Paper • 2509.13761 • Published • 16 -
Knapsack RL: Unlocking Exploration of LLMs via Optimizing Budget Allocation
Paper • 2509.25849 • Published • 47 -
Reactive Transformer (RxT) -- Stateful Real-Time Processing for Event-Driven Reactive Language Models
Paper • 2510.03561 • Published • 23 -
Less is More: Recursive Reasoning with Tiny Networks
Paper • 2510.04871 • Published • 462
Daniel Kloimwieder
dkkloimwieder
·
AI & ML interests
None yet
Recent Activity
updated
a collection
20 days ago
Paper
updated
a collection
22 days ago
Paper
updated
a collection
24 days ago
Paper
Organizations
None yet