Arch-Router: Aligning LLM Routing with Human Preferences Paper • 2506.16655 • Published Jun 19
ReTool: Reinforcement Learning for Strategic Tool Use in LLMs Paper • 2504.11536 • Published Apr 15