Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Sankhya0 's Collections
Embodied ai
RL
Neural

RL

updated Oct 5
Upvote
-

  • ExGRPO: Learning to Reason from Experience

    Paper • 2510.02245 • Published Oct 2 • 78

  • A Comprehensive Survey of Self-Evolving AI Agents: A New Paradigm Bridging Foundation Models and Lifelong Agentic Systems

    Paper • 2508.07407 • Published Aug 10 • 97

  • rStar2-Agent: Agentic Reasoning Technical Report

    Paper • 2508.20722 • Published Aug 28 • 115

  • Memory-R1: Enhancing Large Language Model Agents to Manage and Utilize Memories via Reinforcement Learning

    Paper • 2508.19828 • Published Aug 27 • 6

  • Tree Search for LLM Agent Reinforcement Learning

    Paper • 2509.21240 • Published Sep 25 • 87

  • FlowRL: Matching Reward Distributions for LLM Reasoning

    Paper • 2509.15207 • Published Sep 18 • 113

  • Parallel-R1: Towards Parallel Thinking via Reinforcement Learning

    Paper • 2509.07980 • Published Sep 9 • 99
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs