Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Jiahao004 's Collections
DeepTheorem

DeepTheorem

updated Jun 11

A dataset and RL-zero pipeline for advanced mathematical reasoning of informal theorem proving.

Upvote
2

  • Jiahao004/DeepTheorem

    Viewer • Updated Jul 3 • 121k • 505 • 25

  • Jiahao004/DeepTheorem-qwen-1.5b-rl

    2B • Updated May 26 • 1 • 1

  • Jiahao004/DeepTheorem-qwen-3b-rl

    3B • Updated May 26

  • Jiahao004/DeepTheorem-qwen-7b-rl

    8B • Updated May 26 • 1 • 3

  • Jiahao004/HMMT_FIMO_Putnam

    Updated Jun 6 • 38 • 2

  • DeepTheorem: Advancing LLM Reasoning for Theorem Proving Through Natural Language and Reinforcement Learning

    Paper • 2505.23754 • Published May 29 • 15
Upvote
2
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs