Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Zeyuan Allen-Zhu's picture
3 9 16

Zeyuan Allen-Zhu

zhuzeyuan
sam-mosaic's profile picture Matheart's profile picture danielz01's profile picture
·
http://zeyuan.allen-zhu.com
  • ZeyuanAllenZhu

AI & ML interests

None yet

Organizations

Meta Llama's profile picture AI at Meta's profile picture

authored 2 papers about 1 year ago

Physics of Language Models: Part 2.2, How to Learn From Mistakes on Grade-School Math Problems

Paper • 2408.16293 • Published Aug 29, 2024 • 27

Physics of Language Models: Part 2.1, Grade-School Math and the Hidden Reasoning Process

Paper • 2407.20311 • Published Jul 29, 2024 • 5
authored 6 papers over 1 year ago

Physics of Language Models: Part 1, Context-Free Grammar

Paper • 2305.13673 • Published May 23, 2023 • 7

Physics of Language Models: Part 3.3, Knowledge Capacity Scaling Laws

Paper • 2404.05405 • Published Apr 8, 2024 • 10

Physics of Language Models: Part 3.1, Knowledge Storage and Extraction

Paper • 2309.14316 • Published Sep 25, 2023 • 8

Physics of Language Models: Part 3.2, Knowledge Manipulation

Paper • 2309.14402 • Published Sep 25, 2023 • 7

LoRA: Low-Rank Adaptation of Large Language Models

Paper • 2106.09685 • Published Jun 17, 2021 • 52

Reverse Training to Nurse the Reversal Curse

Paper • 2403.13799 • Published Mar 20, 2024 • 13
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs