-
Reasoning Language Models: A Blueprint
Paper • 2501.11223 • Published • 33 -
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models
Paper • 2402.03300 • Published • 131 -
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
Paper • 2501.12948 • Published • 425
Peng Zhang
irvingfish
·
AI & ML interests
None yet
Recent Activity
liked
a Space
19 days ago
HuggingFaceTB/smol-training-playbook
liked
a model
3 months ago
openai/gpt-oss-20b
liked
a Space
9 months ago
nanotron/ultrascale-playbook
Organizations
None yet