Geyang's picture

Geyang

geyang627

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 29 days ago

Safe and Scalable Web Agent Learning via Recreated Websites

upvoted an article about 1 month ago

Deriving the PPO Loss from First Principles

upvoted an article about 1 month ago

A Guide to Reinforcement Learning Post-Training for LLMs: PPO, DPO, GRPO, and Beyond

View all activity

Organizations

upvoted a paper 29 days ago

Safe and Scalable Web Agent Learning via Recreated Websites

Paper • 2603.10505 • Published Mar 11 • 27

upvoted 2 articles about 1 month ago

Article

Deriving the PPO Loss from First Principles

Dec 25, 2025

•

40

Article

A Guide to Reinforcement Learning Post-Training for LLMs: PPO, DPO, GRPO, and Beyond

Jan 19

•

13

upvoted a collection 8 months ago

Qwen3

84 items • Updated Dec 31, 2025 • 1.75k

upvoted a collection 10 months ago

CARE

14 items • Updated Jun 30, 2025 • 2

upvoted a paper almost 2 years ago

Beyond Imitation: Leveraging Fine-grained Quality Signals for Alignment

Paper • 2311.04072 • Published Nov 7, 2023 • 1