Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
hyung gyu rho's picture
2 2

hyung gyu rho

sirano1004
·
  • sirano1004

AI & ML interests

None yet

Recent Activity

authored a paper about 1 month ago
Margin Adaptive DPO: Leveraging Reward Model for Granular Control in Preference Optimization
upvoted a paper about 1 month ago
A Contextual Quality Reward Model for Reliable and Efficient Best-of-N Sampling
upvoted a paper about 1 month ago
Margin Adaptive DPO: Leveraging Reward Model for Granular Control in Preference Optimization
View all activity

Organizations

None yet

upvoted 2 papers about 1 month ago

A Contextual Quality Reward Model for Reliable and Efficient Best-of-N Sampling

Paper • 2510.04087 • Published Oct 5 • 1

Margin Adaptive DPO: Leveraging Reward Model for Granular Control in Preference Optimization

Paper • 2510.05342 • Published Oct 6 • 5
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs