Yu Meng's picture

1 3

Yu Meng

yumeng5

·

yumeng5

AI & ML interests

None yet

Recent Activity

upvoted a paper 26 days ago

TruthRL: Incentivizing Truthful LLMs via Reinforcement Learning

upvoted a paper 8 months ago

Efficient Test-Time Scaling via Self-Calibration

authored a paper about 1 year ago

Establishing Knowledge Preference in Language Models

View all activity

Organizations

upvoted a paper 26 days ago

TruthRL: Incentivizing Truthful LLMs via Reinforcement Learning

Paper • 2509.25760 • Published 27 days ago • 52

upvoted a paper 8 months ago

Efficient Test-Time Scaling via Self-Calibration

Paper • 2503.00031 • Published Feb 25 • 15

upvoted a collection over 1 year ago

SimPO

This collections contains a list of SimPO and baseline models. • 49 items • Updated Mar 16 • 23