
Haoxiang Wang

Haoxiang-Wang

https://haoxiang-wang.github.io/

AI & ML interests

Machine Learning (Transfer Learning, OOD Generalization, Domain Adaptation, Meta-Learning)

Recent Activity

upvoted a paper about 2 months ago

Reinforce-Ada: An Adaptive Sampling Framework for Reinforce-Style LLM Training

Paper • 2510.04996 • Published Oct 6 • 15

updated a model 4 months ago

nvidia/NFT-32B

Text Generation • 33B • Updated Jul 15 • 59 • 5

published 2 models 4 months ago

nvidia/NFT-32B

Text Generation • 33B • Updated Jul 15 • 59 • 5

nvidia/NFT-7B

Text Generation • 8B • Updated Jul 15 • 66 • 2

updated a model 4 months ago

nvidia/NFT-7B

Text Generation • 8B • Updated Jul 15 • 66 • 2

upvoted a paper 6 months ago

Bridging Supervised Learning and Reinforcement Learning in Math Reasoning

Paper • 2505.18116 • Published May 23 • 4

commented a paper 6 months ago

Bridging Supervised Learning and Reinforcement Learning in Math Reasoning

Paper • 2505.18116 • Published May 23 • 4

upvoted a paper 8 months ago

Cosmos-Reason1: From Physical Common Sense To Embodied Reasoning

Paper • 2503.15558 • Published Mar 18 • 50

upvoted a paper 9 months ago

Self-rewarding correction for mathematical reasoning

Paper • 2502.19613 • Published Feb 26 • 83

New activity in nvidia/Cosmos-1.0-Autoregressive-4B 11 months ago

access restriction

#3 opened 11 months ago by qyx915915

Access restrictions

#2 opened 11 months ago by fximax

updated 9 models 11 months ago
