Shenao Zhang's picture

3 9 10

Shenao Zhang

ZhangShenao

·

https://shenao-zhang.github.io/

ShenaoZhang

AI & ML interests

None yet

Recent Activity

authored a paper about 2 months ago

Learning to Reason as Action Abstractions with Scalable Mid-Training RL

upvoted a paper about 2 months ago

Learning to Reason as Action Abstractions with Scalable Mid-Training RL

commented on a paper about 2 months ago

Learning to Reason as Action Abstractions with Scalable Mid-Training RL

View all activity

Organizations

upvoted a paper about 2 months ago

Learning to Reason as Action Abstractions with Scalable Mid-Training RL

Paper • 2509.25810 • Published Sep 30 • 5

upvoted a paper 3 months ago

LLaVA-Critic-R1: Your Critic Model is Secretly a Strong Policy Model

Paper • 2509.00676 • Published Aug 31 • 83

upvoted a paper 5 months ago

ViCrit: A Verifiable Reinforcement Learning Proxy Task for Visual Perception in VLMs

Paper • 2506.10128 • Published Jun 11 • 22

upvoted 2 papers 6 months ago

MORSE-500: A Programmatically Controllable Video Benchmark to Stress-Test Multimodal Reasoning

Paper • 2506.05523 • Published Jun 5 • 34

Beyond Markovian: Reflective Exploration via Bayes-Adaptive RL for LLM Reasoning

Paper • 2505.20561 • Published May 26 • 7

upvoted a paper 7 months ago

Reward-Augmented Data Enhances Direct Preference Alignment of LLMs

Paper • 2410.08067 • Published Oct 10, 2024 • 2

upvoted a paper 9 months ago

Reason for Future, Act for Now: A Principled Framework for Autonomous LLM Agents with Provable Sample Efficiency

Paper • 2309.17382 • Published Sep 29, 2023 • 5

upvoted a paper 11 months ago

Offline Reinforcement Learning for LLM Multi-Step Reasoning

Paper • 2412.16145 • Published Dec 20, 2024 • 38

upvoted a paper over 1 year ago

Self-Exploring Language Models: Active Preference Elicitation for Online Alignment

Paper • 2405.19332 • Published May 29, 2024 • 22