- MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling — 2511.11793 • Published Nov 14
- Language Models Can Learn from Verbal Feedback Without Scalar Rewards — 2509.22638 • Published Sep 26
- VerlTool: Towards Holistic Agentic Reinforcement Learning with Tool Use — 2509.01055 • Published Sep 1
- HardTests: Synthesizing High-Quality Test Cases for LLM Coding — 2505.24098 • Published May 30
- MLR-Bench: Evaluating AI Agents on Open-Ended Machine Learning Research — 2505.19955 • Published May 26
- GuardReasoner-VL: Safeguarding VLMs via Reinforced Reasoning — 2505.11049 • Published May 16
- Bootstrapping Language Models with DPO Implicit Rewards — 2406.09760 • Published Jun 14, 2024