Jian Guan

Jiann

https://jianguanthu.github.io/

JianGuanTHU

AI & ML interests

Natural language generation; storytelling

Recent Activity

updated a dataset 20 days ago

Jiann/GS-Reasoner-Data

published a dataset 20 days ago

Jiann/GS-Reasoner-Data

upvoted a paper 26 days ago

BAPO: Stabilizing Off-Policy Reinforcement Learning for LLMs via Balanced Policy Optimization with Adaptive Clipping

Paper • 2510.18927 • Published 27 days ago • 82

upvoted a paper about 2 months ago

PromptCoT 2.0: Scaling Prompt Synthesis for Large Language Model Reasoning

Paper • 2509.19894 • Published Sep 24 • 32

upvoted a paper 5 months ago

Reinforcing Spatial Reasoning in Vision-Language Models with Interwoven Thinking and Visual Drawing

Paper • 2506.09965 • Published Jun 11 • 3

upvoted a paper about 1 year ago

Unlocking Reasoning Potential in Large Language Models by Scaling Code-form Planning

Paper • 2409.12452 • Published Sep 19, 2024 • 1