1 4

Junqi Gao

ChetKao

gjq100

AI & ML interests

None yet

Recent Activity

upvoted a paper about 2 months ago

A Survey of Reinforcement Learning for Large Reasoning Models

updated a model 4 months ago

ChetKao/Bohdi-Llama-3.2-3B-Instruct

updated a model 4 months ago

ChetKao/Bohdi-Qwen2.5-7B-Instruct

View all activity

Organizations

None yet

upvoted a paper about 2 months ago

A Survey of Reinforcement Learning for Large Reasoning Models

Paper • 2509.08827 • Published Sep 10 • 184

updated 4 models 4 months ago

published 4 models 4 months ago

ChetKao/Bohdi-Llama-3.2-3B-Instruct

Text Generation • 3B • Updated Jun 24

ChetKao/Bohdi-Qwen2.5-7B-Instruct

Text Generation • 8B • Updated Jun 24 • 4 • 1

ChetKao/Bohdi-Llama-3.1-8B-Instruct

Text Generation • 8B • Updated Jun 24 • 2

ChetKao/Bohdi-gemma-2-9b-it

Text Generation • 9B • Updated Jun 24 • 1

commented a paper 5 months ago

Graph Counselor: Adaptive Graph Exploration via Multi-Agent Synergy to Enhance LLM Reasoning

Paper • 2506.03939 • Published Jun 4 • 2 •

upvoted a paper 5 months ago

ScienceBoard: Evaluating Multimodal Autonomous Agents in Realistic Scientific Workflows

Paper • 2505.19897 • Published May 26 • 104

upvoted a paper 6 months ago

TTRL: Test-Time Reinforcement Learning

Paper • 2504.16084 • Published Apr 22 • 120

authored a paper 7 months ago

GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning

Paper • 2504.00891 • Published Apr 1 • 14

authored a paper 9 months ago

Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling

Paper • 2502.06703 • Published Feb 10 • 153

upvoted a paper 9 months ago