baomao
wuliPumpkin
		AI & ML interests
None yet
		Recent Activity
upvoted a paper about 1 month ago
Beyond the Exploration-Exploitation Trade-off: A Hidden State Approach for LLM Reasoning in RLVR
						Organizations
None yet