arxiv:2504.09710
Wentian Zhao
zwt123home123
AI & ML interests
None yet
Recent Activity
Upvoted a paper about 1 month ago: Vision-Zero: Scalable VLM Self-Improvement via Strategic Gamified Self-Play
						
Upvoted a paper about 1 month ago: EPO: Entropy-regularized Policy Optimization for LLM Agents Reinforcement Learning
						
Upvoted a paper 4 months ago: Skip a Layer or Loop it? Test-Time Depth Adaptation of Pretrained LLMs
Organizations
None yet