Evaluating Agentic Search with Agent-as-a-Judge
			
	
AI & ML interests: Natural language processing, language models, language agents
models (44)

osunlp/GUI-Drag-7B • 8B • Updated • 73
osunlp/GUI-Drag-3B • 4B • Updated • 29
osunlp/WebJudge-7B • Image-Text-to-Text • 8B • Updated • 45 • 6
osunlp/SAE_BioCLIP_24K_ViT-B-16_iNat21 • Updated
osunlp/UGround-V1-7B • Image-Text-to-Text • 8B • Updated • 2.96k • 19
osunlp/UGround • Image-Text-to-Text • 7B • Updated • 5 • 24
osunlp/Dreamer-7B-Classifieds • Image-Text-to-Text • 8B • Updated • 1 • 1
osunlp/Dreamer-7B-Shopping • Image-Text-to-Text • 8B • Updated • 1 • 1
osunlp/Dreamer-7B-Reddit • Image-Text-to-Text • 8B • Updated • 1 • 1
osunlp/Dreamer-7B • Image-Text-to-Text • 8B • Updated • 1 • 5

datasets (20)

osunlp/Mind2Web-2 • Viewer • Updated • 130 • 107 • 15
osunlp/Mind2Web • Viewer • Updated • 253 • 1.04k • 112
osunlp/GUI-Drag-dataset • Preview • Updated • 36
osunlp/WebGuard • Viewer • Updated • 6k • 44
osunlp/AutoSDT-5K • Viewer • Updated • 5.15k • 81 • 3
osunlp/UGround-V1-Data-Box • Viewer • Updated • 488k • 178 • 8
osunlp/UGround-V1-Data • Viewer • Updated • 1.23M • 8.62k • 21
osunlp/Online-Mind2Web • Viewer • Updated • 300 • 538 • 14
osunlp/Dreamer-V1-Data • Viewer • Updated • 3.12M • 1.41k • 3
osunlp/HippoRAG_2 • Preview • Updated • 195 • 4