Independent evaluation results
#9
by
						
yaronr
	
							
						- opened
							
					
Dear Qwen team,
I'm pleased to share our independent evaluation of the model using our implementation of the MMLU-Pro benchmark.
The results demonstrate impressive performance for the model across multiple categories compared with other models.
I hope you find this useful.
