IMatrix GGUFs calibrated with the combined_all_small set from https://huggingface.co/datasets/eaddario/imatrix-calibration/tree/main.

Note: due to this model's nonstandard tensor sizes, some quantization types do not make sense. For example, because of quantization fallbacks, IQ2_M ends up only about 300 MB smaller than IQ4_NL. I have therefore only uploaded the quantizations that actually make sense.
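A minimal sketch of downloading and running one of these quants with `huggingface_hub` and `llama-cpp-python`. The filename below is an assumption; substitute whichever uploaded quantization you actually want.

```python
# Minimal usage sketch (assumptions: huggingface_hub and llama-cpp-python are
# installed; the .gguf filename is a placeholder, not necessarily the exact
# name used in this repo).
from huggingface_hub import hf_hub_download
from llama_cpp import Llama

# Download one of the uploaded quants (filename is hypothetical).
model_path = hf_hub_download(
    repo_id="ilintar/NVIDIA-Nemotron-Nano-9B-v2-GGUF",
    filename="NVIDIA-Nemotron-Nano-9B-v2-IQ4_NL.gguf",
)

# Load the GGUF and run a short completion.
llm = Llama(model_path=model_path, n_ctx=4096)
result = llm(
    "What is an importance matrix (imatrix) used for in GGUF quantization?",
    max_tokens=128,
)
print(result["choices"][0]["text"])
```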
Model tree for ilintar/NVIDIA-Nemotron-Nano-9B-v2-GGUF
- Base model: nvidia/NVIDIA-Nemotron-Nano-12B-v2-Base
- Finetuned: nvidia/NVIDIA-Nemotron-Nano-12B-v2
- Finetuned: nvidia/NVIDIA-Nemotron-Nano-9B-v2