TobDeBer
/

SmartQuant

Model card Files Files and versions

29.9 GB

3 contributors

History: 17 commits

TobDeBer's picture

Delete SmartQuant-Falcon-H1-0.5B-Instruct.gguf

04cfb42 verified 21 days ago

.gitattributes

2.2 kB

Rename llama-quantize to llama-quantize-sq about 1 month ago
README.md

405 Bytes

Update README.md 6 months ago
SmartQuant-Llama-3.3-70B-Instruct.gguf

21 GB
xet

Rename Llama-3.3-70B-Instruct-SmartQuant.gguf to SmartQuant-Llama-3.3-70B-Instruct.gguf 6 months ago
SmartQuant-granite-3.3-8b-instruct.gguf

5.84 GB
xet

Rename granite-3.3-8b-instruct-SmartQuant.gguf to SmartQuant-granite-3.3-8b-instruct.gguf 6 months ago
Tiny-Moe.Q6_K_T3.gguf

84.7 MB
xet

Upload Tiny-Moe.Q6_K_T3.gguf with huggingface_hub 2 months ago
calibration_datav3.txt

280 kB

add quantization tool 6 months ago
granite-4.0-tiny-preview-iq4_xs_T3UD.gguf

2.9 GB
xet

Upload granite-4.0-tiny-preview-iq4_xs_T3UD.gguf with huggingface_hub about 2 months ago
llama-quantize-sq

2.78 MB
xet

Rename llama-quantize to llama-quantize-sq about 1 month ago