TobDeBer/SmartQuant
Tags: GGUF, imatrix, conversational
License: llama3.3
Branch: main
Repository size: 29.9 GB
Contributors: 3
History: 17 commits
Latest commit: "Delete SmartQuant-Falcon-H1-0.5B-Instruct.gguf" by TobDeBer (04cfb42, verified), 21 days ago
| File | Size | Status | Last commit | Updated |
|---|---|---|---|---|
| .gitattributes | 2.2 kB | Safe | Rename llama-quantize to llama-quantize-sq | about 1 month ago |
| README.md | 405 Bytes | Safe | Update README.md | 6 months ago |
| SmartQuant-Llama-3.3-70B-Instruct.gguf | 21 GB | xet | Rename Llama-3.3-70B-Instruct-SmartQuant.gguf to SmartQuant-Llama-3.3-70B-Instruct.gguf | 6 months ago |
| SmartQuant-granite-3.3-8b-instruct.gguf | 5.84 GB | xet | Rename granite-3.3-8b-instruct-SmartQuant.gguf to SmartQuant-granite-3.3-8b-instruct.gguf | 6 months ago |
| Tiny-Moe.Q6_K_T3.gguf | 84.7 MB | xet | Upload Tiny-Moe.Q6_K_T3.gguf with huggingface_hub | 2 months ago |
| calibration_datav3.txt | 280 kB | Safe | add quantization tool | 6 months ago |
| granite-4.0-tiny-preview-iq4_xs_T3UD.gguf | 2.9 GB | xet | Upload granite-4.0-tiny-preview-iq4_xs_T3UD.gguf with huggingface_hub | about 2 months ago |
| llama-quantize-sq | 2.78 MB | xet | Rename llama-quantize to llama-quantize-sq | about 1 month ago |