Commit History
Delete llama-server-6343-cuda
aad2b42
verified
Rename llama-quantize to llama-quantize-sq
c7dfad7
verified
Upload granite-4.0-tiny-preview-iq4_xs_T3UD.gguf with huggingface_hub
2375290
verified
Upload llama-server-6343-cuda with huggingface_hub
b02ea94
verified
Upload Tiny-Moe.Q6_K_T3.gguf with huggingface_hub
4885b4c
verified
Upload SmartQuant-Falcon-H1-0.5B-Instruct.gguf with huggingface_hub
2eda48e
verified
Rename Llama-3.3-70B-Instruct-SmartQuant.gguf to SmartQuant-Llama-3.3-70B-Instruct.gguf
4755934
verified
Rename granite-3.3-8b-instruct-SmartQuant.gguf to SmartQuant-granite-3.3-8b-instruct.gguf
3344ebb
verified
add quantization tool
8ec229d
TobDeBer
committed on
add granite-3.3-8b-instruct-SmartQuant.gguf
aaed805
TobDeBer
committed on
add first SmartQuant model
ef482e3
TobDeBer
committed on
Update README.md
60b1740
verified
Update README.md
f0b7865
verified
Update README.md
c6e6867
verified
track Llama-3.3-70B-Instruct-SmartQuant.gguf
9f6fb97
Tobias Bergmann
committed on