NOTE: You will need a recent build of llama.cpp to run these quants (i.e. at least commit 494c870).
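One way to confirm that a local llama.cpp checkout already contains the required commit is to ask git whether it is an ancestor of the current HEAD. The sketch below is a minimal illustration, not part of llama.cpp: the helper name `includes_commit` and the checkout path are assumptions, and it relies only on the standard `git merge-base --is-ancestor` command.

```python
import subprocess


def includes_commit(repo_dir: str, commit: str = "494c870") -> bool:
    """Return True if `commit` is an ancestor of HEAD in the checkout at repo_dir.

    Hypothetical helper for illustration; repo_dir is assumed to be a local
    git clone of llama.cpp.
    """
    try:
        result = subprocess.run(
            ["git", "-C", repo_dir, "merge-base", "--is-ancestor", commit, "HEAD"],
            capture_output=True,
        )
    except FileNotFoundError:
        # git is not installed, so the commit cannot be confirmed.
        return False
    # Exit code 0 means the commit is an ancestor of HEAD (or is HEAD itself);
    # any other code means it is not, or git could not resolve the request.
    return result.returncode == 0
```

Calling `includes_commit("path/to/llama.cpp")` on an up-to-date clone should return `True`; a `False` result suggests pulling and rebuilding before loading these files.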
GGUF importance matrix (imatrix) quants for https://huggingface.co/ibm/labradorite-13b
- The importance matrix was trained for ~50K tokens (105 batches of 512 tokens) using a general purpose imatrix calibration dataset.
- The imatrix is also applied to the K-quants.
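The calibration size quoted in the list above can be checked from the batch figures. This is a small arithmetic sketch; the variable names are illustrative, and it assumes every batch was a full 512 tokens.

```python
# Figures taken from the model card: 105 batches of 512 tokens each.
BATCHES = 105
TOKENS_PER_BATCH = 512

# Total calibration tokens, assuming every batch is full.
total_tokens = BATCHES * TOKENS_PER_BATCH

print(total_tokens)  # 53760
```

The exact product is 53,760 tokens, which the card rounds to ~50K.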
| Layers | Context | Template |
|---|---|---|
| 40 | 4096 | `<\|system\|>` |
Model tree for dranger003/labradorite-13b-iMat.GGUF: base model ibm-research/labradorite-13b