- GGUF importance matrix (imatrix) quants for https://huggingface.co/ShinojiResearch/Senku-70B-Full
- The importance matrix was trained for ~50K tokens (105 batches of 512 tokens) using a general purpose imatrix calibration dataset.
- The imatrix is being used on the K-quants as well.
2024-02-26: Updating quants - IQ3_M/IQ3_S/IQ3_XS and IQ2_M/IQ2_S (requires latest commit a33e6a0d).
| Layers | Context | Template |
|---|---|---|
80 |
32764 |
<|im_start|>system |
- Downloads last month
- 140
Hardware compatibility
Log In
to view the estimation
1-bit
2-bit
3-bit
4-bit
