Update README.md
Browse files
README.md
CHANGED
|
@@ -13,9 +13,11 @@ tags:
|
|
| 13 |
|
| 14 |
# Model Card
|
| 15 |
|
| 16 |
-
Dynamic quantization of DeepSeek-R1-0528
|
| 17 |
|
| 18 |
-
THIS QUANT ONLY
|
|
|
|
|
|
|
| 19 |
|
| 20 |
See [this detailed guide](https://github.com/ikawrakow/ik_llama.cpp/discussions/258) on how to setup an run **ik_llama**.
|
| 21 |
|
|
|
|
| 13 |
|
| 14 |
# Model Card
|
| 15 |
|
| 16 |
+
Dynamic quantization of DeepSeek-R1-0528 using optimized format only available on **ik_llama** fork, resized to run with 24GB to 32GB VRAM and 512GB RAM systems while providing the best balance between quality and performance for coding.
|
| 17 |
|
| 18 |
+
THIS QUANT USES OPTIMIZED FORMAT THAT IS ONLY AVAILABLE ON **IK_LLAMA** FORK!!!
|
| 19 |
+
|
| 20 |
+
Note that **ik_llama** can run all the **llama.cpp** quants, while adding support for interleaved formats (_R4 and _R8).
|
| 21 |
|
| 22 |
See [this detailed guide](https://github.com/ikawrakow/ik_llama.cpp/discussions/258) on how to setup an run **ik_llama**.
|
| 23 |
|