File size: 201 Bytes
16d194a |
1 2 3 4 5 6 7 |
---
license: mit
---
Quantized MTP head of Deepseek R1. For use with the [Unsloth's Q4_K](https://huggingface.co/unsloth/DeepSeek-R1-GGUF) quants.
Llama.cpp does not support MTP heads, but vLLM does. |