Vedant Nanda
metadata
license: mit

Quantized MTP (multi-token prediction) head of DeepSeek R1, for use with Unsloth's Q4_K quants.

llama.cpp does not support MTP heads, but vLLM does.