Vedant Nanda commited on
Commit
16d194a
·
1 Parent(s): 9410bf2

Initial commit

Browse files
Files changed (2) hide show
  1. .gitattributes +1 -0
  2. README.md +7 -3
.gitattributes CHANGED
@@ -33,3 +33,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
 
 
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
36
+ *.gguf filter=lfs diff=lfs merge=lfs -text
README.md CHANGED
@@ -1,3 +1,7 @@
1
- ---
2
- license: mit
3
- ---
 
 
 
 
 
1
+ ---
2
+ license: mit
3
+ ---
4
+
5
+ Quantized MTP head of Deepseek R1. For use with the [Unsloth's Q4_K](https://huggingface.co/unsloth/DeepSeek-R1-GGUF) quants.
6
+
7
+ Llama.cpp does not support MTP heads, but vLLM does.