phi0112358/DeepSeek-V2-Lite-Chat-Q4_0-GGUF
16B
•
Updated
•
9
GGUFs, conventional and k-quants – both without imatrix. This should be faster for CPU inference. Right now DeepSee MoEs (Mixture of Experts)