Experts for GPU-Poors - a phi0112358 Collection

phi0112358 's Collections

Llamafile-2-Go-Deutsch

Experts for GPU-Poors

Experts for GPU-Poors

updated Aug 25, 2024

GGUFs, conventional and k-quants – both without imatrix. This should be faster for CPU inference. Right now DeepSee MoEs (Mixture of Experts)

phi0112358/DeepSeek-V2-Lite-Chat-Q4_0-GGUF

16B • Updated Aug 18, 2024 • 9
phi0112358/DeepSeek-V2-Lite-Chat-Q8_0-GGUF

16B • Updated Aug 18, 2024 • 4
phi0112358/DeepSeek-Coder-V2-Lite-Base-Q4_K_S-GGUF

16B • Updated Aug 11, 2024 • 13
phi0112358/DeepSeek-Coder-V2-Lite-Instruct-Q4_K_M-GGUF

16B • Updated Aug 11, 2024 • 13
phi0112358/DeepSeek-Coder-V2-Lite-Instruct-Q8_0-GGUF

16B • Updated Aug 18, 2024 • 34