Made it small enough to fit on my 6000

Had to quantize from gguf Didnt notice much if any quality loss

Downloads last month
2
GGUF
Model size
123B params
Architecture
llama
Hardware compatibility
Log In to view the estimation

3-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for IIEleven11/Behemoth-ReduX-123B-v1a-Q3_K_M