This is an experiment. Llama.cpp does not support GLM-4.5V and GLM-4.6V yet. I made llama.cpp believe that it's the GLM-4.5-Air architecture (so this model can only process text). It seems to have worked.

Downloads last month: 2,061

GGUF

Model size

107B params

Architecture

glm4moe

Hardware compatibility

2-bit

3-bit

4-bit

5-bit

6-bit

8-bit

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for AliceThirty/GLM-4.6V-gguf

Base model

zai-org/GLM-4.6V

Quantized

(4)

this model