AmpereComputing/ernie-4.5-a3b-21b-thinking-gguf
Ampere's quantization formats (Q4_K_4 / Q8R16) require the Ampere-optimized build of llama.cpp, available here: https://hub.docker.com/r/amperecomputingai/llama.cpp
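A minimal sketch of pulling the image from the Docker Hub link above and running inference against a local GGUF file. The tag, entrypoint flags, and model filename below are assumptions for illustration; check the image's documentation and the repository's file list for the actual names.

```shell
# Pull the Ampere-optimized llama.cpp image (required for the
# Q4_K_4 / Q8R16 quantization formats).
docker pull amperecomputingai/llama.cpp:latest

# Run inference, mounting a local directory that holds the GGUF file.
# The model filename is hypothetical -- use the actual file from the
# AmpereComputing/ernie-4.5-a3b-21b-thinking-gguf repository, and
# consult the image docs for its entrypoint and supported flags.
docker run --rm -v "$PWD/models:/models" amperecomputingai/llama.cpp:latest \
  -m /models/ernie-4.5-a3b-21b-thinking.Q8R16.gguf \
  -p "Hello" -n 64
```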