@hypothetical on Hugging Face: "The smallest and the highest quality in the world Gemma4 E2B and E4B models!…"

Join the conversation

Join the community of Machine Learners and AI enthusiasts.

posted an update 1 day ago

Post

1160

The smallest and the highest quality in the world Gemma4 E2B and E4B models! 7x compression! From 9.3GB -> 1.4GB!

TheStageAI/gemma-4-E2B-it
TheStageAI/gemma-4-E4B-it

yano2mch

about 4 hours ago

I'd love to see this treatment on some of the larger models. Been using G4 26B's and used to use 70B models, those squashed down would remove the need to quantize at all. It would even make the 100B+ models workable.

(note, 8Gb VRAM so memory is definitely the bottleneck)

In this post

hypothetical Kirill
yano2mch yano