Post
1160
The smallest and the highest quality in the world Gemma4 E2B and E4B models! 7x compression! From 9.3GB -> 1.4GB!
TheStageAI/gemma-4-E2B-it
TheStageAI/gemma-4-E4B-it
TheStageAI/gemma-4-E2B-it
TheStageAI/gemma-4-E4B-it
Join the community of Machine Learners and AI enthusiasts.
Sign UpI'd love to see this treatment on some of the larger models. Been using G4 26B's and used to use 70B models, those squashed down would remove the need to quantize at all. It would even make the 100B+ models workable.
(note, 8Gb VRAM so memory is definitely the bottleneck)