Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
caiovicentino1 's Collections
HLWQ Large MoE (100B+)
HLWQ Models
HLWQ Video & Diffusion Models
HLWQ Gemma Models
Nemotron 30B — Consumer GPU Inference
HLWQ Unified (Weights Q5 + KV Cache Q3)
HLWQ MLX (Apple Silicon)
Large Models (27B-35B) HLWQ
Qwen3.5-4B EOQ Quantized
Qwen2.5 EOQ Quantized
Qwen3.5-9B HLWQ
EOQ Compressed Models
Qwen3.5-27B HLWQ

EOQ Compressed Models

updated 14 days ago

EOQ (Entropy-Optimal Quantization) compressed models. Mixed-bit allocation + rANS entropy coding. Smaller download, dequant at load time.

Upvote
-

  • caiovicentino1/Qwen3.5-9B-EOQ-v3

    Text Generation • 5B • Updated 7 days ago • 288 • 1

  • caiovicentino1/Qwen3.5-9B-EOQ-v2

    5B • Updated 7 days ago • 52

  • caiovicentino1/Qwen3.5-9B-EOQ-Dynamic-BitPacked

    5B • Updated 7 days ago • 93 • 1

  • caiovicentino1/Qwen3.5-35B-A3B-EOQ-v3

    15B • Updated 7 days ago • 101
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs