jayn7
/

Z-Image-Turbo-GGUF

Text-to-Image

GGUF

image-generation

Model card Files Files and versions

xet

Community

GGFModel

by zawminhtutx - opened Dec 2, 2025

base: refs/heads/main

←

from: refs/pr/7

Discussion Files changed

-63

Files changed (1) hide show

README.md +1 -63

README.md CHANGED Viewed

@@ -13,7 +13,6 @@ Quantized GGUF versions of the [Z-Image Turbo](https://huggingface.co/Tongyi-MAI
 | Model | Download |
 |--------|--------------|
 | Z-Image Turbo GGUF | [Download](https://huggingface.co/jayn7/Z-Image-Turbo-GGUF/tree/main) |
-| Qwen3-4B (Text Encoder) | [unsloth/Qwen3-4B-GGUF](https://huggingface.co/unsloth/Qwen3-4B-GGUF)
 ### 📷 Example Comparison
 ![z_image_comparison_1](https://cdn-uploads.huggingface.co/production/uploads/651f78681719ac0cec346537/ILKCwkG5LkjF2ZrAXXRbJ.png)
@@ -30,69 +29,8 @@ Check out the original model card [Z-Image Turbo](https://huggingface.co/Tongyi-
 The model can be used with:
-- [**ComfyUI-GGUF**](https://github.com/city96/ComfyUI-GGUF) by **city96**
-- [**Diffusers**](https://github.com/huggingface/diffusers)
-#### Example Usage
-<details>
-<summary>Diffusers</summary>
-```sh
-pip install git+https://github.com/huggingface/diffusers
-```
-```py
-from diffusers import ZImagePipeline, ZImageTransformer2DModel, GGUFQuantizationConfig
-import torch
-prompt = "Young Chinese woman in red Hanfu, intricate embroidery. Impeccable makeup, red floral forehead pattern. Elaborate high bun, golden phoenix headdress, red flowers, beads. Holds round folding fan with lady, trees, bird. Neon lightning-bolt lamp (⚡️), bright yellow glow, above extended left palm. Soft-lit outdoor night background, silhouetted tiered pagoda (西安大雁塔), blurred colorful distant lights."
-height = 1024
-width = 1024
-seed = 42
-#hf_path = "https://huggingface.co/jayn7/Z-Image-Turbo-GGUF/blob/main/z_image_turbo-Q3_K_M.gguf"
-local_path = "path\to\local\model\z_image_turbo-Q3_K_M.gguf"
-transformer = ZImageTransformer2DModel.from_single_file(
-    local_path,
-    quantization_config=GGUFQuantizationConfig(compute_dtype=torch.bfloat16),
-    dtype=torch.bfloat16,
-)
-pipeline = ZImagePipeline.from_pretrained(
-    "Tongyi-MAI/Z-Image-Turbo",
-    transformer=transformer,
-    dtype=torch.bfloat16,
-).to("cuda")
-# [Optional] Attention Backend
-# Diffusers uses SDPA by default. Switch to Custom attention backend for better efficiency if supported:
-#pipeline.transformer.set_attention_backend("_sage_qk_int8_pv_fp16_triton") # Enable Sage Attention
-#pipeline.transformer.set_attention_backend("flash") # Enable Flash-Attention-2
-#pipeline.transformer.set_attention_backend("_flash_3") # Enable Flash-Attention-3
-# [Optional] Model Compilation
-# Compiling the DiT model accelerates inference, but the first run will take longer to compile.
-#pipeline.transformer.compile()
-# [Optional] CPU Offloading
-# Enable CPU offloading for memory-constrained devices.
-#pipeline.enable_model_cpu_offload()
-images = pipeline(
-    prompt=prompt,
-    num_inference_steps=9, # This actually results in 8 DiT forwards
-    guidance_scale=0.0, # Guidance should be 0 for the Turbo models
-    height=height,
-    width=width,
-    generator=torch.Generator("cuda").manual_seed(seed)
-).images[0]
-images.save("zimage.png")
-```
-</details>
 ### Credits

 | Model | Download |
 |--------|--------------|
 | Z-Image Turbo GGUF | [Download](https://huggingface.co/jayn7/Z-Image-Turbo-GGUF/tree/main) |
 ### 📷 Example Comparison
 ![z_image_comparison_1](https://cdn-uploads.huggingface.co/production/uploads/651f78681719ac0cec346537/ILKCwkG5LkjF2ZrAXXRbJ.png)
 The model can be used with:
+- [**ComfyUI-GGUF**](https://github.com/city96/ComfyUI-GGUF) by **city96**
 ### Credits