Upload ./README.md with huggingface_hub
README.md
CHANGED
@@ -44,13 +44,12 @@ This repository contains Nunchaku-quantized versions of [Shuttle Jaguar](https:/
 - [`svdq-int4_r32-shuttle-jaguar.safetensors`](./svdq-int4_r32-shuttle-jaguar.safetensors): SVDQuant quantized INT4 Shuttle Jaguar model. For users with non-Blackwell GPUs (pre-50-series).
 - [`svdq-fp4_r32-shuttle-jaguar.safetensors`](./svdq-fp4_r32-shuttle-jaguar.safetensors): SVDQuant quantized NVFP4 Shuttle Jaguar model. For users with Blackwell GPUs (50-series).
 
-
 ### Model Sources
 
 - **Inference Engine:** [nunchaku](https://github.com/nunchaku-tech/nunchaku)
 - **Quantization Library:** [deepcompressor](https://github.com/nunchaku-tech/deepcompressor)
 - **Paper:** [SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models](http://arxiv.org/abs/2411.05007)
-- **Demo:** [
+- **Demo:** [demo.nunchaku.tech](https://demo.nunchaku.tech)
 
 ## Usage
 
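The README text in this diff ties each checkpoint to a GPU generation: INT4 for pre-50-series cards, NVFP4 for Blackwell (50-series) cards. A minimal sketch of that selection rule follows. The two filenames come from the README; the helper name `pick_checkpoint` is hypothetical, and the rule that 50-series cards report CUDA compute capability major version 12 is an assumption of this sketch, not something the README states.

```python
from __future__ import annotations

# Filenames are taken from the README; the capability rule below is an assumption.
INT4_FILE = "svdq-int4_r32-shuttle-jaguar.safetensors"
FP4_FILE = "svdq-fp4_r32-shuttle-jaguar.safetensors"


def pick_checkpoint(compute_capability: tuple[int, int]) -> str:
    """Return the checkpoint filename for a (major, minor) CUDA compute capability.

    Assumption: 50-series cards report major version 12; every other
    capability falls back to the INT4 checkpoint.
    """
    major, _minor = compute_capability
    return FP4_FILE if major == 12 else INT4_FILE


if __name__ == "__main__":
    # Hypothetical inputs: (8, 9) stands in for a pre-50-series card,
    # (12, 0) for a 50-series card.
    print(pick_checkpoint((8, 9)))
    print(pick_checkpoint((12, 0)))
```

With PyTorch installed, `torch.cuda.get_device_capability()` returns a `(major, minor)` tuple that could be passed in directly. Keeping the helper free of any GPU dependency makes the rule easy to check on a machine without CUDA.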