Update README.md
Browse files
README.md
CHANGED
|
@@ -16,8 +16,8 @@ It is the result of converting Eric's float32 repo to float16 for easier storage
|
|
| 16 |
|
| 17 |
## Repositories available
|
| 18 |
|
| 19 |
-
* [
|
| 20 |
-
* [
|
| 21 |
* [float16 HF format model for GPU inference and further conversions](https://huggingface.co/TheBloke/Wizard-Vicuna-7B-Uncensored-HF).
|
| 22 |
|
| 23 |
# Original model card
|
|
|
|
| 16 |
|
| 17 |
## Repositories available
|
| 18 |
|
| 19 |
+
* [4-bit GPTQ models for GPU inference](https://huggingface.co/TheBloke/Wizard-Vicuna-7B-Uncensored-GPTQ).
|
| 20 |
+
* [4-bit, 5-bit and 8-bit GGML models for CPU (+CUDA) inference](https://huggingface.co/TheBloke/Wizard-Vicuna-7B-Uncensored-GGML).
|
| 21 |
* [float16 HF format model for GPU inference and further conversions](https://huggingface.co/TheBloke/Wizard-Vicuna-7B-Uncensored-HF).
|
| 22 |
|
| 23 |
# Original model card
|