steampunque
/

Mistral-Small-3.1-24B-Instruct-2503-Hybrid-GGUF

4-bit precision

Model card Files Files and versions

steampunque commited on Jun 2

Commit

5ce9c73

·

verified ·

1 Parent(s): fdb398e

Update README.md

Files changed (1) hide show

README.md +8 -0

README.md CHANGED Viewed

@@ -69,6 +69,14 @@ Q3_K_M | 11.5e9 | 5.58  | not tested, should work well
 Q4_K_H | 12.5e9 | 5.49  | slightly smaller than IQ4_XS, similar performance
 IQ4_XS | 12.9e9 | 5.38  | not tested, should work well
 ## Download the file from below:
 | Link | Type | Size/e9 B | Notes |
 |------|------|-----------|-------|

 Q4_K_H | 12.5e9 | 5.49  | slightly smaller than IQ4_XS, similar performance
 IQ4_XS | 12.9e9 | 5.38  | not tested, should work well
+Usage:
+This is a vision capable model. It can be used together with its multimedia projector layers to process images and text inputs
+and generate text outputs. The mmproj file is made available in this repository. To test vision mode follow the docs in the mtmd
+readme in the tools directory of the source tree https://github.com/ggml-org/llama.cpp/blob/master/tools/mtmd/README.md .
+Use of the best available model (Q4_K_H) is recommended to maximize the accuracy of vision mode.  To run it on a 12G VRAM
+GPU use --ngl 32.  Generation speed is still quite good with partial offload.
 ## Download the file from below:
 | Link | Type | Size/e9 B | Notes |
 |------|------|-----------|-------|