Commit 434c6a7 (verified) by ybelkada · 1 parent: bc10a49

Update README.md

Files changed (1)
  1. README.md +3 -3
README.md CHANGED
@@ -56,7 +56,7 @@ For more details about the training protocol of this model, please refer to the
 
 # Usage
 
-Currently to use this model you can either rely on Hugging Face `transformers`, `vLLM` or our custom fork of `llama.cpp` library.
+Currently to use this model you can either rely on Hugging Face `transformers`, `vLLM` or our `llama.cpp` library.
 
 ## Inference
 
@@ -102,8 +102,8 @@ vllm serve tiiuae/Falcon-H1-1B-Instruct --tensor-parallel-size 2 --data-parallel
 
 ### `llama.cpp`
 
-While we are working on integrating our architecture directly into `llama.cpp` library, you can install our fork of the library and use it directly: https://github.com/tiiuae/llama.cpp-Falcon-H1
-Use the same installing guidelines as `llama.cpp`.
+You can find all GGUF files compatible with `llama.cpp` under [our official collection](https://huggingface.co/collections/tiiuae/falcon-h1-6819f2795bc406da60fab8df)
+
 
 # Evaluation
 
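The updated `llama.cpp` section points to GGUF files but does not show how to run one. As a rough sketch only: assuming a GGUF file from the collection has already been downloaded locally, and that the upstream `llama.cpp` command-line binary `llama-cli` (with its `-m`, `-p` and `-n` options) is installed, the invocation could be assembled like this. The file name `falcon-h1-1b-instruct.gguf` and the helper `build_llama_cli_command` are hypothetical; the README does not name a specific file.

```python
import shlex


def build_llama_cli_command(gguf_path, prompt, max_tokens=128):
    """Build the argument list for a llama.cpp `llama-cli` run.

    Assumes the upstream `llama-cli` binary is on PATH:
    -m selects the model file, -p the prompt, -n the number of tokens to generate.
    """
    return ["llama-cli", "-m", gguf_path, "-p", prompt, "-n", str(max_tokens)]


# Hypothetical local file name; substitute the GGUF file you actually downloaded.
cmd = build_llama_cli_command("falcon-h1-1b-instruct.gguf", "Tell me a joke")

# Print a shell-safe version of the command for copy and paste.
print(shlex.join(cmd))
```

To execute the command rather than print it, the same list can be passed to `subprocess.run(cmd, check=True)`, which avoids shell quoting issues in the prompt.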