Safetensors
qwen2
draft
speculative-decoding
jukofyork commited on
Commit
b9878ad
·
verified ·
1 Parent(s): 00805f0

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +3 -1
README.md CHANGED
@@ -234,4 +234,6 @@ drop_tails = true
234
 
235
  ```
236
 
237
- I used six `RTX A6000` GPUs over three nodes and hence the `60` batch size (`6 x 10 gradient accumulation steps = 60`).
 
 
 
234
 
235
  ```
236
 
237
+ I used six `RTX A6000` GPUs over three nodes and hence the `60` batch size (`6 x 10 gradient accumulation steps = 60`):
238
+
239
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/65995c45539c808e84c38bf1/GHyDC4c8zR34i_VfCjYKn.png)