Update README.md
Browse files
README.md
CHANGED
|
@@ -132,7 +132,7 @@ assistant_model = AutoModelForCausalLM.from_pretrained(
|
|
| 132 |
assistant_model_id, torch_dtype=torch_dtype, low_cpu_mem_usage=True, use_safetensors=True
|
| 133 |
)
|
| 134 |
assistant_model.to(device)
|
| 135 |
-
model_id = "openai/whisper-large-
|
| 136 |
model = AutoModelForSpeechSeq2Seq.from_pretrained(
|
| 137 |
model_id, torch_dtype=torch_dtype, low_cpu_mem_usage=True, use_safetensors=True
|
| 138 |
)
|
|
@@ -155,7 +155,7 @@ print(result["text"])
|
|
| 155 |
```
|
| 156 |
## Training
|
| 157 |
|
| 158 |
-
The model was trained for 40,000 optimisation steps (or
|
| 159 |
```
|
| 160 |
--teacher_model_name_or_path "openai/whisper-large-v3"
|
| 161 |
--train_dataset_name "mozilla-foundation/common_voice_16_1"
|
|
|
|
| 132 |
assistant_model_id, torch_dtype=torch_dtype, low_cpu_mem_usage=True, use_safetensors=True
|
| 133 |
)
|
| 134 |
assistant_model.to(device)
|
| 135 |
+
model_id = "openai/whisper-large-v3"
|
| 136 |
model = AutoModelForSpeechSeq2Seq.from_pretrained(
|
| 137 |
model_id, torch_dtype=torch_dtype, low_cpu_mem_usage=True, use_safetensors=True
|
| 138 |
)
|
|
|
|
| 155 |
```
|
| 156 |
## Training
|
| 157 |
|
| 158 |
+
The model was trained for 40,000 optimisation steps (or 0.98 epochs), on a single RTX3090 for ~30 hours, using the following training parameters:
|
| 159 |
```
|
| 160 |
--teacher_model_name_or_path "openai/whisper-large-v3"
|
| 161 |
--train_dataset_name "mozilla-foundation/common_voice_16_1"
|