deepdml
/

whisper-large-v3-turbo-ig-mix-norm

@@ -26,8 +26,9 @@ model-index:
     metrics:
     - name: Wer
       type: wer
-      value: 34.648462173849666
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
@@ -35,9 +36,9 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [deepdml/whisper-large-v3-turbo](https://huggingface.co/deepdml/whisper-large-v3-turbo) on the google/fleurs dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.7047
-- Wer: 34.6485
-- Cer: 10.6743
 ## Model description
@@ -69,11 +70,11 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss | Wer     | Cer     |
 |:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|
-| 0.2017        | 0.2   | 1000 | 0.6332          | 40.5520 | 13.2337 |
-| 0.1311        | 0.4   | 2000 | 0.6560          | 37.9865 | 12.0943 |
-| 0.0572        | 0.6   | 3000 | 0.6759          | 36.4608 | 11.5819 |
-| 0.0503        | 0.8   | 4000 | 0.6915          | 35.1684 | 10.8913 |
-| 0.0355        | 1.0   | 5000 | 0.7047          | 34.6485 | 10.6743 |
 ### Framework versions
@@ -82,16 +83,3 @@ The following hyperparameters were used during training:
 - Pytorch 2.3.0+cu121
 - Datasets 2.19.1
 - Tokenizers 0.19.1
-## Citation
-Please cite the model using the following BibTeX entry:
-```bibtex
-@misc{deepdml/whisper-large-v3-turbo-ig-mix-norm,
-      title={Fine-tuned Whisper turbo ASR model for speech recognition in Lingala},
-      author={Jimenez, David},
-      howpublished={\url{https://huggingface.co/deepdml/whisper-large-v3-turbo-ig-mix-norm}},
-      year={2025}
-    }
-```

     metrics:
     - name: Wer
       type: wer
+      value: 31.264605428725506
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [deepdml/whisper-large-v3-turbo](https://huggingface.co/deepdml/whisper-large-v3-turbo) on the google/fleurs dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.7028
+- Wer: 31.2646
+- Cer: 10.8084
 ## Model description
 | Training Loss | Epoch | Step | Validation Loss | Wer     | Cer     |
 |:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|
+| 0.2019        | 0.2   | 1000 | 0.6438          | 36.4596 | 12.5223 |
+| 0.1293        | 0.4   | 2000 | 0.6558          | 33.7633 | 11.6044 |
+| 0.0589        | 0.6   | 3000 | 0.6882          | 31.8758 | 10.6653 |
+| 0.0504        | 0.8   | 4000 | 0.6845          | 31.0669 | 10.3172 |
+| 0.0353        | 1.0   | 5000 | 0.7028          | 31.2646 | 10.8084 |
 ### Framework versions
 - Pytorch 2.3.0+cu121
 - Datasets 2.19.1
 - Tokenizers 0.19.1

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:fa2afba5225d169b03c2904cd045e593b67af384cb6bfb87115e5ee3996c5f4a
 size 3235581408

 version https://git-lfs.github.com/spec/v1
+oid sha256:5b31b8a3d88dc0333219d41774509c5550880704a7016acda787a7ff238e1318
 size 3235581408