Commit · 24161a9
1 Parent(s): 98b21f6
docs: update readme
- README.md +13 -5
- config/config.json +0 -11
- config/preprocessor.ts +0 -0
README.md
CHANGED
|
@@ -30,6 +30,16 @@ pipeline_tag: automatic-speech-recognition
|
|
| 30 |
|
| 31 |
This repository contains a quantized version of the Indic Conformer model, a large-scale automatic speech recognition (ASR) model created for Indic languages by AI4Bharat. The original model can be found [here](https://huggingface.co/ai4bharat/indic-conformer-600m-multilingual)
|
| 32 |
|
| 33 |
## Model Details
|
| 34 |
|
| 35 |
- **Model Type**: Automatic Speech Recognition (ASR)
|
|
@@ -44,7 +54,7 @@ This model is intended for transcribing speech in Indic languages into text. It
|
|
| 44 |
|
| 45 |
## Usage
|
| 46 |
|
| 47 |
-
[](https://www.kaggle.com/code/haposeiz/using-indic-asr-quantized)
|
| 48 |
|
| 49 |
### Installation
|
| 50 |
|
|
@@ -78,9 +88,7 @@ print(text)
|
|
| 78 |
## Model Files
|
| 79 |
|
| 80 |
### Config Sunfolder
|
| 81 |
-
- `config.json`: Model configuration including architecture details, quantization settings, and RNN-T parameters
|
| 82 |
- `vocab.json`: Subword vocabulary for supported languages
|
| 83 |
-
- `preprocessor.json`: Preprocessor configuration for audio feature extraction
|
| 84 |
- `language_masks.json`: Language-specific masks for handling multilingual inputs
|
| 85 |
|
| 86 |
### ONNX Subfolder
|
|
@@ -94,13 +102,13 @@ print(text)
|
|
| 94 |
|
| 95 |
## Training Data
|
| 96 |
|
| 97 |
-
|
| 98 |
|
| 99 |
The Calibration Dataset was curated from the [Indic Voices Dataset](https://huggingface.co/datasets/ai4bharat/IndicVoices).
|
| 100 |
|
| 101 |
## Additional Links
|
| 102 |
|
| 103 |
-
|
| 104 |
|
| 105 |
## Contact
|
| 106 |
|
|
|
|
| 30 |
|
| 31 |
This repository contains a quantized version of the Indic Conformer model, a large-scale automatic speech recognition (ASR) model created for Indic languages by AI4Bharat. The original model can be found [here](https://huggingface.co/ai4bharat/indic-conformer-600m-multilingual)
|
| 32 |
|
| 33 |
+
## Benchmarks
|
| 34 |
+
|
| 35 |
+
These benchmarks were conducted on Google Colab free tier with Tesla T4 GPU for Hindi.
|
| 36 |
+
You can use the notebooks in `scripts` directory to reproduce the results or compute for other languages.
|
| 37 |
+
|
| 38 |
+
| *Decoding Method* | FP 32 WER | int8 WER | FP32 CER | int8 CER |
|
| 39 |
+
| ----------------- | --------- | -------- | -------- | -------- |
|
| 40 |
+
| CTC | 0.1645 | 0.2985 | 0.0661 | 0.1698 |
|
| 41 |
+
| RNNT | 0.1508 | 0.2939 | 0.0642 | 0.149 |
|
| 42 |
+
|
| 43 |
## Model Details
|
| 44 |
|
| 45 |
- **Model Type**: Automatic Speech Recognition (ASR)
|
|
|
|
| 54 |
|
| 55 |
## Usage
|
| 56 |
|
| 57 |
+
Use the notebook: [](https://www.kaggle.com/code/haposeiz/using-indic-asr-quantized)
|
| 58 |
|
| 59 |
### Installation
|
| 60 |
|
|
|
|
| 88 |
## Model Files
|
| 89 |
|
| 90 |
### Config Sunfolder
|
|
|
|
| 91 |
- `vocab.json`: Subword vocabulary for supported languages
|
|
|
|
| 92 |
- `language_masks.json`: Language-specific masks for handling multilingual inputs
|
| 93 |
|
| 94 |
### ONNX Subfolder
|
|
|
|
| 102 |
|
| 103 |
## Training Data
|
| 104 |
|
| 105 |
+
Calibration Dataset:https://www.kaggle.com/datasets/haposeiz/indicvoices-calibration-1408
|
| 106 |
|
| 107 |
The Calibration Dataset was curated from the [Indic Voices Dataset](https://huggingface.co/datasets/ai4bharat/IndicVoices).
|
| 108 |
|
| 109 |
## Additional Links
|
| 110 |
|
| 111 |
+
GitHub: https://github.com/atharva-again/indic-asr-onnx
|
| 112 |
|
| 113 |
## Contact
|
| 114 |
|
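Review note: the Benchmarks table added in this commit reports word error rate (WER) and character error rate (CER) without defining them. The sketch below shows the conventional definition, edit distance divided by reference length. It is an illustration only: the helper names `edit_distance`, `wer`, and `cer` are hypothetical, whitespace tokenization and counting spaces as characters are assumptions, and the notebooks in `scripts` may normalize text or compute the metrics differently.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences
    (minimum number of substitutions, insertions, and deletions)."""
    prev = list(range(len(hyp) + 1))  # distances from an empty reference prefix
    for i, r in enumerate(ref, start=1):
        curr = [i]  # turning i reference items into an empty hypothesis costs i deletions
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # match or substitution
        prev = curr
    return prev[-1]


def wer(reference: str, hypothesis: str) -> float:
    """Word error rate: word-level edit distance over the reference word count."""
    ref_words = reference.split()  # assumption: whitespace tokenization
    if not ref_words:
        raise ValueError("reference must contain at least one word")
    return edit_distance(ref_words, hypothesis.split()) / len(ref_words)


def cer(reference: str, hypothesis: str) -> float:
    """Character error rate: character-level edit distance over the reference length."""
    if not reference:
        raise ValueError("reference must not be empty")
    # assumption: spaces count as characters
    return edit_distance(list(reference), list(hypothesis)) / len(reference)
```

Under this definition, a lower value is better, so the table shows the int8 model making more errors than FP32 for both decoding methods on the Hindi test run.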
config/config.json
DELETED
|
@@ -1,11 +0,0 @@
|
|
| 1 |
-
{
|
| 2 |
-
"auto_map": {
|
| 3 |
-
"AutoConfig": "model_onnx.IndicASRConfig",
|
| 4 |
-
"AutoModel": "model_onnx.IndicASRModel"
|
| 5 |
-
},
|
| 6 |
-
"BLANK_ID": 256,
|
| 7 |
-
"RNNT_MAX_SYMBOLS": 10,
|
| 8 |
-
"PRED_RNN_LAYERS": 2,
|
| 9 |
-
"PRED_RNN_HIDDEN_DIM": 640,
|
| 10 |
-
"SOS": 256
|
| 11 |
-
}
|
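Review note: the deleted `config.json` defined `BLANK_ID` as 256. Assuming that value plays the usual blank-token role in CTC decoding (the diff does not show how the model code consumes it), the sketch below illustrates what a greedy CTC collapse does with such an ID. The function name `ctc_greedy_collapse` is hypothetical and is not taken from the repository.

```python
BLANK_ID = 256  # value from the deleted config/config.json; its use here is an assumption


def ctc_greedy_collapse(frame_ids, blank_id=BLANK_ID):
    """Collapse per-frame argmax token IDs into an output token sequence.

    Standard CTC rule: merge consecutive repeats, then drop blanks.
    A blank between two identical tokens keeps them as separate outputs.
    """
    tokens = []
    prev = None
    for token in frame_ids:
        if token != prev and token != blank_id:
            tokens.append(token)
        prev = token
    return tokens
```

For example, the frame sequence `[5, 5, 256, 5, 7, 7, 256]` collapses to `[5, 5, 7]`: the first two frames merge, the blank separates the second `5`, and the repeated `7` merges. Code that relied on reading `BLANK_ID` from `config.json` would need another source for this value after the deletion.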
config/preprocessor.ts
DELETED
|
Binary file (91.7 kB)
|