atharva-again committed on
Commit
24161a9
·
1 Parent(s): 98b21f6

docs: update readme

Files changed (3)
  1. README.md +13 -5
  2. config/config.json +0 -11
  3. config/preprocessor.ts +0 -0
README.md CHANGED
@@ -30,6 +30,16 @@ pipeline_tag: automatic-speech-recognition
30
 
31
  This repository contains a quantized version of the Indic Conformer model, a large-scale automatic speech recognition (ASR) model created for Indic languages by AI4Bharat. The original model can be found [here](https://huggingface.co/ai4bharat/indic-conformer-600m-multilingual)
32
 
33
  ## Model Details
34
 
35
  - **Model Type**: Automatic Speech Recognition (ASR)
@@ -44,7 +54,7 @@ This model is intended for transcribing speech in Indic languages into text. It
44
 
45
  ## Usage
46
 
47
- [![Open in Kaggle](https://img.shields.io/badge/Open%20in-Kaggle-blue?logo=kaggle)](https://www.kaggle.com/code/haposeiz/using-indic-asr-quantized)
48
 
49
  ### Installation
50
 
@@ -78,9 +88,7 @@ print(text)
78
  ## Model Files
79
 
80
  ### Config Sunfolder
81
- - `config.json`: Model configuration including architecture details, quantization settings, and RNN-T parameters
82
  - `vocab.json`: Subword vocabulary for supported languages
83
- - `preprocessor.json`: Preprocessor configuration for audio feature extraction
84
  - `language_masks.json`: Language-specific masks for handling multilingual inputs
85
 
86
  ### ONNX Subfolder
@@ -94,13 +102,13 @@ print(text)
94
 
95
  ## Training Data
96
 
97
- The model was quantized using a Calibration Dataset that can be found [here](https://www.kaggle.com/datasets/haposeiz/indicvoices-calibration-1408).
98
 
99
  The Calibration Dataset was curated from the [Indic Voices Dataset](https://huggingface.co/datasets/ai4bharat/IndicVoices).
100
 
101
  ## Additional Links
102
 
103
- - GitHub: https://github.com/atharva-again/indic-asr-onnx
104
 
105
  ## Contact
106
 
 
30
 
31
  This repository contains a quantized version of the Indic Conformer model, a large-scale automatic speech recognition (ASR) model created for Indic languages by AI4Bharat. The original model can be found [here](https://huggingface.co/ai4bharat/indic-conformer-600m-multilingual)
32
 
33
+ ## Benchmarks
34
+
35
+ These benchmarks were conducted on Google Colab free tier with Tesla T4 GPU for Hindi.
36
+ You can use the notebooks in `scripts` directory to reproduce the results or compute for other languages.
37
+
38
+ | *Decoding Method* | FP 32 WER | int8 WER | FP32 CER | int8 CER |
39
+ | ----------------- | --------- | -------- | -------- | -------- |
40
+ | CTC | 0.1645 | 0.2985 | 0.0661 | 0.1698 |
41
+ | RNNT | 0.1508 | 0.2939 | 0.0642 | 0.149 |
42
+
43
  ## Model Details
44
 
45
  - **Model Type**: Automatic Speech Recognition (ASR)
 
54
 
55
  ## Usage
56
 
57
+ Use the notebook: [![Open in Kaggle](https://img.shields.io/badge/Open%20in-Kaggle-blue?logo=kaggle)](https://www.kaggle.com/code/haposeiz/using-indic-asr-quantized)
58
 
59
  ### Installation
60
 
 
88
  ## Model Files
89
 
90
  ### Config Sunfolder
 
91
  - `vocab.json`: Subword vocabulary for supported languages
 
92
  - `language_masks.json`: Language-specific masks for handling multilingual inputs
93
 
94
  ### ONNX Subfolder
 
102
 
103
  ## Training Data
104
 
105
+ Calibration Dataset:https://www.kaggle.com/datasets/haposeiz/indicvoices-calibration-1408
106
 
107
  The Calibration Dataset was curated from the [Indic Voices Dataset](https://huggingface.co/datasets/ai4bharat/IndicVoices).
108
 
109
  ## Additional Links
110
 
111
+ GitHub: https://github.com/atharva-again/indic-asr-onnx
112
 
113
  ## Contact
114
 
config/config.json DELETED
@@ -1,11 +0,0 @@
1
- {
2
- "auto_map": {
3
- "AutoConfig": "model_onnx.IndicASRConfig",
4
- "AutoModel": "model_onnx.IndicASRModel"
5
- },
6
- "BLANK_ID": 256,
7
- "RNNT_MAX_SYMBOLS": 10,
8
- "PRED_RNN_LAYERS": 2,
9
- "PRED_RNN_HIDDEN_DIM": 640,
10
- "SOS": 256
11
- }
config/preprocessor.ts DELETED
Binary file (91.7 kB)
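
Reviewer note on the new Benchmarks table in README.md: it reports word error rate (WER) and character error rate (CER) for FP32 and int8, but the diff does not show how they are computed. The sketch below is a minimal, assumption-laden illustration of the usual definitions (edit distance divided by reference length) and of the relative increase implied by the table's own CTC numbers. The function names are hypothetical, the CER here counts spaces as characters, and the notebooks in `scripts` may use a different library or text normalization.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences (insertions, deletions, substitutions)."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(
                prev[j] + 1,         # deletion of a reference token
                curr[j - 1] + 1,     # insertion of a hypothesis token
                prev[j - 1] + cost,  # substitution, or match when cost is 0
            ))
        prev = curr
    return prev[-1]


def wer(reference: str, hypothesis: str) -> float:
    """Word error rate: word-level edit distance over the number of reference words."""
    ref_words = reference.split()
    if not ref_words:
        raise ValueError("reference must contain at least one word")
    return edit_distance(ref_words, hypothesis.split()) / len(ref_words)


def cer(reference: str, hypothesis: str) -> float:
    """Character error rate (assumption: whitespace counts as a character)."""
    if not reference:
        raise ValueError("reference must not be empty")
    return edit_distance(list(reference), list(hypothesis)) / len(reference)


def relative_increase(baseline: float, quantized: float) -> float:
    """Fractional increase of the quantized error rate over the baseline."""
    return (quantized - baseline) / baseline


if __name__ == "__main__":
    # Toy strings, not taken from the benchmark data.
    print(wer("a b c d", "a x c d"))
    print(cer("abc", "abd"))
    # CTC WER values copied from the table in this commit (FP32 vs int8).
    print(relative_increase(0.1645, 0.2985))
```

Stating the quantization cost as a relative increase alongside the absolute WER and CER makes it easier to compare the CTC and RNNT rows, and the same calculation applies if the benchmark is rerun for other languages.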