Commit · 9f40eb4
1 Parent(s): fc03db7
Add arXiv link.
README.md CHANGED
|
@@ -120,6 +120,8 @@ license: cc-by-4.0
|
|
| 120 |
|
| 121 |
### `espnet/geolid_vl107only_independent_trainable`
|
| 122 |
|
|
|
|
|
|
|
| 123 |
This geolocation-aware language identification (LID) model is developed using the [ESPnet](https://github.com/espnet/espnet/) toolkit. It integrates the powerful pretrained [MMS-1B](https://huggingface.co/facebook/mms-1b) as the encoder and employs [ECAPA-TDNN](https://arxiv.org/pdf/2005.07143) as the embedding extractor to achieve robust spoken language identification.
|
| 124 |
|
| 125 |
The main innovations of this model are:
|
|
@@ -127,7 +129,7 @@ The main innovations of this model are:
|
|
| 127 |
2. Conditioning the intermediate representations of the self-supervised learning (SSL) encoder on intermediate-layer information.
|
| 128 |
This geolocation-aware strategy greatly improves robustness, especially for dialects and accented variations.
|
| 129 |
|
| 130 |
-
For further details on the geolocation-aware LID methodology, please refer to our paper: *Geolocation-Aware Robust Spoken Language Identification* (arXiv
|
| 131 |
|
| 132 |
### Usage Guide: How to use in ESPnet2
|
| 133 |
|
|
|
|
| 120 |
|
| 121 |
### `espnet/geolid_vl107only_independent_trainable`
|
| 122 |
|
| 123 |
+
[Paper](https://arxiv.org/pdf/2508.17148)
|
| 124 |
+
|
| 125 |
This geolocation-aware language identification (LID) model is developed using the [ESPnet](https://github.com/espnet/espnet/) toolkit. It integrates the powerful pretrained [MMS-1B](https://huggingface.co/facebook/mms-1b) as the encoder and employs [ECAPA-TDNN](https://arxiv.org/pdf/2005.07143) as the embedding extractor to achieve robust spoken language identification.
|
| 126 |
|
| 127 |
The main innovations of this model are:
|
|
|
|
| 129 |
2. Conditioning the intermediate representations of the self-supervised learning (SSL) encoder on intermediate-layer information.
|
| 130 |
This geolocation-aware strategy greatly improves robustness, especially for dialects and accented variations.
|
| 131 |
|
| 132 |
+
For further details on the geolocation-aware LID methodology, please refer to our paper: *Geolocation-Aware Robust Spoken Language Identification* ([arXiv](https://arxiv.org/pdf/2508.17148)).
|
| 133 |
|
| 134 |
### Usage Guide: How to use in ESPnet2
|
| 135 |
|