---
tags:
- espnet
- audio
- automatic-speech-recognition
language: code
datasets:
- proyecto_nahuatl
license: cc-by-4.0
---

## ESPnet2 ASR model

### `espnet/proyecto_nahuatl`

This model was trained using the proyecto_nahuatl recipe in [espnet](https://github.com/espnet/espnet/).

### Demo: How to use in ESPnet2

```bash
cd espnet/egs2/proyecto_nahuatl
# add data path to prefix in run.sh
./run.sh
```

# RESULTS

## Environments

- date: `Tue May 6 10:18:30 EDT 2025`
- python version: `3.9.21 (main, Dec 11 2024, 16:24:11) [GCC 11.2.0]`
- espnet version: `espnet 202412`
- pytorch version: `pytorch 2.2.0+cu121`
- Git hash: `90ea0823e6d0aa2ea7b322a8b513cbb0b8200e91`
  - Commit date: `Mon May 5 20:04:01 2025 +0000`

## exp/asr_train_asr_s3prl_raw_en_bpe150_sp/decode_asr_ctc_lm_lm_train_lm_en_bpe150_valid.loss.ave_asr_model_valid.loss.ave

### WER

|dataset|Snt|Wrd|Corr|Sub|Del|Ins|Err|S.Err|
|---|---|---|---|---|---|---|---|---|
|test/Hidalgo|324|3760|9.9|57.4|32.7|8.0|98.1|100.0|
|test/Tequila|870|9337|14.0|62.5|23.4|8.8|94.8|100.0|
|test/Zacatlan|1836|16318|21.9|56.6|21.5|5.1|83.2|99.6|

### CER

|dataset|Snt|Wrd|Corr|Sub|Del|Ins|Err|S.Err|
|---|---|---|---|---|---|---|---|---|
|test/Hidalgo|324|23998|62.4|13.0|24.6|15.3|52.9|100.0|
|test/Tequila|870|65418|70.4|11.6|18.0|14.2|43.8|100.0|
|test/Zacatlan|1836|108155|79.9|7.6|12.5|9.4|29.5|99.6|

### TER

|dataset|Snt|Wrd|Corr|Sub|Del|Ins|Err|S.Err|
|---|---|---|---|---|---|---|---|---|
|test/Hidalgo|324|13202|48.2|27.2|24.6|15.9|67.7|100.0|
|test/Tequila|870|37224|55.5|25.3|19.3|13.8|58.3|100.0|
|test/Zacatlan|1836|61871|66.2|19.4|14.4|8.4|42.2|99.6|
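
When reading the tables above, it can help to recall how the columns relate. Assuming the usual sclite-style scoring convention, `Err` is the sum of `Sub`, `Del`, and `Ins` (all percentages of the reference length), up to rounding of the individual columns. The sketch below checks this against the WER rows; the helper name `error_rate` and the 0.1 tolerance are illustrative assumptions and not part of the recipe.

```python
def error_rate(sub: float, dele: float, ins: float) -> float:
    """Err as the sum of substitution, deletion, and insertion rates (all in %)."""
    return round(sub + dele + ins, 1)


# (Sub, Del, Ins, reported Err) copied from the WER table above.
wer_rows = {
    "test/Hidalgo": (57.4, 32.7, 8.0, 98.1),
    "test/Tequila": (62.5, 23.4, 8.8, 94.8),
    "test/Zacatlan": (56.6, 21.5, 5.1, 83.2),
}

for name, (sub, dele, ins, reported) in wer_rows.items():
    computed = error_rate(sub, dele, ins)
    # Assumed tolerance: each column is rounded independently,
    # so the sum may differ from the reported Err by about 0.1.
    assert abs(computed - reported) <= 0.1 + 1e-9, name
    print(f"{name}: computed {computed:.1f}, reported {reported:.1f}")
```

Because insertions count as errors while the denominator is the reference length, `Err` can approach or exceed 100% even when `Corr` is nonzero, as the Hidalgo WER row illustrates.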