Automatic Speech Recognition 📝 - a hf-audio Collection

hf-audio 's Collections

Xcodec and Xcodec2

Automatic Speech Recognition 📝

Text to Speech 🗣️

Audio Classification 🔊

Text to Music 🎧

Audio Codecs Embeddings 🎙️

Automatic Speech Recognition 📝

updated Sep 16, 2023

A collection of ASR models supported in 🤗 Transformers

openai/whisper-large-v2

Automatic Speech Recognition • 2B • Updated Feb 29, 2024 • 82.8k • 1.79k
facebook/wav2vec2-base-960h

Automatic Speech Recognition • 94.4M • Updated Nov 14, 2022 • 1.16M • 396
facebook/wav2vec2-large-xlsr-53

Updated Mar 18, 2022 • 256k • 159
facebook/hubert-xlarge-ls960-ft

Automatic Speech Recognition • 1.0B • Updated Jun 27, 2023 • 2.64k • 16
microsoft/wavlm-large

Feature Extraction • Updated Feb 2, 2022 • 573k • 105
facebook/mms-1b-all

Automatic Speech Recognition • 1.0B • Updated Jun 15, 2023 • 1.73M • 196
facebook/data2vec-audio-large-960h

Automatic Speech Recognition • Updated Jun 6, 2022 • 1.03k • 7
facebook/seamless-m4t-large

Automatic Speech Recognition • Updated Dec 14, 2023 • 513
facebook/s2t-small-librispeech-asr

Automatic Speech Recognition • 29.5M • Updated Sep 6, 2023 • 101k • 33
facebook/wav2vec2-conformer-rel-pos-large-960h-ft

Automatic Speech Recognition • Updated Jun 15, 2022 • 1.44k • 5
facebook/wav2vec2-xls-r-2b

Updated Aug 10, 2022 • 1.55k • 45