hf-audio's Collections

Automatic Speech Recognition 📝

updated Sep 16, 2023

A collection of ASR models supported in 🤗 Transformers

  • openai/whisper-large-v2

    Automatic Speech Recognition • 2B • Updated Feb 29, 2024 • 48.4k • 1.78k

  • facebook/wav2vec2-base-960h

    Automatic Speech Recognition • 94.4M • Updated Nov 14, 2022 • 2M • 384

  • facebook/wav2vec2-large-xlsr-53

    Updated Mar 18, 2022 • 320k • 151

  • facebook/hubert-xlarge-ls960-ft

    Automatic Speech Recognition • 1.0B • Updated Jun 27, 2023 • 634 • 15

  • microsoft/wavlm-large

    Feature Extraction • Updated Feb 2, 2022 • 314k • 92

  • facebook/mms-1b-all

    Automatic Speech Recognition • 1.0B • Updated Jun 15, 2023 • 130k • 167

  • facebook/data2vec-audio-large-960h

    Automatic Speech Recognition • Updated Jun 6, 2022 • 954 • 7

  • facebook/seamless-m4t-large

    Automatic Speech Recognition • Updated Dec 14, 2023 • 512

  • facebook/s2t-small-librispeech-asr

    Automatic Speech Recognition • 29.5M • Updated Sep 6, 2023 • 6.42k • 33

  • facebook/wav2vec2-conformer-rel-pos-large-960h-ft

    Automatic Speech Recognition • Updated Jun 15, 2022 • 937 • 5

  • facebook/wav2vec2-xls-r-2b

    Updated Aug 10, 2022 • 4.39k • 42