facebook/wav2vec2-base-960h
Automatic Speech Recognition • 94.4M • 5.85M • 381
Generate spatial audio from images (and optionally text)
Paper Whisperer