How do I use wav2vec2-large-xlsr-53 instead of wav2vec2-chinese-base_fp16.safetensors?

#1
by anujkk - opened

How do I use wav2vec2-large-xlsr-53 instead of wav2vec2-chinese-base_fp16.safetensors? I tried converting it to safetensors and using it in the workflow, but it gave an error. Is it possible to use any other wav2vec2 model in WavVideoWrapper?

Owner

Each audio model so far seems to be trained with a specific version of wav2vec, and doesn't work well, or at all, with different ones.
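One way to see whether a converted checkpoint can even be loaded in place of the expected one is to compare tensor names and shapes. The sketch below is not part of the wrapper; it reads only the JSON header of each .safetensors file (an 8-byte little-endian length followed by the header) using the standard library. The function names `read_safetensors_shapes` and `compare_checkpoints` are hypothetical, and it is an assumption that the error reported above comes from a mismatch of this kind. For context, base-size wav2vec2 models use a hidden size of 768, while large ones such as wav2vec2-large-xlsr-53 use 1024, so many tensors would be expected to differ.

```python
import json
import struct
import sys


def read_safetensors_shapes(path):
    """Return {tensor_name: shape} from a .safetensors header without loading weights."""
    with open(path, "rb") as f:
        # The first 8 bytes hold the JSON header length as a little-endian uint64.
        (header_len,) = struct.unpack("<Q", f.read(8))
        header = json.loads(f.read(header_len).decode("utf-8"))
    # "__metadata__" is an optional free-form entry, not a tensor.
    return {
        name: tuple(info["shape"])
        for name, info in header.items()
        if name != "__metadata__"
    }


def compare_checkpoints(expected, candidate):
    """List (name, expected_shape, candidate_shape) for every mismatch.

    candidate_shape is None when the tensor is missing from the candidate.
    """
    problems = []
    for name, shape in sorted(expected.items()):
        if name not in candidate:
            problems.append((name, shape, None))
        elif candidate[name] != shape:
            problems.append((name, shape, candidate[name]))
    return problems


if __name__ == "__main__" and len(sys.argv) == 3:
    # Usage (file names are placeholders):
    #   python compare_shapes.py expected.safetensors candidate.safetensors
    expected = read_safetensors_shapes(sys.argv[1])
    candidate = read_safetensors_shapes(sys.argv[2])
    for name, want, got in compare_checkpoints(expected, candidate):
        print(f"{name}: expected {want}, got {got}")
```

If this prints many mismatches, the two files are structurally different models, and renaming or converting the file will not make one a drop-in replacement for the other.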

I think the Chinese version works with any language. I've tested it with English, Spanish, and people speaking English with various accents.

Owner

I think the Chinese version works with any language. I've tested it with English, Spanish, and people speaking English with various accents.

This is true, it does. I'm pretty sure it's fine-tuned on top of the English model.

Hi, where in ComfyUI does the wav2vec belong?
