What could be the use cases of this?

by krigeta - opened Oct 4

Discussion

krigeta

Oct 4

Hey @Atotti, amazing work! If possible, could you share the use cases for this?

Atotti

Owner Oct 4

This model is not intended to be used on its own, but I think it has the potential to serve as a foundational audio model similar to HuBERT or WavLM.
In fact, in my preliminary experiments, fine-tuning this model on an environmental sound classification task yielded reasonable, though not particularly high, performance.
It may also be possible to use it as the audio encoder component of other speech LLMs.