Scaling Speech Technology to 1,000+ Languages
Paper
•
2305.13516
•
Published
•
11
This repository contains the .safetensors version of the multilingual forced alignment model from Meta's MMS project, specifically the one used in torchaudio.pipelines.MMS_FA.
This model was originally released as part of the Massively Multilingual Speech (MMS) project. All credits go to the original authors.
The original model was provided in .pt format. It was loaded using torchaudio.pipelines.MMS_FA.get_model(with_star=False) and then saved to .safetensors format.