--- license: cc0-1.0 datasets: - mah92/Khadijah-FA_EN-Public-Phone-Audio-Dataset language: - fa - en pipeline_tag: text-to-speech --- # بسم اله الرحمن الرحیم - هست کلید در گنج حکیم # Model Card for Khadijah(SA) This is the first persian/english text-to-speech model using the brand new matcha TTS model. Much faster and better than VITS. Works best with the UNIVERSAL_V1_22050Hz hifigan vocoder. You can test this model [here](https://huggingface.co/spaces/k2-fsa/text-to-speech) under persian+english part. Enjoy! ## Training method see: [how_to_train_matcha_tts](https://huggingface.co/mah92/how_to_train_matcha_tts) ## Training results ![Training Results](khadijah-22050.png) ## Credits Trained by Ali Mahmoudi (@mah92) Special thanks to Masoud Azizi (@Mablue ), Amirreza Ramezani (@brightening-eyes ), and Dr. Hamid Jafari (Khaneh Noor Iranian Basir). Special thanks to people from @ttsfarsi channel. I should also thank you @csukuangfj from Xiaomi corporation for your helps and cares in icefall and sherpa-onnx repos. و ما نحن بشئ الا بما رحم ربنا