My thesis project is the development of a Nepali text-to-speech (TTS) system based on the transformer architecture. I built a FastPitch with HiFi-GAN model specifically for the Nepali language and trained it from scratch. The model achieves Mean Opinion Scores (MOS) of up to 3.70.
Team Members
Ishan Dongol, Bal Krishna Bal
- Role: Researcher | Lead Developer
- Category: React, Rust, TypeScript, Machine Learning, Deep Learning, Heroku, Transformer, NLP
- Duration: Ongoing
- Demo: Click for Demo