This project focuses on training high-quality pre-trained models.
| Embedder | Vocoder | Sample Rate 40k | Sample Rate 48k |
|---|---|---|---|
| contentvec | hifigannsf | ❌ | |
| contentvec | sifigan | ❌ | |
| spin | hifigannsf | ❌ | |
| spin | sifigan | ❌ | |
| chinese-hubert-base | hifigannsf | ❌ | |
| contentvec | bigvgan | ❌ | |
| spin-v2 | bigvgan | ❌ |
This project is licensed under the MIT License - see the LICENSE file for details.
Issues and Pull Requests are welcome to help improve the project!
Training code from Applio.
Dedicated to advancing Chinese speech synthesis technology. These base models have been used for fine-tuning most models at Convbased Studio.

