IndexTeam

12 models • 2 total models in database
Sort by:

IndexTTS-2

IndexTTS2: A Breakthrough in Emotionally Expressive and Duration-Controlled Auto-Regressive Zero-Shot Text-to-Speech Acknowledge 1. tortoise-tts 2. XTTSv2 3. BigVGAN 4. wenet 5. icefall 6. maskgct ...

18,530
600

IndexTTS-1.5

license:apache-2.0
1,676
76

Index-1.9B-Chat

NaNK
361
53

Index-1.9B-Character

NaNK
198
39

Index-1.9B-32K

NaNK
188
7

Index TTS

IndexTTS: An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System [[Paper]](https://arxiv.org/abs/2502.05512) [[Demos]](https://index-tts.github.io) [[Codes]](https://github.com/index-tts/index-tts) IndexTTS is a GPT-style text-to-speech (TTS) model mainly based on XTTS and Tortoise. It is capable of correcting the pronunciation of Chinese characters using pinyin and controlling pauses at any position through punctuation marks. We enhanced multiple modules of the system, including the improvement of speaker condition feature representation, and the integration of BigVGAN2 to optimize audio quality. Trained on tens of thousands of hours of data, our system achieves state-of-the-art performance, outperforming current popular TTS systems such as XTTS, CosyVoice2, Fish-Speech, and F5-TTS. Experience IndexTTS: Please contact [email protected] for more detailed information. Acknowledge 1. tortoise-tts 2. XTTSv2 3. BigVGAN 4. wenet 5. icefall 🌟 If you find our work helpful, please leave us a star and cite our paper.

license:apache-2.0
187
141

Index-1.9B-Character-GGUF

NaNK
177
10

Index-1.9B

NaNK
132
15

Index-1.9B-Pure

NaNK
131
5

Index-1.9B-Constant-LR

NaNK
126
3

Index-1.9B-Chat-GGUF

NaNK
117
26

Index-anisora

license:apache-2.0
22
209