Gradio

VoiceCraft

This is a duplicated space, as the original space was broken. The license description is based on the original GitHub page.

License

The codebase is under CC BY-NC-SA 4.0 (LICENSE-CODE), and the model weights are under Coqui Public Model License 1.0.0 (LICENSE-MODEL). Note that we use some code from other repositories that are under different licenses: ./models/codebooks_patterns.py is under the MIT license; ./models/modules, ./steps/optim.py, and data/tokenizer.py are under the Apache License, Version 2.0; the phonemizer we used is under the GNU 3.0 License.

Input Audio


Original transcript

Use the WhisperX model to get the transcript. Fix and align it if necessary.

Text

Smart transcript

Mode

TTS / Edit / Long TTS

Last word in prompt

Prompt end time

0 7.86
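The two prompt controls above presumably describe the same cut point: the words of the aligned transcript that finish before the prompt end time form the prompt, and the final one is the last word in the prompt. The sketch below illustrates that relationship under this assumption. The `Word` class, the `split_prompt` helper, and the timings are hypothetical and are not taken from the VoiceCraft code.

```python
from dataclasses import dataclass


@dataclass
class Word:
    text: str
    start: float  # word start, in seconds
    end: float    # word end, in seconds


def split_prompt(words, prompt_end_time):
    """Return (prompt_words, last_word) for words that end by prompt_end_time.

    Hypothetical helper: assumes `words` is sorted by time, as a
    word-level alignment would be.
    """
    prompt = [w for w in words if w.end <= prompt_end_time]
    last = prompt[-1] if prompt else None
    return prompt, last


# Made-up alignment for illustration only.
words = [
    Word("but", 0.10, 0.32),
    Word("when", 0.35, 0.58),
    Word("I", 0.60, 0.70),
    Word("had", 0.72, 0.95),
    Word("approached", 1.00, 1.60),
]

prompt, last = split_prompt(words, prompt_end_time=0.98)
print(" ".join(w.text for w in prompt))  # but when I had
print(last.text)                         # had
```

Picking a cut point between two words, rather than inside one, avoids clipping a word in the prompt audio.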

Output Audio