Speech synthesis results #1

Open
athenasaurav opened this issue Dec 10, 2022 · 0 comments

athenasaurav commented Dec 10, 2022

Hello @hcy71o,

I liked your work on Transfer TTS and SC VITS. I have trained a model for 350,000 steps using only the LibriTTS train-clean-100 dataset, but when I synthesize speech using an arbitrary audio file, the output is not clear.

So, my questions are:

  1. For how many steps did you train your model?

  2. How long (in duration) should the audio files passed to inference.py be?

  3. Should the reference audio come from a speaker in the training data, or can it be from an unseen speaker?

  4. Do you have a demo page where we can compare audio generated by Transfer TTS with audio generated by VITS?

Thanks
