-
Notifications
You must be signed in to change notification settings - Fork 110
New issue
Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? # to your account
For song vc what should I do #61
Comments
Hello, panxin! I'm also working on singing vc with StarGANv2-VC. I didn't retrain F0 and ASR model. Instead, I made a dataset consisting of Mandarin songs, Mandarin, Japanese and English speech. This is my result. |
@Francis-Komizu well, thank you for your reply, indeed I think starganvc using for song vc may need some further works to work out |
@panxin801 I'm currently working on singing conversion using this model with some further modifications for better performance. I may submit my work to INTERSPEECH next year. |
@yl4579 Well, Congratulations. I'm looking forward for your works . |
@yl4579, is INTERSPEECH 2022 September? If yes, can you share the paper link here |
@mraj96 Sorry, I mean INTERSPEECH next year so it'll be 2023. |
@yl4579 , thank you for your work on StarGANv2-vc. We have been working on making StarGANv2-vc workable on the singing domain. Please find our work https://arxiv.org/abs/2210.11096 which enhances StarGANv2-vc to make it work on the singing domain while working on any-to-any case. |
The main modification which makes StarGANv2-VC work on singing voice is the removal of pitch features from the instance normalization layers of the generator and having an absolute pitch reconstruction loss instead of a normalized pitch reconstruction loss. |
@mayank-git-hub Do you have a github for ROSVC? Couldn't find the source code, very interested! |
Hello and thank you sharing your great work, but I have some questions.
I'm looking forward for your reply, and thank you again.
The text was updated successfully, but these errors were encountered: