For song vc what should I do #61

Open
panxin801 opened this issue Sep 14, 2022 · 9 comments
Labels
discussion New research topic

Comments

@panxin801

Hello, and thank you for sharing your great work. I have some questions.

  1. For singing VC in Mandarin, I tried training a new StarGANv2-VC model with the pretrained ASR and F0 models, but the results do not sound good. Do you have any advice?
  2. For singing VC in Mandarin, do I need to retrain the ASR or F0 model?
    I'm looking forward to your reply, and thank you again.
@sophiefy

sophiefy commented Sep 16, 2022

Hello, panxin! I'm also working on singing VC with StarGANv2-VC. I didn't retrain the F0 and ASR models. Instead, I built a dataset consisting of Mandarin songs plus Mandarin, Japanese, and English speech. This is my result.

@panxin801
Author

@Francis-Komizu Thank you for your reply. Indeed, I think using StarGANv2-VC for singing VC may need some further work.

@yl4579
Owner

yl4579 commented Sep 16, 2022

@panxin801 I'm currently working on singing conversion using this model with some further modifications for better performance. I may submit my work to INTERSPEECH next year.

@panxin801
Author

@yl4579 Congratulations. I'm looking forward to your work.

@MuruganR96

@yl4579, is that INTERSPEECH 2022, in September? If so, could you share the paper link here?

@yl4579
Owner

yl4579 commented Nov 22, 2022

@mraj96 Sorry, I meant INTERSPEECH next year, so it'll be 2023.

yl4579 added the discussion (New research topic) label Jan 31, 2023
@mayank-git-hub

@yl4579, thank you for your work on StarGANv2-VC. We have been working on making StarGANv2-VC usable in the singing domain. Please see our work, https://arxiv.org/abs/2210.11096, which extends StarGANv2-VC to the singing domain and handles the any-to-any case.

@mayank-git-hub

The main modifications that make StarGANv2-VC work on singing voice are removing the pitch features from the instance normalization layers of the generator and using an absolute pitch reconstruction loss instead of a normalized pitch reconstruction loss.
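
To illustrate why the choice between a normalized and an absolute pitch loss matters for singing, here is a minimal sketch in plain Python. It is not the code from the paper or from the StarGANv2-VC repository. It assumes that "normalized" means dividing each F0 contour by its own temporal mean, and the function names and F0 values are made up for illustration.

```python
from statistics import fmean


def l1(a, b):
    """Mean absolute difference between two equal-length F0 contours (Hz)."""
    return fmean(abs(x - y) for x, y in zip(a, b))


def normalized_f0_loss(f0_src, f0_conv):
    """Hypothetical normalized loss: each contour is divided by its own
    temporal mean first, so only the contour shape is compared."""
    src_mean, conv_mean = fmean(f0_src), fmean(f0_conv)
    return l1([x / src_mean for x in f0_src],
              [y / conv_mean for y in f0_conv])


def absolute_f0_loss(f0_src, f0_conv):
    """Hypothetical absolute loss: the raw F0 values are compared directly."""
    return l1(f0_src, f0_conv)


# Illustrative F0 values in Hz, and the same melody scaled by a constant
# factor, i.e. sung in a different key.
source = [220.0, 246.9, 261.6, 293.7]
transposed = [f * 1.5 for f in source]

# The normalized loss is close to zero, so the key change goes unpenalized.
print(normalized_f0_loss(source, transposed))
# The absolute loss is clearly nonzero, so the key change is penalized.
print(absolute_f0_loss(source, transposed))
```

Under these assumptions, a normalized loss lets the converted voice drift to a different key, which may be acceptable for speech but not for a song that has to stay in tune with its accompaniment.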

@billnye2

@mayank-git-hub Do you have a GitHub repository for ROSVC? I couldn't find the source code, and I'm very interested!
