Proposal to integrate into 🤗 Hub #555
@patrickvonplaten Thank you so much, this is a really great and useful feature :D. I have learned a lot from the huggingface transformers repo and, as you can see, our repo has the same structure as the transformers repo, so it should be easy to integrate with huggingface_hub. I'm on vacation and will be back in a few days :D.
Fix AutoProcessor
@patrickvonplaten Merged :D. Can you tell me what the next steps are?
Hey @dathudeptrai, awesome to see that the PR is merged 🥳 As a next step, I think we can create an organization on the hub, here: https://huggingface.co/organizations/new (maybe called Also cc @julien-c, @osanseviero
@patrickvonplaten I just added
@dathudeptrai thank you for creating the org! That's awesome. There are some additional steps on our side. The two main things missing, I think, are:
@dathudeptrai something that could be interesting is to implement a
@dathudeptrai by looking at the examples and familiarizing myself with the library, I was wondering if you would have an idea of the example code snippet that will be shown to the users. From what I see, there are two open questions:
Thank you for the library! I've been playing with it and it's awesome!
Yes, almost all TTS models are now two-stage (text2mel and mel2wav). We can combine them into one end-to-end model for the inference stage :D.
Unlike transformers for NLP, where the input is almost always the same, text2mel inputs vary: a model can take extra inputs such as speaker_ids (for multi-speaker), language_ids (for multilingual), speaker_embeddings (for voice cloning), style embeddings (for emotional TTS), and inputs that adjust speed, f0, energy ... But generally we only need 2 inputs (input_ids and speaker_ids) :D.
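To make the two-stage idea concrete, here is a minimal sketch of how a text2mel stage and a mel2wav stage could be chained behind a single call. Everything in it is an assumption for illustration: `EndToEndTTS`, `toy_text2mel`, `toy_mel2wav`, `frames_per_token` and `hop_size` are hypothetical names with toy logic, not TensorFlowTTS classes, and a single integer `speaker_id` stands in for the `speaker_ids` input mentioned above.

```python
from typing import Callable, List, Sequence

Mel = List[List[float]]  # [frames][mel_bins]
Wave = List[float]       # raw audio samples


def toy_text2mel(input_ids: Sequence[int], speaker_id: int = 0,
                 frames_per_token: int = 2, n_mels: int = 4) -> Mel:
    """Stand-in acoustic model (hypothetical): a fixed number of frames per token."""
    mel = []
    for token in input_ids:
        for _ in range(frames_per_token):
            mel.append([float(token + speaker_id)] * n_mels)
    return mel


def toy_mel2wav(mel: Mel, hop_size: int = 3) -> Wave:
    """Stand-in vocoder (hypothetical): hop_size samples per mel frame."""
    wave = []
    for frame in mel:
        wave.extend([sum(frame) / len(frame)] * hop_size)
    return wave


class EndToEndTTS:
    """Chains a text2mel stage and a mel2wav stage behind one call."""

    def __init__(self, text2mel: Callable[..., Mel], mel2wav: Callable[[Mel], Wave]):
        self.text2mel = text2mel
        self.mel2wav = mel2wav

    def __call__(self, input_ids: Sequence[int], speaker_id: int = 0, **extra) -> Wave:
        # Optional inputs (speed, f0, style, ...) are forwarded to stage one only.
        mel = self.text2mel(input_ids, speaker_id, **extra)
        return self.mel2wav(mel)


tts = EndToEndTTS(toy_text2mel, toy_mel2wav)
audio = tts([5, 7, 9], speaker_id=0)                     # the two required inputs
slow = tts([5, 7, 9], speaker_id=0, frames_per_token=4)  # one optional extra input
```

The wrapper only fixes the two common inputs and passes everything else through as keyword arguments, so stage-one models with different optional inputs can share the same interface.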
Hi @dathudeptrai. We have some exciting news! Last week our team worked on open-sourcing the code for adding code snippets as well as running the inference API for other libraries. This is in the huggingface_hub repo. This PR adds the code snippet as we discussed :) Your users will already benefit from being able to search for all TensorFlowTTS models.
@osanseviero Awesome! :D. I'm uploading all our models to https://huggingface.co/tensorspeech and will add a model card soon :D
Awesome! I'm looking forward to seeing this :) As a tip, you can use different tags
Is a TFLite model available for the ESPnet model https://huggingface.co/julien-c/kan-bayashi-jsut_tts_train_tacotron2_ja?
Hi TensorSpeech team! I hereby propose an integration with the HuggingFace model hub 🤗
This integration would allow you to freely download/upload models from/to the Hugging Face Hub: https://huggingface.co/.
Your users could then download model weights, etc. directly within Python, without having to download the weights manually.
Taking your fastspeech_2_inference.ipynb example, the following diff shows how the code could change to download weights directly from the model hub.
As an example, I uploaded some FastSpeech 2 weights to this repo on the HF hub: https://huggingface.co/patrickvonplaten/tf_tts_fast_speech_2.
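For readers wondering what "download weights from the Hub within Python" amounts to, here is a minimal stdlib-only sketch. It is not the code from the PR: it relies only on the Hub's `resolve` URL layout, and the helper names, the `hub_cache` directory and the `model.h5` filename are hypothetical placeholders (the actual files in the repo may be named differently). In practice the `huggingface_hub` package takes care of this logic.

```python
import os
import urllib.request

HUB_ENDPOINT = "https://huggingface.co"


def hub_file_url(repo_id: str, filename: str, revision: str = "main") -> str:
    """Build the direct-download URL for one file in a Hub model repo."""
    return f"{HUB_ENDPOINT}/{repo_id}/resolve/{revision}/{filename}"


def cache_path(repo_id: str, filename: str, cache_dir: str) -> str:
    """Map (repo, file) to a local path so repeated calls reuse the download."""
    return os.path.join(cache_dir, repo_id.replace("/", "--"), filename)


def download_from_hub(repo_id: str, filename: str,
                      cache_dir: str = "hub_cache", revision: str = "main") -> str:
    """Download a file once (hypothetical helper) and return its local path."""
    local = cache_path(repo_id, filename, cache_dir)
    if not os.path.exists(local):
        os.makedirs(os.path.dirname(local), exist_ok=True)
        urllib.request.urlretrieve(hub_file_url(repo_id, filename, revision), local)
    return local


# Example call (needs network access; "model.h5" is an assumed filename):
# weights = download_from_hub("patrickvonplaten/tf_tts_fast_speech_2", "model.h5")
```

Because the local path is derived from the repo id and filename, a second call with the same arguments returns the cached file without touching the network.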
If you'd like to add this feature to your library we would obviously change the organization name from patrickvonplaten to tensorspeech. You can try it out by running the following code:
Besides freely storing your model weights, we also provide git version control and download statistics for your models :-) We can also provide you with a hosted inference API where users could try out your models directly on the website.
We've already integrated with a couple of other libraries - you can check them out here:
Sorry for the missing tests in the PR - I just made the minimal changes needed to show you what the integration with the HF hub could look like :-) I'd also be more than happy to add you guys to a Slack channel where we could discuss further.
Cheers,
Patrick & Hugging Face team
Also cc @julien-c