Multilingual large voice generation model with full-stack support for inference, training, and deployment.
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
YuE: Open Full-song Music Generation Foundation Model, similar to Suno.ai but open.
AudioLDM: Generate speech, sound effects, music and beyond, with text.
Audio generation using diffusion models, in PyTorch.
Implementation of SoundStorm, Efficient Parallel Audio Generation from Google DeepMind, in PyTorch.
A family of diffusion models for text-to-audio generation.
InspireMusic: A Unified Framework for Music, Song, Audio Generation.
Official PyTorch implementation of BigVGAN (ICLR 2023)
[CVPR'23] MM-Diffusion: Learning Multi-Modal Diffusion Models for Joint Audio and Video Generation
FunCodec is a research-oriented toolkit for audio quantization and downstream applications such as text-to-speech synthesis and music generation.
Python library for designing and training your own Diffusion Models with PyTorch.
PyTorch implementation of BigVSAN
Official PyTorch implementation of the paper "Catch-A-Waveform: Learning to Generate Audio from a Single Short Example" (NeurIPS 2021)
The AI Podcast Studio: generate podcast scripts and their audio versions with a team of AI workers in a Podcast Studio 🎙️📜
A collection of useful audio datasets and transforms for PyTorch.
Trainer for audio-diffusion-pytorch
Word2Wave: a framework for generating short audio samples from a text prompt using WaveGAN and COALA.
Official implementation of the pipeline presented in "I Hear Your True Colors: Image Guided Audio Generation"