An advanced singing voice synthesis system with high fidelity, expressiveness, controllability and flexibility based on DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism
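As an illustration of the shallow diffusion mechanism the title refers to, the sketch below (with a placeholder `denoiser` and noise schedule, not DiffSinger's actual code) diffuses an auxiliary decoder's mel-spectrogram to a shallow step k and runs the reverse process only from k down to 0, instead of denoising from pure Gaussian noise at step T.

```python
import torch

def shallow_diffusion_sample(denoiser, mel_aux, betas, k):
    """Illustrative shallow-diffusion sampling (DDPM-style ancestral sampler).

    denoiser(x_t, t) is assumed to predict the noise added at step t;
    mel_aux is the auxiliary (simple) decoder's mel-spectrogram, and
    k << T is the shallow step where the reverse process starts.
    """
    alphas = 1.0 - betas
    alpha_bar = torch.cumprod(alphas, dim=0)

    # Diffuse the auxiliary mel to step k instead of sampling x_T ~ N(0, I).
    noise = torch.randn_like(mel_aux)
    x = alpha_bar[k].sqrt() * mel_aux + (1.0 - alpha_bar[k]).sqrt() * noise

    # Reverse process from step k down to 0.
    for t in range(k, -1, -1):
        t_batch = torch.full((x.shape[0],), t, device=x.device, dtype=torch.long)
        eps = denoiser(x, t_batch)
        mean = (x - betas[t] / (1.0 - alpha_bar[t]).sqrt() * eps) / alphas[t].sqrt()
        x = mean + betas[t].sqrt() * torch.randn_like(x) if t > 0 else mean
    return x
```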
Command line utility for forced alignment using Kaldi
My-Voice Analysis is a Python library for the analysis of voice (simultaneous speech, high entropy) without the need for a transcription. It segments utterances and detects syllable boundaries, fundamental frequency contours, and formants.
A Python library for measuring the acoustic features of speech (simultaneous speech, high entropy) compared to those of native speech.
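To make the kind of analysis these two libraries describe concrete, a fundamental-frequency contour and formant tracks can be extracted with Praat through the parselmouth bindings; this is a standalone sketch, not either library's own API.

```python
import parselmouth  # Praat bindings: pip install praat-parselmouth

def f0_and_formants(wav_path):
    """Extract an F0 contour and the first two formant tracks from a WAV file."""
    snd = parselmouth.Sound(wav_path)

    pitch = snd.to_pitch()                  # autocorrelation-based F0 analysis
    f0 = pitch.selected_array['frequency']  # Hz, 0 where unvoiced
    times = pitch.xs()

    formants = snd.to_formant_burg()        # Burg LPC formant analysis
    f1 = [formants.get_value_at_time(1, t) for t in times]
    f2 = [formants.get_value_at_time(2, t) for t in times]
    return times, f0, f1, f2
```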
PyTorch implementation of the Factorized TDNN (TDNN-F) from "Semi-Orthogonal Low-Rank Matrix Factorization for Deep Neural Networks" and Kaldi
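For context, a TDNN-F layer factors a wide temporal convolution into two low-rank convolutions and keeps the first factor approximately semi-orthogonal; a rough PyTorch sketch follows (dimensions and the soft penalty are illustrative choices, not the linked implementation).

```python
import torch
import torch.nn as nn

class FactorizedTDNNLayer(nn.Module):
    """One TDNN-F layer: a wide temporal conv factored as conv -> bottleneck -> conv,
    with the first factor softly pushed toward semi-orthogonality (M M^T ~ I)."""

    def __init__(self, in_dim=512, bottleneck=128, out_dim=512, context=1):
        super().__init__()
        self.factor1 = nn.Conv1d(in_dim, bottleneck, kernel_size=2 * context + 1,
                                 padding=context, bias=False)
        self.factor2 = nn.Conv1d(bottleneck, out_dim, kernel_size=1)
        self.relu = nn.ReLU()
        self.norm = nn.BatchNorm1d(out_dim)

    def forward(self, x):  # x: (batch, in_dim, time)
        return self.norm(self.relu(self.factor2(self.factor1(x))))

    def semi_orthogonal_penalty(self):
        # Penalize deviation of the first factor from semi-orthogonality.
        # (Kaldi applies a periodic in-place update instead; this is a soft variant.)
        w = self.factor1.weight.reshape(self.factor1.out_channels, -1)
        gram = w @ w.t()
        eye = torch.eye(gram.shape[0], device=w.device)
        return ((gram - eye) ** 2).sum()
```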
Foreign Accent Conversion by Synthesizing Speech from Phonetic Posteriorgrams (Interspeech'19)
Official code for "Learning Neural Acoustic Fields" (NeurIPS 2022)
[AAAI 2024] CTX-txt2vec, the acoustic model in UniCATS
The official code for the SALMon🍣 benchmark (ICASSP 2025)
🎵 A repository for manually annotating files to create labeled acoustic datasets for machine learning.
SC-CNN: Effective Speaker Conditioning Method for Zero-Shot Multi-Speaker Text-to-Speech Systems
PyTorch implementation of automatic speech recognition models.
Code for the paper: Audio to Score Matching by Combining Phonetic and Duration Information
Automated, end-to-end wakeword model maker using the Precise Wakeword Engine
A simple Automatic Speech Recognition (ASR) model in TensorFlow that lets you focus on the deep neural network itself. It is easy to test popular cells (mostly LSTM and its variants) and models (unidirectional RNN, bidirectional RNN, ResNet, and so on), and you are welcome to experiment with self-defined cells or models.
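A minimal sketch in that spirit, assuming log-mel input features and a character vocabulary: a bidirectional-LSTM acoustic model trained with CTC (all sizes are placeholders, not the repository's configuration).

```python
import tensorflow as tf

def build_bilstm_ctc_model(num_mels=80, vocab_size=30, hidden=256):
    """Bidirectional-LSTM acoustic model emitting per-frame label logits for CTC."""
    feats = tf.keras.Input(shape=(None, num_mels), name="log_mel_frames")
    x = tf.keras.layers.Bidirectional(
        tf.keras.layers.LSTM(hidden, return_sequences=True))(feats)
    x = tf.keras.layers.Bidirectional(
        tf.keras.layers.LSTM(hidden, return_sequences=True))(x)
    logits = tf.keras.layers.Dense(vocab_size + 1)(x)  # +1 for the CTC blank
    return tf.keras.Model(feats, logits)

def ctc_loss(labels, logits, label_len, logit_len, blank_index):
    # Batch-major logits (batch, time, classes); lengths are per-example frame counts.
    return tf.nn.ctc_loss(labels=labels, logits=logits,
                          label_length=label_len, logit_length=logit_len,
                          logits_time_major=False, blank_index=blank_index)
```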
An algorithm that analyses the acoustic features of a voice and builds an acoustic classifier - useful for an automatic speech rater.
Code for: "Leveraging Sound and Wrist Motion to Detect Activities of Daily Living with Commodity Smartwatches"
Repository of an implementation of the matrix method for acoustic levitation simulations.
A sub-repository, still under construction, for building an acoustic model for Mandarin speech recognition.