speech-to-speech-translation (Cascaded STST)

cascaded speech-to-speech translation (STST), mapping from source speech in any language to target speech in German using my German TTS model.

Description

This repository demonstrates cascaded speech-to-speech translation (STST), which involves mapping source speech in any language to target speech in German. The demo utilizes the following models:

Whisper Base: OpenAI's model for speech translation
My German TTS: My text-to-speech model for generating German speech

How It Works

The cascaded STST process involves two steps:

Speech Translation (Source Language to German Text): The Whisper Base model translates source speech in any language into German text.
Text-to-Speech (German Text to Target Speech): The German text generated by Whisper Base is then input to the My German TTS model to produce the final target speech in German.

Usage

You can use it directly from my huggingface space link: https://huggingface.co/spaces/Salama1429/speech-to-speech-translation

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
README.md		README.md
app.py		app.py
packages.txt		packages.txt
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

speech-to-speech-translation (Cascaded STST)

Description

How It Works

Usage

For more details and examples, refer to the documentation and code in this repository.

About

Releases

Packages

Languages

Salama1429/speech-to-speech-translation

Folders and files

Latest commit

History

Repository files navigation

speech-to-speech-translation (Cascaded STST)

Description

How It Works

Usage

For more details and examples, refer to the documentation and code in this repository.

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages