Release v0.1.0 · Forced-Alignment-and-Vowel-Extraction/fave-asr

The FAVE-asr package provides a system for the automated transcription of sociolinguistic interview data on local machines for use by aligners like FAVE or the Montreal Forced Aligner. The package provides functions to label different speakers in the same audio (diarization), transcribe speech, and output TextGrids with phrase- or word-level alignments.

Unlike other services, fave-asr does not require uploading your data to other servers and instead focuses on processing audio on your own computer. Audio data can contain highly confidential information, and uploading this data to other services may not comply with ethical or legal data protection obligations. The goal of fave-asr is to serve those use cases where data protection makes local transcription necessary while making the process as seamless as cloud-based transcription services.

Example Use Cases

You want a transcription of an interview for more detailed hand correction.
You want to transcribe a large corpus and your analysis can tolerate a small error rate.
You want to make an audio corpus into a text corpus.
You want to know the number of speakers in an audio file.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

v0.1.0

Example Use Cases