Skip to content

v0.1.0

Latest
Compare
Choose a tag to compare
@chrisbrickhouse chrisbrickhouse released this 10 Apr 02:57
· 2 commits to main since this release
985c7f3

The FAVE-asr package provides a system for the automated transcription of sociolinguistic interview data on local machines for use by aligners like FAVE or the Montreal Forced Aligner. The package provides functions to label different speakers in the same audio (diarization), transcribe speech, and output TextGrids with phrase- or word-level alignments.

Unlike other services, fave-asr does not require uploading your data to other servers and instead focuses on processing audio on your own computer. Audio data can contain highly confidential information, and uploading this data to other services may not comply with ethical or legal data protection obligations. The goal of fave-asr is to serve those use cases where data protection makes local transcription necessary while making the process as seamless as cloud-based transcription services.

Example Use Cases

  • You want a transcription of an interview for more detailed hand correction.
  • You want to transcribe a large corpus and your analysis can tolerate a small error rate.
  • You want to make an audio corpus into a text corpus.
  • You want to know the number of speakers in an audio file.