A 1300-hour English speech and text corpus of parliamentary debates for streaming ASR training and benchmarking, speech data filtering and speech data verbatimization.
-
Updated
Mar 30, 2024
A 1300-hour English speech and text corpus of parliamentary debates for streaming ASR training and benchmarking, speech data filtering and speech data verbatimization.
PhD Thesis: "Automatic speech recognition and machine translation with deep neural networks for open educational resources, parliamentary contents and broadcast media" (2024)
Add a description, image, and links to the speech-data-filtering topic page so that developers can more easily learn about it.
To associate your repository with the speech-data-filtering topic, visit your repo's landing page and select "manage topics."