ASR: Is a BPE ASR model more accurate than a normal char model? #1703
I have been using ASR, and there are two variants: the normal (char) one and the BPE one. Config file, Char: https://github.com/NVIDIA/NeMo/blob/main/examples/asr/experimental/conformer/conformer_char.yaml My understanding is that BPE is for Unicode characters, while char is where we represent transcripts with normal characters. I have been reading some other articles and I am a bit confused, so the question may be stupid. Please clarify.
Replies: 1 comment 1 reply
Subword encoding will not automatically guarantee you a better WER, but it does have a few advantages that make it possible for models to perform better.
There's a new tutorial for ASR with subword encoding, which you could look into for some info on how subword helps CTC-based model training: https://colab.research.google.com/github/NVIDIA/NeMo/blob/main/tutorials/asr/08_ASR_with_Subword_Tokenization.ipynb
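To make the difference concrete, here is a minimal sketch of byte-pair encoding in plain Python. It is a toy, not what NeMo actually runs: the function names (`learn_bpe`, `apply_merge`, `encode`), the tiny corpus, and the merge count are all made up for illustration, and real setups train the tokenizer with a dedicated library. One commonly cited advantage it illustrates is that the same transcript becomes a shorter target sequence with subwords than with characters, which matters for CTC because the target cannot be longer than the number of acoustic frames.

```python
from collections import Counter


def apply_merge(symbols, pair):
    """Replace every adjacent occurrence of `pair` in `symbols` with one merged symbol."""
    out = []
    i = 0
    while i < len(symbols):
        if i + 1 < len(symbols) and (symbols[i], symbols[i + 1]) == pair:
            out.append(symbols[i] + symbols[i + 1])
            i += 2
        else:
            out.append(symbols[i])
            i += 1
    return tuple(out)


def learn_bpe(words, num_merges):
    """Learn up to `num_merges` merge rules from a list of words (toy version)."""
    # Every word starts out as a sequence of single characters.
    vocab = Counter(tuple(w) for w in words)
    merges = []
    for _ in range(num_merges):
        pair_counts = Counter()
        for symbols, freq in vocab.items():
            for a, b in zip(symbols, symbols[1:]):
                pair_counts[(a, b)] += freq
        if not pair_counts:
            break  # every word is already a single symbol
        # Most frequent pair wins; ties are broken lexicographically for determinism.
        best = max(pair_counts, key=lambda p: (pair_counts[p], p))
        merges.append(best)
        new_vocab = Counter()
        for symbols, freq in vocab.items():
            new_vocab[apply_merge(symbols, best)] += freq
        vocab = new_vocab
    return merges


def encode(word, merges):
    """Split `word` into subword tokens by replaying the learned merges in order."""
    symbols = tuple(word)
    for pair in merges:
        symbols = apply_merge(symbols, pair)
    return list(symbols)


# Hypothetical toy corpus; a real tokenizer is trained on the full transcript set.
corpus = ["lower", "lowest", "newer", "newest", "low", "new"]
merges = learn_bpe(corpus, num_merges=6)

char_len = sum(len(w) for w in corpus)
bpe_len = sum(len(encode(w, merges)) for w in corpus)
print("char tokens:", char_len, "subword tokens:", bpe_len)
for w in corpus:
    print(w, "->", encode(w, merges))
```

With zero merges the output is identical to the char encoding, so char is effectively the smallest-vocabulary end of the same spectrum. A larger vocabulary gives shorter targets but more output classes to learn, which is one reason subword does not automatically mean a better WER.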