ASR : Is BPE ASR model is better in accuracy compared to normal char #1703

harrypotter90 · 2021-02-03T12:06:21Z

harrypotter90
Feb 3, 2021

I have been using ASR and there are two variant one is normal and the other is the BPE variant config file

Char: https://github.com/NVIDIA/NeMo/blob/main/examples/asr/experimental/conformer/conformer_char.yaml
BPE: https://github.com/NVIDIA/NeMo/blob/main/examples/asr/experimental/conformer/conformer_bpe.yaml

I know that BPE is for Unicode characters and char is where we can represent transcripts with normal characters.
But will there be more accuracy if I use BPE instead of char for English ASR training?

I have been reading some other articles and I am a bit confused, so the question may be stupid. Please clarify.

Answered by titu1994

Feb 3, 2021

Subword automatically will not guarantee you better WER, but it does have a few advantages that help make it possible for models to perform better.

Theres a new tutorial for ASR with subword encoding here
which you could look into for some info on how subword helps CTC based model training - https://colab.research.google.com/github/NVIDIA/NeMo/blob/main/tutorials/asr/08_ASR_with_Subword_Tokenization.ipynb

View full answer

titu1994 · 2021-02-03T17:17:49Z

titu1994
Feb 3, 2021
Maintainer

Subword automatically will not guarantee you better WER, but it does have a few advantages that help make it possible for models to perform better.

Theres a new tutorial for ASR with subword encoding here
which you could look into for some info on how subword helps CTC based model training - https://colab.research.google.com/github/NVIDIA/NeMo/blob/main/tutorials/asr/08_ASR_with_Subword_Tokenization.ipynb

1 reply

VahidooX Mar 11, 2021
Collaborator

When you use subsampling like the one we have in Conformer, BPE based models may give you better WER.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ASR : Is BPE ASR model is better in accuracy compared to normal char #1703

{{title}}

{{editor}}'s edit

{{editor}}'s edit

Replies: 1 comment 1 reply

{{title}}

{{title}}

Select a reply

ASR : Is BPE ASR model is better in accuracy compared to normal char #1703

harrypotter90 Feb 3, 2021

Replies: 1 comment · 1 reply

titu1994 Feb 3, 2021 Maintainer

VahidooX Mar 11, 2021 Collaborator

harrypotter90
Feb 3, 2021

Replies: 1 comment 1 reply

titu1994
Feb 3, 2021
Maintainer

VahidooX Mar 11, 2021
Collaborator