GitHub - rhss10/japanese_asr_nia-22-2-10: Code to fine-tune Wav2vec2-xls-r on a Japanese ASR dataset, as part of the 2022 NIA 2-10 research work

General

Source code for fine-tuning a self-supervised learning (SSL) model on the NIA-2022-2-10 Japanese Dataset (1,028 hours in total) for Japanese Automatic Speech Recognition (ASR).
The NIA-2022-2-10 Japanese Dataset is scheduled for release in 2023.
More information on dataset usage and Docker support will be added when the dataset is released.
If you are reading this file through Docker ver 3.3, you may want to refer to https://github.com/rhss10/japanese_asr_nia-22-2-10 for the latest code.

License

SPDX-FileCopyrightText: © 2023 Hyungshin Ryu <rhss10@snu.ac.kr>
SPDX-License-Identifier: Apache-2.0

Setup

The code splits the dataset 8:1:1 into train, valid, and test + (files over 17 seconds) sets for efficient GPU usage.
The training code uses the basic Trainer API from Hugging Face. By default, the SSL model is set to Wav2Vec2-xls-r-300m.
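
The split described above can be sketched in plain Python. This is a minimal illustration, not the repository's create_datasets.py: the function name split_dataset, the (path, duration) input format, and the choice to set long files aside before splitting are all assumptions made for the example.

```python
import random

def split_dataset(items, max_seconds=17.0, ratios=(8, 1, 1), seed=0):
    """Split (path, duration_seconds) pairs into train/valid/test plus a long-file set.

    Assumption: files longer than max_seconds are set aside before the split.
    """
    long_files = [it for it in items if it[1] > max_seconds]
    short_files = [it for it in items if it[1] <= max_seconds]

    # Shuffle a copy deterministically so the split is reproducible.
    rng = random.Random(seed)
    shuffled = short_files[:]
    rng.shuffle(shuffled)

    total = sum(ratios)
    n_train = len(shuffled) * ratios[0] // total
    n_valid = len(shuffled) * ratios[1] // total

    return {
        "train": shuffled[:n_train],
        "valid": shuffled[n_train:n_train + n_valid],
        "test": shuffled[n_train + n_valid:],
        "long": long_files,
    }

# Hypothetical example: 100 short clips and 5 clips over 17 seconds.
example = [(f"clip_{i}.wav", 5.0) for i in range(100)]
example += [(f"long_{i}.wav", 20.0) for i in range(5)]
splits = split_dataset(example)
```

Keeping very long clips out of the training batches bounds the padded batch length, which is the usual reason a duration cap helps GPU memory usage.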

Performance

Valid WER, CER: 4.50%, 2.33%
Test WER, CER: 5.12%, 2.62%
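
For reference, WER and CER are both edit-distance ratios: the number of substitutions, insertions, and deletions needed to turn the hypothesis into the reference, divided by the number of reference words (WER) or characters (CER). The sketch below is a generic implementation, not the repository's evaluation code; splitting words on whitespace and ignoring spaces for CER are assumptions, and the tokenization actually used for the Japanese transcripts may differ.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences (each edit costs 1)."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]

def error_rate(refs, hyps, tokenize):
    """Corpus-level error rate: total edits divided by total reference tokens."""
    edits = 0
    total = 0
    for ref, hyp in zip(refs, hyps):
        ref_tokens, hyp_tokens = tokenize(ref), tokenize(hyp)
        edits += edit_distance(ref_tokens, hyp_tokens)
        total += len(ref_tokens)
    return edits / total

def wer(refs, hyps):
    # Assumption: words are separated by whitespace.
    return error_rate(refs, hyps, str.split)

def cer(refs, hyps):
    # Assumption: whitespace is ignored when counting characters.
    return error_rate(refs, hyps, lambda s: list(s.replace(" ", "")))

# Hypothetical reference/hypothesis pair, pre-segmented with spaces.
refs = ["今日 は 晴れ"]
hyps = ["今日 は 雨"]
word_error = wer(refs, hyps)
char_error = cer(refs, hyps)
```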

For Test-only (TTA Qualification/Docker)

Notes

This section is for:
1. TTA Qualification
2. Those who want to test the performance of the best model checkpoint using Docker (model checkpoint included inside)
Given the audio and JSON directory paths, the test script evaluates the WER/CER of the model checkpoint.
An example test split list is shown in test_final.txt.
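
As an illustration of what supplying an audio directory and a JSON directory can look like, the sketch below pairs each audio file with the label file that shares its stem. It is not taken from test/extract_data_test.py: the helper name pair_audio_with_labels, the .wav extension, and matching by file stem are assumptions, and the actual directory layout of the dataset may differ.

```python
from pathlib import Path

def pair_audio_with_labels(audio_dir, json_dir, audio_ext=".wav"):
    """Return (audio_path, json_path) pairs that share the same file stem."""
    # Index label files by stem, searching subdirectories as well.
    labels = {p.stem: p for p in Path(json_dir).rglob("*.json")}
    pairs = []
    for audio in sorted(Path(audio_dir).rglob(f"*{audio_ext}")):
        label = labels.get(audio.stem)
        if label is not None:  # skip audio files without a label file
            pairs.append((audio, label))
    return pairs
```

Audio files with no matching label are skipped silently here; a stricter script might report them instead.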

Command

cd test
# Don't forget to change DIRECTORY PATH inside test/extract_data_test.py before executing the bash script
sh test.sh
# Done!

For Train/Test

1. Prepare Data

# Data processing should be done beforehand on the ACTUAL data path
# The example files (data/nia-10*.txt) will not work!
python extract_data.py
python create_datasets.py

2. Train

# Example command for training. Refer to train.py for more supported arguments
python train.py --exp_prefix NIA-10

3. Test

# Example
python test.py

