Training code for Voxcentum: Spoken Language Identification for 100+ Languages Expanded to 100+ Hours.
Python version == 3.10.8
is recommended.
Install required packages using requirements.txt
.
conda create -n voxcentum python=3.10.8
conda activate voxcentum
conda install pip
pip install -r requirements.txt
TBD
This step creates training and testing files.
python generate_manifest.py --raw_data /path/to/raw_data --meta_store_path manifest
Data should be structured as follows (having subfolders under each language is fine):
├── /path/to/raw_data
├── language_x
...
├── language_y
...
└── language_z
...
This step starts training the model for language identification. Remember to check config.yaml
for hyperparameters.
python training.py config.yaml
python inference.py --model_path /path/to/ckpt --manifest_dir /path/to/manifest --output /output/dir