Skip to content

Keras implementation of Bidirectional LSTM & CNN layers, which predicts the associated phone sequences given the acoustic signals.

Notifications You must be signed in to change notification settings

danielchyeh/Sequence_Labeling

Repository files navigation

Sequence_Labeling

– Bidirectional LSTM & CNN layers are used for training the model, which predicts the associated phone sequences given the acoustic signals.

Project Link: https://www.csie.ntu.edu.tw/~yvchen/f106-adl/A1

Dataset

TIMIT dataset, which has MFCC features (39 dims) and FBank features (69 dims) for each frame is used in this task. It also has 48 different kinds of phone.

TIMIT dataset can be downloaded from kaggle https://www.kaggle.com/c/hw1-timit/data (created by class MLDS2017, NTU)

Quick start

Run the shell script

./hw1_best.sh [input directory] [output filename]

or any of the following: hw1_cnn.sh, hw1_rnn.sh

[input directory] should be TIMIT dataset which you download from link above

[output filename] a .csv file which shows the result of prediction

About

Keras implementation of Bidirectional LSTM & CNN layers, which predicts the associated phone sequences given the acoustic signals.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published