Speech Recognition

This program uses convolutional neural networks (CNNs), spectrograms, and various image-processing techniques to build and recognize words from an "audio dictionary".
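As a rough illustration of the spectrogram step, the sketch below turns a 1-D audio signal into a log-magnitude spectrogram that a CNN could treat as an image. It is a minimal NumPy-only example, not the code in this repository: the function name `spectrogram` and the `frame_len` and `hop` values are assumptions chosen for illustration.

```python
import numpy as np


def spectrogram(signal, frame_len=256, hop=128):
    """Return a log-magnitude spectrogram of shape (n_frames, frame_len // 2 + 1).

    frame_len and hop are illustrative defaults, not values taken from the project.
    """
    signal = np.asarray(signal, dtype=np.float64)
    # Pad very short signals so that at least one full frame exists.
    if signal.size < frame_len:
        signal = np.pad(signal, (0, frame_len - signal.size))
    n_frames = 1 + (signal.size - frame_len) // hop
    window = np.hanning(frame_len)
    # Slice the signal into overlapping frames and apply a Hann window to each.
    frames = np.stack(
        [signal[i * hop : i * hop + frame_len] * window for i in range(n_frames)]
    )
    # Magnitude of the real FFT of each frame, compressed with log(1 + x).
    magnitude = np.abs(np.fft.rfft(frames, axis=1))
    return np.log1p(magnitude)
```

Each row of the result is one time frame and each column one frequency bin, so the array can be saved or fed to a network like a single-channel image.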

Data used for this model

The current accuracy (measured on an unseen data set) tends to be around 95% for the datahandle.py implementation.

Add:

separate the training of the neural net from its implementation, so each is a discrete step
tqdm for training progress (maybe)
implement in tensorflowRT/TFLite in order to process continuous speech
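One possible way to approach continuous speech, independent of the runtime chosen, is to slide a fixed-length window over the incoming audio and classify each window on its own. The sketch below shows only that windowing idea. The names `sliding_windows` and `label_stream`, and the `classify` callable that stands in for the trained model, are hypothetical and not part of this repository.

```python
import numpy as np


def sliding_windows(stream, window_len, hop):
    """Split a 1-D stream into overlapping windows of shape (n_windows, window_len)."""
    stream = np.asarray(stream)
    if stream.size < window_len:
        # Not enough samples for even one window.
        return np.empty((0, window_len), dtype=stream.dtype)
    starts = range(0, stream.size - window_len + 1, hop)
    return np.stack([stream[s : s + window_len] for s in starts])


def label_stream(stream, window_len, hop, classify):
    """Return (start_index, label) pairs, one per window.

    classify is a placeholder for whatever maps a window to a word label.
    """
    windows = sliding_windows(stream, window_len, hop)
    return [(i * hop, classify(w)) for i, w in enumerate(windows)]
```

A smaller `hop` gives more overlap between windows, which makes it less likely that a word is cut in half, at the cost of more model calls per second of audio.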

Bugs:

The model uses the Keras v1 model saving/loading protocols. Review the docs and update them so that model training can be resumed. Currently the callbacks have issues with saving in the ipynb version.
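For reference while reviewing the docs, here is a minimal sketch of whole-model saving with `tf.keras`, which stores the architecture, weights, and optimizer state in one file so that training can continue from it. It assumes a recent TensorFlow release that supports the `.keras` file format. The tiny network in `build_model`, the helper `resume_training`, and the file name are hypothetical stand-ins, not this project's model or fix.

```python
import os
import tempfile

import numpy as np
import tensorflow as tf


def build_model(input_shape=(32, 32, 1), num_classes=4):
    """Tiny CNN standing in for the real network (architecture is hypothetical)."""
    model = tf.keras.Sequential([
        tf.keras.layers.Input(shape=input_shape),
        tf.keras.layers.Conv2D(8, 3, activation="relu"),
        tf.keras.layers.MaxPooling2D(),
        tf.keras.layers.Flatten(),
        tf.keras.layers.Dense(num_classes, activation="softmax"),
    ])
    model.compile(
        optimizer="adam",
        loss="sparse_categorical_crossentropy",
        metrics=["accuracy"],
    )
    return model


def resume_training(path, x, y, start_epoch, end_epoch):
    """Load a whole saved model and continue training from start_epoch."""
    model = tf.keras.models.load_model(path)
    # initial_epoch makes epoch numbering continue where the earlier run stopped.
    history = model.fit(
        x, y, initial_epoch=start_epoch, epochs=end_epoch, verbose=0
    )
    return model, history
```

A typical flow would be to call `model.save("checkpoint.keras")` after the first training run, then `resume_training("checkpoint.keras", x, y, 1, 2)` in a later session.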


rlew631/SpeechRecognition
