Skip to content

stm32-speech-recognition-and-traduction is a project developed for the Advances in Operating Systems exam at the University of Milan (academic year 2020-2021). It implements a speech recognition and speech-to-text translation system using a pre-trained machine learning model running on the stm32f407vg microcontroller.

License

Notifications You must be signed in to change notification settings

FedericaPaoli1/stm32-speech-recognition-and-traduction

Repository files navigation

BSD 2-Clause License


stm32-speech-recognition-and-traduction

Table of Contents
  1. About The Project
  2. Getting Started
  3. Usage
  4. License
  5. Contact

About The Project

stm32-speech-recognition-and-traduction is a project developed for the Operating Systems Design exam at the University of Milan (academic year 2020-2021). It implements a speech recognition and speech-to-text translation system using a pre-trained machine learning model. The system is able to distinguish a restricted set of words to limit the complexity of the final model so that it can be run on the stm32f407vg microcontroller where memory and calculation capacity are limited. In particular, the voice recognition system distinguishes simple commands such as switching the LEDs on and off and enabling the translation system. The latter, on the other hand, processes the voice signal by converting it into text, which is then displayed on the screen.

Built With

Getting Started

Required libraries

To correctly build this project, you'll need the X-CUBE AI and PDM2PCM ST libraries. You can download them directly from ST official website and install in the root level of this project.

Prerequisites

  • STM32F407VG board, equipped with both UART/USART interface and ST-MEMS digital microphone
  • USART-USB dongle to connect the pins of the UART/USART interface to a PC via USB

Usage

To enable the microphone mounted on the microcontroller to acquire a new word, the user B1 button must be pressed. Once this is done, a one-second audio signal is acquired. Depending on the word spoken, different actions are performed.

  • Say the word ON twice in succession; this word enables the printing system on the terminal.

terminal_on




  • Say the word OFF, after pronouncing the word ON.

terminal_off




  • Say various words, after pronouncing the word ON, followed by the pronunciation of the word OFF.

terminal_various_words




  • Say ONE to turn on the green LED, after the word ON has been pronounced.

terminal_one


board_one




  • Say TWO to turn on the blue LED, after the word ON and ONE has been pronounced.

terminal_two


board_two




  • Say THREE to turn on the red LED, after the word ON, ONE and TWO has been pronounced.

terminal_three


board_three




  • Say FOUR to turn on the orange LED, after the word ON, ONE, TWO and THREE has been pronounced and followed by the word OFF

terminal_four


board_four




  • Say the word VISUAL three times in succession to show the calculation of the execution time statistics. This is followed by the pronunciation of the word STOP, followed by the pronunciation of the word VISUAL to show the reset of the execution times (expressed in clock cycles).

terminal_execution_times




License

Distributed under the BSD 2-Clause License. See LICENSE for more information.

Contact

Federica Paoli' - federicapaoli1@gmail.com
Stefano Taverni - ste.taverni@gmail.com

Project Link: https://github.com/FedericaPaoli1/stm32-speech-recognition-and-traduction

About

stm32-speech-recognition-and-traduction is a project developed for the Advances in Operating Systems exam at the University of Milan (academic year 2020-2021). It implements a speech recognition and speech-to-text translation system using a pre-trained machine learning model running on the stm32f407vg microcontroller.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published