As we know Python is a suitable language for scriptwriters and developers. Let’s write a script for Voice Assistant using Python. The query for the assistant can be manipulated as per the user’s need. Speech recognition is the process of converting audio into text. This is commonly used in voice assistants like Alexa, Siri, etc. Python provides an API called SpeechRecognition to allow us to convert audio into text for further processing. In this article, we will look at converting large or long audio files into text using the SpeechRecognition API in python.
This is a desktop assistant which aims to assist you while your working on your computer. It can open an exisiting or folder,open websites,provide search results based on input from keyboard or by using mic.
Check the guide.png files for supported operations.
platform used-Pycharm Coommunity Edition 2019.3.2, ,Python 3.8 ,Database used:SQLLite.
pyler, wikipedia, wikipediaApi, pyscopg2, pyaudio, pillow, pyttsx3, speech_recognition,
- Speech Recognition — Speech recognition is an important feature used in house automation and artificial intelligence devices. The main function of this library is it tries to understand whatever the humans speak and converts the speech to text.
- pyttsx3 — pyttxs3 is a text to speech conversion library in python. This package supports text to speech engines on Mac os x, Windows and on Linux.
- wikipedia — Wikipedia is a multilingual online encyclopedia used by many people from academic community ranging from freshmen to students to professors who wants to gain information over a particular topic. This package in python extracts data’s required from Wikipedia.
- ecapture — This module is used to capture images from your camera
- datetime — This is an inbuilt module in python and it works on date and time
- os — This module is a standard library in python and it provides the function to interact with operating system
- time — The time module helps us to display time
- Web browser — This is an in-built package in python. It extracts data from the web
- Subprocess — This is a standard library use to process various system commands like to log off or to restart your PC.
Functions Implemented : (All these functionalities can be implemented either by giving voice input or by giving user text input)
- Opens Camera
- Opens Specified site on browser
- Gives information about a specific topic from google
- Gives weather updates on a specific location
- Gives information about a specific topic from wikipedia
- Opens a particular file or folder