separate server handling speech synthesis and recognition

BrainBloomAI/SpeechService

BrainBloomAI's Speech Service

Handles speech synthesis and recognition within BrainBloomAI, with multi-language support.

  • Recognition: uses WebSockets to stream raw audio data (Opus codec in MPEG containers) for live transcription
  • Synthesis: uses simple HTTP requests to obtain streamable audio data in the AAC codec
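
From a client's point of view, the two transports above could be addressed roughly as follows. This is only an illustrative sketch: the base URL, the `/recognise` and `/synthesise` paths, and the `text` query parameter are hypothetical names, not taken from the service's source.

```javascript
// Hypothetical client-side helpers; all paths and parameter names are assumptions.
const SERVICE_BASE = "http://localhost:3000"; // assumed address of the Speech Service

// Recognition runs over a WebSocket, so swap the http(s) scheme for ws(s).
function recognitionUrl(base, path = "/recognise") {
  const wsBase = base.replace(/^http/, "ws");
  return new URL(path, wsBase).href;
}

// Synthesis is a plain HTTP request; here the text travels as a query parameter.
function synthesisUrl(base, text, path = "/synthesise") {
  const url = new URL(path, base);
  url.searchParams.set("text", text);
  return url.href;
}

console.log(recognitionUrl(SERVICE_BASE));
console.log(synthesisUrl(SERVICE_BASE, "Hello world"));
```

In a browser, the first URL would typically be handed to `new WebSocket(...)`, with audio chunks sent through `socket.send(chunk)`, while the second could be passed to `fetch(...)` or set as the `src` of an `<audio>` element.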

Startup

To start a development server:

npm i
npm start

Alternatively, you may use pnpm as a drop-in replacement for npm.

The server is lightweight and has no build step.

/src/config.js

Configuration for frontend

  • .PORT: (number) port number the Speech Service server will run on
  • .invokeOrigin: (string) origin of BrainBloomAI's frontend (e.g. "http://localhost:8000", no trailing forward slash), used to set the CORS policy for WebSocket connections
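
As a rough illustration, the configuration object might look like the sketch below. Only the `PORT` and `invokeOrigin` field names come from this README; the port value, the `isBareOrigin` helper, and the way the object is exported are assumptions. The helper shows why the value should be a bare origin (scheme, host, and port with no slash after it): browsers send the `Origin` header in exactly that form, so a CORS comparison against a value with extra characters would not match.

```javascript
// Hypothetical shape of /src/config.js; the real file may differ.
const config = {
  PORT: 3000, // assumed value: port the Speech Service listens on
  invokeOrigin: "http://localhost:8000", // frontend origin, no slash at the end
};

// Hypothetical helper: true only when `value` is exactly scheme://host[:port].
function isBareOrigin(value) {
  try {
    return new URL(value).origin === value;
  } catch {
    return false; // not a parseable absolute URL
  }
}

if (!isBareOrigin(config.invokeOrigin)) {
  throw new Error(`invokeOrigin must be a bare origin, got: ${config.invokeOrigin}`);
}

// The object would then be exported in whichever module style the project uses.
```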

.env

Only one environment variable is used; it maintains sessions with Deepgram.

  • DEEPGRAM_API_KEY: (string) API key supplied by Deepgram
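
A `.env` file for this service would then hold a single line of the form `DEEPGRAM_API_KEY=your-key-here` (placeholder value). The sketch below shows one way a server could fail fast when the key is absent. The `requireEnv` helper is hypothetical, and how the `.env` file is loaded into `process.env` is not stated in this README.

```javascript
// Hypothetical startup check; not taken from the service's source.
// `env` defaults to process.env but can be swapped out for testing.
function requireEnv(name, env = process.env) {
  const value = env[name];
  if (typeof value !== "string" || value.trim() === "") {
    throw new Error(`Missing required environment variable: ${name}`);
  }
  return value;
}

// At startup one might call:
//   const apiKey = requireEnv("DEEPGRAM_API_KEY");
```

Checking the variable once at startup surfaces a missing key immediately rather than on the first transcription request.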

Dependencies

©️ 2024 BrainBloomAI Team. All rights reserved.
