
Speech to Text REST API built with Flask, exposing two endpoints. It uses the DeepSpeech and CMU Sphinx models to transcribe audio, which can either be recorded from the web interface or uploaded directly through it.


CodeBreaker444/speech-to-text-web-flask-python-deepspeech-model


Speech To Text🔈 API written in Python, using the Flask microframework as the backend.

💻Technology Stack

  1. Frontend : HTML, Bootstrap, Recorder.js, AudioDisplay.js
  2. Backend : Flask
  3. Speech Recognition : Includes two models (DeepSpeech, CMU Sphinx)
  4. Deployment : WSGI, AWS
  5. Endpoints : "/generate_transcript", "/download_transcript"
     - /generate_transcript : Accepts a POST request with the parameter "file" containing the audio form data from Recorder.js
     - /download_transcript : Accepts a GET request and downloads the transcript saved in output.txt
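As a rough illustration of how a client other than the bundled web page might call the two endpoints, here is a minimal sketch using only the Python standard library. The base URL (Flask's default development address), the file name `recording.wav`, and the helper names `build_multipart`, `generate_transcript`, and `download_transcript` are assumptions made for this example and are not taken from the project; the response format is not documented here, so the raw bytes are returned as-is.

```python
import urllib.request
import uuid

# Assumption: the app is served at Flask's default development address.
BASE_URL = "http://127.0.0.1:5000"


def build_multipart(field_name, filename, data, boundary=None):
    """Encode one file as a multipart/form-data body.

    Returns a (body_bytes, content_type_header) tuple.
    """
    boundary = boundary or uuid.uuid4().hex
    head = (
        f"--{boundary}\r\n"
        f'Content-Disposition: form-data; name="{field_name}"; '
        f'filename="{filename}"\r\n'
        "Content-Type: application/octet-stream\r\n\r\n"
    ).encode("utf-8")
    tail = f"\r\n--{boundary}--\r\n".encode("utf-8")
    return head + data + tail, f"multipart/form-data; boundary={boundary}"


def generate_transcript(audio_bytes, filename="recording.wav", base_url=BASE_URL):
    """POST audio to /generate_transcript under the form field "file"."""
    body, content_type = build_multipart("file", filename, audio_bytes)
    request = urllib.request.Request(
        f"{base_url}/generate_transcript",
        data=body,
        headers={"Content-Type": content_type},
        method="POST",
    )
    with urllib.request.urlopen(request) as response:
        return response.read()  # raw response bytes; format not assumed


def download_transcript(base_url=BASE_URL):
    """GET /download_transcript and return the transcript file's bytes."""
    with urllib.request.urlopen(f"{base_url}/download_transcript") as response:
        return response.read()
```

With the application running, `generate_transcript(open(path, "rb").read())` followed by `download_transcript()` would mirror what the web interface does, assuming the server accepts the uploaded audio format.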

⚒Testing

Three audio files are included in Files/Audio/test_audio/

The transcript is stored in Files/Transcript/output.txt

⚡️Run

chmod +x run_me.sh && ./run_me.sh

(This script installs all the dependencies for this project and runs the flask application)

Note : This project was tested with Python 3.6 on a Mac running macOS Catalina.

A manual run can be done by running aigalore_mainfile.py. No arguments are needed.

📁Sample Output

your power is sufficient i said
Total Recognised Words:6
Words Per Minute:169.81132075471697
Total Filler Words:3
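
The word count and words-per-minute figures above can be reproduced with simple arithmetic. The sketch below is an assumption about how such statistics might be computed, not the project's actual code: the function name `transcript_stats` is hypothetical, and the 2.12 second duration is inferred from the sample numbers (6 words at roughly 169.81 words per minute) rather than stated anywhere in the project.

```python
def transcript_stats(transcript, duration_seconds):
    """Return (word_count, words_per_minute) for a transcript.

    Words are split on whitespace; duration_seconds is the audio length.
    """
    if duration_seconds <= 0:
        raise ValueError("duration_seconds must be positive")
    word_count = len(transcript.split())
    words_per_minute = word_count / duration_seconds * 60
    return word_count, words_per_minute


# Assumed duration of 2.12 s, back-calculated from the sample output.
count, wpm = transcript_stats("your power is sufficient i said", 2.12)
print(f"Total Recognised Words:{count}")
print(f"Words Per Minute:{wpm}")
```

Under that assumed duration, the result is 6 words at approximately 169.81 words per minute, in line with the sample.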

📭About me


website: govardhanchitrda.com
