GitHub - CodeBreaker444/speech-to-text-web-flask-python-deepspeech-model: Speech to Text REST API built with Flask, with two endpoints. Uses the DeepSpeech and CMU Sphinx models to transcribe audio, which can either be recorded from the web interface or uploaded directly through it.

Speech To Text🔈 API written in Python using the Flask microframework as the backend.

💻Technology Stack

Frontend : HTML, Bootstrap, Recorder.js, AudioDisplay.js
Backend : Flask
Speech Recognition : Includes two models (DeepSpeech, CMU Sphinx)
Deployment : WSGI, AWS
Endpoints : "/generate_transcript", "/download_transcript"
/generate_transcript : Accepts a POST request with the parameter "file" containing the audio form data from Recorder.js
/download_transcript : Accepts a GET request and downloads the transcript saved in output.txt
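
As a rough illustration of how a client might talk to the two endpoints listed above, here is a minimal sketch using only the Python standard library. The base URL (Flask's default development address), the `audio/wav` content type, and the helper names `build_multipart`, `transcript_request`, and `download_request` are assumptions made for this example and are not taken from the project; only the endpoint paths and the "file" parameter come from the description above.

```python
import uuid
import urllib.request

# Assumption: the Flask app is reachable at Flask's default development
# address; change this to match your own deployment.
BASE_URL = "http://127.0.0.1:5000"


def build_multipart(field_name, filename, payload, content_type="audio/wav"):
    """Encode one file as multipart/form-data.

    Returns a (body_bytes, content_type_header) tuple.
    The "audio/wav" default is an assumption, not documented by the project.
    """
    boundary = uuid.uuid4().hex
    head = (
        f"--{boundary}\r\n"
        f'Content-Disposition: form-data; name="{field_name}"; '
        f'filename="{filename}"\r\n'
        f"Content-Type: {content_type}\r\n\r\n"
    ).encode("utf-8")
    tail = f"\r\n--{boundary}--\r\n".encode("utf-8")
    return head + payload + tail, f"multipart/form-data; boundary={boundary}"


def transcript_request(audio_bytes, filename="audio.wav"):
    """Build (but do not send) the POST request for /generate_transcript."""
    # The form field is named "file", as stated in the endpoint description.
    body, header = build_multipart("file", filename, audio_bytes)
    return urllib.request.Request(
        BASE_URL + "/generate_transcript",
        data=body,
        headers={"Content-Type": header},
        method="POST",
    )


def download_request():
    """Build (but do not send) the GET request for /download_transcript."""
    return urllib.request.Request(BASE_URL + "/download_transcript", method="GET")
```

With the application running, either request can be sent by passing it to `urllib.request.urlopen`. The shape of the response from /generate_transcript is not described here, so the sketch stops short of parsing it.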

⚒Testing

Three test audio files are included in Files/Audio/test_audio/

The transcript is stored in Files/Transcript/output.txt

⚡️Run

chmod +x run_me.sh && ./run_me.sh

(This script installs all the dependencies for this project and runs the Flask application)

Note : This project was tested with Python 3.6 on a Mac running macOS Catalina.

To run manually, execute aigalore_mainfile.py. No arguments are needed.

📁Sample Output

your power is sufficient i said
Total Recognised Words:6
Words Per Minute:169.81132075471697
Total Filler Words:3
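
The word count and words-per-minute figures above are related by simple arithmetic: words per minute is the number of recognised words divided by the clip length in minutes. Six words at roughly 169.81 words per minute implies a clip of about 2.12 seconds; that duration is inferred from the numbers, not stated by the project. The sketch below shows one way such statistics could be computed. The function name `transcript_stats` and its `filler_words` parameter are hypothetical, and since the project's filler-word list is not documented here, the sketch does not try to reproduce the filler count shown above.

```python
def transcript_stats(transcript, duration_seconds, filler_words=()):
    """Compute simple statistics for a transcript.

    transcript       -- recognised text as a single string
    duration_seconds -- length of the audio clip in seconds
    filler_words     -- hypothetical list of words to count as fillers
    """
    words = transcript.split()
    minutes = duration_seconds / 60.0
    fillers = {w.lower() for w in filler_words}
    return {
        "total_words": len(words),
        # Guard against a zero-length clip to avoid dividing by zero.
        "words_per_minute": len(words) / minutes if minutes > 0 else 0.0,
        "filler_words": sum(1 for w in words if w.lower() in fillers),
    }


if __name__ == "__main__":
    # 2.12 s is the duration implied by the sample figures (an assumption).
    stats = transcript_stats("your power is sufficient i said", 2.12)
    print("Total Recognised Words:", stats["total_words"])
    print("Words Per Minute:", stats["words_per_minute"])
```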

📭About me

website: govardhanchitrda.com

