Skip to content

Audio Diagnose, Media Enhance, and Speech-to-Text running on Azure Serverless Functions.

License

Notifications You must be signed in to change notification settings

dolbyio-samples/media-azure-serverless-workflow

description languages products
An example Media Workflow with Dolby.io presented at the 2021 Azure Serverless Conf
python
azure-functions
azure-storage
azure-cognitive-services

Media Pipeline

This sample highlights how to use Azure functions and Dolby.io to create a media processing pipeline that transcribes and enhances input media.

Getting Started

Prerequisites

Params.json

  • "cog_api_key": Azure Cognitive Services Speech-to-text API Key,

  • "dolbyio_api_key": Dolby.io media API Key,

  • "azure_api_key" : Azure storage account API key,

  • "AZURE_ACC_NAME" : Azure storage account name,

  • "AZURE_CONTAINER" : Azure storage container name,

  • "AZURE_BLOB_OUTPUT_SUFFIX" : the output suffix added to the enhanced version of the input media,

  • "SCORE_THRESHOLD" : the score threshold that decides if the audio quality of the input media is too low,

  • "transcription_url" : the Azure cognitive services input URL for transcription, dependent on region.

  • "tunneling_url" : The location of your Serverless HTTP trigger, or your HTTP tunnel (if you run locally)

Steps

Local with VS Code (Recommended for development)

Server with VS Code (Recommended for deployment)

References

About

Audio Diagnose, Media Enhance, and Speech-to-Text running on Azure Serverless Functions.

Topics

Resources

License

Code of conduct

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages