This repository uses Dolby.io Communications REST API and Deepgram's Pre-recorded Audio API in Python.
In the blog, the focus is on the importance of accessibility in conference recordings and how to automate the transcription generation process. It explores using Dolby.io Communications REST APIs along with Deepgram's Pre-recorded Audio API in Python as an example to transcribe conference recordings to text. The aim is to provide alternative ways for people to consume the information, such as faster reading, accommodating different languages, and catering to deaf and hard of hearing people.
To follow along you will need:
- A Dolby.io Account, this is to obtain the API Key and Secret Key
- A Deepgram account to access their Pre-recorded Audio API which requires an API Key
Before we begin coding, we need to ensure we have all the proper libraries for calling these APIs. We can do this with a simple pip command (use the appropriate pip command for your operating system):
pip3 install asyncio deepgram-sdk dolbyio-rest-apis
This will install both the Dolby.io and Deepgram SDKs, as well as Python’s native asynchronous function library to aid us in calling the async requests the two SDKs use.
In the case any bugs occur, report it using Github issues, and we will see to it.
We welcome your interest in trying to experiment with our repos.
If there are any suggestions or if you would like to deliver any positive notes, feel free to open an issue and let us know!
For a deeper dive, we welcome you to review the following:
- Communications API
- Cloud-Based Audio Production Workflows with Dolby.io, Dropbox + Zapier
- Beginner’s Guide to Diagnosing Audio Issues as Part of an Azure Serverless Media Workflow
- Generate a Transcript of Your Meeting
- Blog Session - Communications API
Using decades of Dolby's research in sight and sound technology, Dolby.io provides APIs to integrate real-time streaming, voice & video communications, and file-based media processing into your applications. Sign up for a free account to get started building the next generation of immersive, interactive, and social apps.