Use OpenAI's Whisper to transcribe audio files and diariaze speakers of the transcribed text
-
Updated
Jun 11, 2024 - Python
Use OpenAI's Whisper to transcribe audio files and diariaze speakers of the transcribed text
Timestamped ASR microservice
Web app for transcribing audio file (.wav format) to text usingGoogle Cloud Speech API.
Transcribe Bangla Audio into Text
AWS Lambda Function which creates a transcribe job, that reads mp3 file and converts it into text format in a json file.
Whisper is a pre-trained model for automatic speech recognition (ASR) and speech translation. In this template, we will import the Whisper model on Inferless Platform.
A Windows desktop application that can generate subtitles, translations, and summaries for videos in 8 languages using API and SDK from Tencent, Alibaba, and Baidu. You can use it for generating bilingual transcripts for videos and summarising the key points from the transcript using LexRank.
This application contains "Audio to text", "Dictation" and "Gender prediction" modules in it.
AudioTextPro: Convert audio to text accurately in real-time using our advanced AI speech recognition technology. 🐍
Whisper Large V3 is a pre-trained model developed by OpenAI and designed for tasks like automatic speech recognition (ASR), speech translation and language identification.
Speech-to-Text using OpenAI's Whisper model
inter-convert between audio & text, easy to use with GUI desktop application by PaddleSpeech and PySide6.
TranscriptGen is an application for transcribing audio and video files. Transcription output is .txt or .srt. Most audio and video formats supported (with ffmpeg).
WER, MER, WIL of Whisper vs Vosk vs Google transcribators comparator
Transform audio recordings into text transcripts effortlessly with AudioTranscribe! 🎙️📝 Simplify your transcription process and enhance accessibility with top-notch accuracy. Explore the power of text-to-speech conversion today! 🚀🎧
Event-driven AI > A Python-Kafka event-driven micro-services solution for distributed audio transcriptions.
"Speech-to-Text Realtime with Extension" is a browser extension that converts speech to text in real-time. It supports multiple languages, making it ideal for note-taking, customer service, and accessibility. Easy to install and use on popular browsers.
Edge AI > AI app to easily perform transcriptions on regular computers. Quality on par with on-cloud alternatives. Lower costs. Reduced privacy risks.
Audio2TextBot is a Telegram bot that facilitates audio and video file processing to convert them into text format using various pre-trained models.
📼 A streamlit web interface designed to extract words from video/audio files into text • Python, FFmpeg, Whisper, YT-DLP
Add a description, image, and links to the audio-to-text topic page so that developers can more easily learn about it.
To associate your repository with the audio-to-text topic, visit your repo's landing page and select "manage topics."