Transcripts for Episodes of my Podcast "Recsperts - Recommender Systems Experts"
Repository for code and output of episode transcription
- Upload episode mp3-file to Google Drive Recsperts' audio folder
- Start this Notebook in Colab and choose GPU as runtime type
- Adapt filename for episode file
- Run the transcription with whisper
- Save pickle file to the Recsperts transcripts folder as
${episode_name}_raw.pkl
- Run postprocessing to generate
${episode_name}.txt
- Add, commit and push to git repository: https://github.com/mkurovski/recsperts-transcripts
- Upload to transistor (use API for this): https://developers.transistor.fm/
Model size that is used for transcriptions: small.en
.
For more info, check out https://github.com/openai/whisper