AudioTextPro: Convert audio to text accurately in real-time using our advanced AI speech recognition technology. 🐍

sebastienrousseau/audiotext

Python AudioTextPro

Convert audio to text accurately in real-time using our advanced AI speech recognition technology.

Overview 📖

AudioTextPro is a Python module designed to interact with the AssemblyAI API to transcribe audio files. It provides a simple and efficient way to convert speech into text using AssemblyAI's powerful transcription services.

Features ✨

  • Uploads audio files to AssemblyAI.
  • Requests transcripts of uploaded audio files.
  • Processes and stores the resulting transcripts.
  • Simple API interaction through Python functions.
  • Extensible for future additions, such as more complex processing of transcripts.
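The upload, request, and retrieve steps listed above can be sketched roughly as follows. This is a minimal illustration, not the module's actual code: the function name `transcribe` and its parameters are hypothetical, and the `http` argument is assumed to be any object exposing `requests`-style `post` and `get` methods (for example, the `requests` module itself). The endpoint paths follow AssemblyAI's public v2 REST API; check the current API reference before relying on them.

```python
import time

API_BASE = "https://api.assemblyai.com/v2"  # AssemblyAI v2 REST base URL


def transcribe(http, api_token, filename, timeout=300.0, poll_interval=3.0):
    """Upload an audio file, request a transcript, and poll until it is ready.

    Hypothetical helper: `http` is any object with requests-style
    `post`/`get` methods, e.g. the `requests` module.
    """
    headers = {"authorization": api_token}

    # 1. Upload the raw audio bytes; the response carries an upload URL.
    with open(filename, "rb") as audio:
        response = http.post(f"{API_BASE}/upload", headers=headers, data=audio)
    response.raise_for_status()
    upload_url = response.json()["upload_url"]

    # 2. Request a transcript of the uploaded audio.
    response = http.post(
        f"{API_BASE}/transcript",
        headers=headers,
        json={"audio_url": upload_url},
    )
    response.raise_for_status()
    transcript_id = response.json()["id"]

    # 3. Poll until the job completes, fails, or the timeout expires.
    deadline = time.monotonic() + timeout
    while True:
        response = http.get(
            f"{API_BASE}/transcript/{transcript_id}", headers=headers
        )
        response.raise_for_status()
        result = response.json()
        if result["status"] == "completed":
            return result
        if result["status"] == "error":
            raise RuntimeError(f"Transcription failed: {result.get('error')}")
        if time.monotonic() >= deadline:
            raise TimeoutError(f"No transcript after {timeout} seconds")
        time.sleep(poll_interval)
```

With `requests` installed, a call might look like `transcribe(requests, api_token, "samples/call_from_a_customer.mp3")`. Passing the HTTP client in as an argument is a design choice made for this sketch so the flow can be exercised without network access.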

Requirements 📋

  • Python 3.9 or higher
  • The requests and python-dotenv packages
  • An AssemblyAI API key

Installation 🛠

  1. Install the required packages:
pip install requests python-dotenv
  2. Clone the audiotextpro repository:
git clone https://github.com/sebastienrousseau/audiotextpro.git
  3. Add your AssemblyAI API key to a .env file in the project directory:
# API_TOKEN is used to authenticate with the AssemblyAI API.
# Replace with your actual API token from AssemblyAI.
API_TOKEN=""

# FILENAME should be the path to the audio file you wish to transcribe.
# Replace with the path to your actual file.
FILENAME=""

# TIMEOUT is the number of seconds to wait for the transcription to complete.
# Replace with the number of seconds you wish to wait.
TIMEOUT=""
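As an illustration of how these three values might be read and checked before any request is made, here is a minimal sketch. The names `Settings`, `read_settings`, `load_settings_from_dotenv`, and `DEFAULT_TIMEOUT` are hypothetical and do not come from the project, and the 300-second fallback for an empty TIMEOUT is an assumption made for this example.

```python
from dataclasses import dataclass
from typing import Mapping

DEFAULT_TIMEOUT = 300  # assumed fallback in seconds; not taken from the project


@dataclass(frozen=True)
class Settings:
    api_token: str
    filename: str
    timeout: int


def read_settings(env: Mapping[str, str]) -> Settings:
    """Validate the three .env values described above (hypothetical helper)."""
    api_token = env.get("API_TOKEN", "").strip()
    filename = env.get("FILENAME", "").strip()
    raw_timeout = env.get("TIMEOUT", "").strip()

    if not api_token:
        raise ValueError("API_TOKEN is empty; set your AssemblyAI API token")
    if not filename:
        raise ValueError("FILENAME is empty; set the path to an audio file")
    # int() raises ValueError for non-numeric input such as "ten".
    timeout = int(raw_timeout) if raw_timeout else DEFAULT_TIMEOUT
    if timeout <= 0:
        raise ValueError("TIMEOUT must be a positive number of seconds")
    return Settings(api_token, filename, timeout)


def load_settings_from_dotenv() -> Settings:
    """Read .env from the working directory, then validate its values."""
    import os

    from dotenv import load_dotenv  # provided by the python-dotenv package

    # Copies KEY=value pairs from .env into os.environ; variables that are
    # already set in the environment are left unchanged by default.
    load_dotenv()
    return read_settings(os.environ)
```

Keeping the validation in `read_settings` separate from the `.env` loading means the checks can be run against any dictionary, which makes a misconfigured file easier to diagnose.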

Usage 🚀

Command Line Interface

To use AudioTextPro, navigate to the project directory in your terminal and run the following command:

python3 -m audiotextpro

Make sure FILENAME points to an audio file that is readable from the project directory and that API_TOKEN holds a valid key; otherwise the upload or transcription request will fail.

File Structure 📁

.
├── LICENSE-APACHE
├── LICENSE-MIT
├── MANIFEST.in
├── Makefile
├── README.md
├── audiotextpro
│   ├── __init__.py
│   ├── __main__.py
│   ├── api.py
│   ├── file.py
│   ├── transcript.json
│   ├── transcript.txt
│   ├── transcript_processor.py
│   └── utils.py
├── pyproject.toml
├── requirements.txt
├── samples
│   └── call_from_a_customer.mp3
├── setup.cfg
├── setup.py
└── transcript.json

3 directories, 19 files
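The tree above includes both a transcript.json and a transcript.txt. As a rough sketch of how a finished transcript could be stored in those two forms, consider the helper below. The function `save_transcript` is hypothetical and is not taken from transcript_processor.py; it assumes the transcript is a dictionary with a `text` field, as in AssemblyAI's transcript responses.

```python
import json
from pathlib import Path


def save_transcript(result, out_dir="."):
    """Write a transcript dict as JSON plus a plain-text copy of its text.

    Hypothetical helper; the file names mirror those in the tree above.
    """
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)

    json_path = out / "transcript.json"
    text_path = out / "transcript.txt"

    # Full response, pretty-printed, keeping non-ASCII characters readable.
    json_path.write_text(
        json.dumps(result, indent=2, ensure_ascii=False), encoding="utf-8"
    )
    # Only the recognised text; falls back to an empty string if it is missing.
    text_path.write_text((result.get("text") or "") + "\n", encoding="utf-8")
    return json_path, text_path
```

Keeping the full JSON alongside the plain text preserves any extra fields in the response for later processing, while the text file stays easy to read or search.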

License 📜

The project is licensed under the terms of both the MIT license and the Apache License (Version 2.0).
