This application converts spoken words into musical notes using FastAPI, React, and AssemblyAI.
- Python 3.8+
- Node.js and npm
- AssemblyAI API key
- Clone the repository.
- Set up the backend:

      # Install Python dependencies
      pip install -r requirements.txt
      # Set up your AssemblyAI API key in .env file
      # Replace 'your_api_key_here' with your actual API key
- Set up the frontend:

      cd frontend
      npm install
- Start the backend server:

      uvicorn main:app --reload
- Start the frontend development server:

      cd frontend
      npm run dev
- Open your browser and navigate to the URL shown in the frontend terminal output (usually http://localhost:5173).
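
A common setup mistake is starting the backend with the `'your_api_key_here'` placeholder still in the `.env` file. The sketch below shows one way to catch that early, using only the standard library. The variable name `ASSEMBLYAI_API_KEY` and both helper functions are assumptions made for illustration; this README does not state which name the backend actually reads.

```python
PLACEHOLDER = "your_api_key_here"  # placeholder value named in the setup step


def parse_env(text):
    """Parse KEY=value lines, skipping blank lines and # comments."""
    values = {}
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        # Drop surrounding whitespace and optional quotes around the value.
        values[key.strip()] = value.strip().strip("'\"")
    return values


def get_api_key(env_text, name="ASSEMBLYAI_API_KEY"):
    """Return the key, or raise if it is missing or still the placeholder.

    `name` is a hypothetical variable name; use the one the backend expects.
    """
    key = parse_env(env_text).get(name, "")
    if not key or key == PLACEHOLDER:
        raise ValueError(f"Set {name} in .env to a real AssemblyAI API key")
    return key
```

To use it, read the file contents first, for example `get_api_key(open(".env").read())`. If `requirements.txt` already includes a dotenv loader, that library may make the parsing step unnecessary.
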
- Click the "Start Recording" button to begin recording audio
- Speak into your microphone
- Click "Stop Recording" when finished
- Click "Process Audio" to send the recording to the server
- The transcribed text will appear below
- Audio recording using the Web Audio API
- Real-time audio processing
- Speech-to-text conversion using AssemblyAI
- Modern UI with TailwindCSS
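
The speech-to-text step can be pictured as three HTTP requests to AssemblyAI: upload the audio, create a transcription job, then poll the job until it finishes. The sketch below only builds those requests with the standard library and does not send them. It assumes the backend talks to AssemblyAI's v2 REST endpoints directly; the actual `main.py` may use the official SDK instead, and the function names here are hypothetical. Check the endpoint details against AssemblyAI's current documentation before relying on them.

```python
import json
import urllib.request

API_BASE = "https://api.assemblyai.com/v2"


def build_upload_request(api_key, audio_bytes):
    """POST raw audio bytes; the JSON response is expected to carry `upload_url`."""
    return urllib.request.Request(
        f"{API_BASE}/upload",
        data=audio_bytes,
        headers={
            "authorization": api_key,
            "content-type": "application/octet-stream",
        },
        method="POST",
    )


def build_transcript_request(api_key, audio_url):
    """POST the uploaded audio URL to create a transcription job."""
    body = json.dumps({"audio_url": audio_url}).encode("utf-8")
    return urllib.request.Request(
        f"{API_BASE}/transcript",
        data=body,
        headers={"authorization": api_key, "content-type": "application/json"},
        method="POST",
    )


def build_status_request(api_key, transcript_id):
    """GET the job; a caller would poll until `status` is "completed" or "error"."""
    return urllib.request.Request(
        f"{API_BASE}/transcript/{transcript_id}",
        headers={"authorization": api_key},
        method="GET",
    )
```

Each request can be sent with `urllib.request.urlopen(request)`. Keeping request construction separate from sending makes the flow easy to inspect without network access or a real API key.
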