Voicenator

Breaking Language Barriers: Real-time Translation and Transcription with Voicenator

Introduction

Voicenator is an AI-powered application that breaks down language barriers with real-time translation and transcription. It combines the Web Speech API, Deepgram, and WebSockets to provide speech-to-text and text-to-speech, making communication easier and more efficient.

Features

Real-time Translation: Translate speech or text in real-time across multiple languages.
Speech-to-Text: Accurate transcription of spoken words into text using Deepgram.
Text-to-Speech: Convert written text into natural-sounding speech with SpeechSynthesis.
Voice Cloning: Create high-quality AI clones of human voices.
AI Dubbing: Automatically translate and dub videos in multiple languages.
Transcription: Transcribe videos with high accuracy in over 20 languages.
AI Avatar: Generate AI-driven video content.
Text-to-Speech API: Utilize our API for natural-sounding text-to-speech conversions.

Installation

Prerequisites

Node.js (v14 or higher)
npm (v6 or higher) or yarn

Steps

Clone the repository:

```
git clone https://github.com/your-username/voicenator.git
cd voicenator
```

Install dependencies:
```
npm install
# or
yarn install
```

Create a .env file in the root directory and add the following variables:

```
REACT_APP_DEEPGRAM_API_KEY=your_deepgram_api_key
REACT_APP_WEB_SOCKET_URL=your_websocket_url
```

Start the development server:
```
npm start
# or
yarn start
```

Usage

Real-time Translation: Select your source and target languages, then start speaking or typing to see instant translations.
Speech-to-Text: Use the microphone button to start speaking and see your words transcribed in real-time.
Text-to-Speech: Enter text into the input field and press the play button to hear the speech output.
Voice Cloning, AI Dubbing, and other advanced features: Navigate through the application menu to access and utilize these functionalities.
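
The text-to-speech step above can be sketched as follows. This is a minimal illustration rather than Voicenator's actual code: the helper names `pickVoice` and `speak` and the `VoiceLike` type are hypothetical, while `speechSynthesis`, `SpeechSynthesisUtterance`, and `getVoices()` are the standard Web Speech API names.

```typescript
// Minimal structural type covering the SpeechSynthesisVoice fields used here.
interface VoiceLike {
  name: string;
  lang: string; // BCP 47 tag, e.g. "en-US"
  default: boolean;
}

// Hypothetical helper: choose a voice for the target language.
// Order: exact tag match, then same primary language, then the default voice.
function pickVoice<V extends VoiceLike>(voices: V[], targetLang: string): V | undefined {
  const wanted = targetLang.toLowerCase();
  const primary = wanted.split("-")[0];
  return (
    voices.find((v) => v.lang.toLowerCase() === wanted) ??
    voices.find((v) => v.lang.toLowerCase().split("-")[0] === primary) ??
    voices.find((v) => v.default)
  );
}

// Hypothetical helper: speak text if speech synthesis is available.
// Returns false when the API is missing (for example, outside a browser).
function speak(text: string, lang: string): boolean {
  const g = globalThis as any;
  if (!g.speechSynthesis || !g.SpeechSynthesisUtterance) return false;
  const utterance = new g.SpeechSynthesisUtterance(text);
  utterance.lang = lang;
  const voice = pickVoice(g.speechSynthesis.getVoices() as VoiceLike[], lang);
  if (voice) utterance.voice = voice;
  g.speechSynthesis.speak(utterance);
  return true;
}
```

In a browser, `speak("Hola, mundo", "es-ES")` would queue the utterance. Some browsers return an empty list from `getVoices()` until the `voiceschanged` event fires; in that case the sketch sets only `utterance.lang` and leaves voice selection to the browser.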

Configuration

Web Speech API

Voicenator uses the Web Speech API for speech recognition and synthesis. Browser support varies, particularly for speech recognition, so check that your browser supports this API before using these features.
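
One way to check browser support is a small feature-detection helper. This is a sketch under assumptions: the function name `detectSpeechSupport` and the `SpeechSupport` type are hypothetical and not part of Voicenator, while `SpeechRecognition`, `webkitSpeechRecognition`, `speechSynthesis`, and `SpeechSynthesisUtterance` are the standard browser globals.

```typescript
interface SpeechSupport {
  recognition: boolean; // browser speech-to-text
  synthesis: boolean;   // browser text-to-speech
}

// Hypothetical helper: report which parts of the Web Speech API a global scope exposes.
// Some browsers expose recognition only under the prefixed name webkitSpeechRecognition.
function detectSpeechSupport(scope: object = globalThis): SpeechSupport {
  return {
    recognition: "SpeechRecognition" in scope || "webkitSpeechRecognition" in scope,
    synthesis: "speechSynthesis" in scope && "SpeechSynthesisUtterance" in scope,
  };
}
```

Calling `detectSpeechSupport()` with no argument inspects the current global scope, so an app could use the result to disable the microphone or play button when a feature is missing.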

Deepgram API

Deepgram provides the backend for speech-to-text functionality. Sign up on Deepgram's website to get your API key and add it to your .env file.
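
As a rough illustration of how a client might talk to Deepgram, the sketch below builds a live-transcription URL and extracts the transcript from one server message. It assumes a raw WebSocket connection rather than an SDK (the README does not say which Voicenator uses). The helper names `buildListenUrl` and `extractTranscript` are hypothetical, and the endpoint, query parameters, and message shape reflect Deepgram's public streaming API as commonly documented; verify them against the current Deepgram reference.

```typescript
// Assumed endpoint for Deepgram live transcription; check the current Deepgram docs.
const DEEPGRAM_LISTEN_URL = "wss://api.deepgram.com/v1/listen";

interface ListenOptions {
  language: string;         // e.g. "en-US"
  punctuate?: boolean;
  interimResults?: boolean; // partial transcripts while the user is still speaking
}

// Hypothetical helper: build the streaming URL with its query parameters.
function buildListenUrl(options: ListenOptions): string {
  const params = new URLSearchParams({ language: options.language });
  if (options.punctuate !== undefined) {
    params.set("punctuate", String(options.punctuate));
  }
  if (options.interimResults !== undefined) {
    params.set("interim_results", String(options.interimResults));
  }
  return `${DEEPGRAM_LISTEN_URL}?${params.toString()}`;
}

// Hypothetical helper: pull the top transcript out of one server message.
// Returns null for malformed JSON, non-transcript messages, and empty transcripts.
function extractTranscript(rawMessage: string): { text: string; isFinal: boolean } | null {
  let message: any;
  try {
    message = JSON.parse(rawMessage);
  } catch {
    return null;
  }
  const text = message?.channel?.alternatives?.[0]?.transcript;
  if (typeof text !== "string" || text.length === 0) return null;
  return { text, isFinal: message.is_final === true };
}
```

A browser client would typically open a `WebSocket` on the URL from `buildListenUrl`, send microphone audio chunks over it, and pass each incoming `event.data` string to `extractTranscript`. How the API key is attached to that connection is left out here on purpose; follow Deepgram's authentication guidance for browser clients.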

WebSockets

WebSockets are used for real-time data transmission. Configure the WebSocket URL in your .env file.
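
The two `.env` variables can be validated once at startup so that a missing key or malformed URL fails early. The sketch below is an assumption-laden example, not Voicenator's code: `loadConfig` and `VoicenatorConfig` are hypothetical names, and the environment object is passed in as a parameter because how variables reach the bundle depends on the build setup. Create React App exposes `REACT_APP_`-prefixed variables on `process.env`, whereas Vite by default exposes only `VITE_`-prefixed variables on `import.meta.env`.

```typescript
interface VoicenatorConfig {
  deepgramApiKey: string;
  webSocketUrl: string;
}

// Hypothetical helper: validate the two variables named in the .env step.
// `env` is whatever object the build tool exposes to the browser bundle.
function loadConfig(env: Record<string, string | undefined>): VoicenatorConfig {
  const deepgramApiKey = env["REACT_APP_DEEPGRAM_API_KEY"]?.trim();
  const webSocketUrl = env["REACT_APP_WEB_SOCKET_URL"]?.trim();
  if (!deepgramApiKey) throw new Error("REACT_APP_DEEPGRAM_API_KEY is not set");
  if (!webSocketUrl) throw new Error("REACT_APP_WEB_SOCKET_URL is not set");

  let parsed: URL;
  try {
    parsed = new URL(webSocketUrl);
  } catch {
    throw new Error(`REACT_APP_WEB_SOCKET_URL is not a valid URL: ${webSocketUrl}`);
  }
  // Assumption: ws/wss for raw WebSockets, http/https for a Socket.IO server.
  const allowed = ["ws:", "wss:", "http:", "https:"];
  if (!allowed.includes(parsed.protocol)) {
    throw new Error(`Unsupported protocol in REACT_APP_WEB_SOCKET_URL: ${parsed.protocol}`);
  }
  return { deepgramApiKey, webSocketUrl };
}
```

Taking the environment as an argument keeps the function independent of the bundler and easy to unit test with a plain object.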

Technologies Used

React: Frontend UI library
TypeScript: Static typing for JavaScript
Redux: State management
Web Speech API: Browser API for speech recognition and synthesis
Deepgram: Speech-to-text API
Socket.IO: Real-time communication library that uses WebSockets as a transport
Vite: Build tool for frontend development

Contributing

Contributions are welcome! Please follow these steps:

Fork the repository
Create your feature branch:
```
git checkout -b feature/YourFeature
```
Commit your changes:
```
git commit -m 'Add YourFeature'
```
Push to the branch:
```
git push origin feature/YourFeature
```
Open a pull request

License

This project is licensed under the Apache-2.0 License - see the LICENSE file for details.

Contact

For questions or suggestions, please reach out to us:

Email: ayushsoni1010.work@gmail.com
Website: https://ayushsoni1010.com

Thank you for using Voicenator! We hope this tool makes your communication more effective and breaks down language barriers effortlessly.
