Skip to content

This project takes podcast episodes from the Podcast Index, converts the audio into text, summarizes the content, generates an image based on the summary, translates the summary into French, and allows users to ask questions about the episode. Additionally, ElevenLabs is used for audio generation.

Notifications You must be signed in to change notification settings

extrawest/ai-podcast-to-blog

Repository files navigation

Podcast Summary & Q&A App

Maintenance Maintaner Ask Me Anything ! GitHub license GitHub release

This project takes podcast episodes from the Podcast Index, converts the audio into text, summarizes the content, generates an image based on the summary, translates the summary into French, and allows users to ask questions about the episode. Additionally, ElevenLabs is used for audio generation.

Features

  • Audio to Text: Convert podcast episodes into text using Hugging Face models.
  • Summarization: Create concise summaries of podcast episodes.
  • Translation: Translate summarized content into French.
  • Image Generation: Generate images based on the summarization.
  • Q&A: Ask questions about the episode and get accurate answers.
  • Audio Creation: Generate audio content with ElevenLabs.
  • User Authentication: Secure authentication and user management with ClerkJs.

Tech Stack

  • Next.js: Frontend framework for building fast and scalable web applications.
  • Hugging Face: Provides models for transcription, summarization, and translation.
  • ElevenLabs: Generates audio content based on summaries.
  • LangChain: Orchestrates the entire process by creating a chain that integrates all functionalities.
  • ClerkJs: User authentication and management.
  • Axios: Handles API requests.

Installation

  1. Install dependencies:

    npm install
  2. Start the development server:

    npm run dev
  3. Visit http://localhost:3000 to access the app.

How It Works

  1. Fetch Podcast: Axios is used to retrieve podcast audio from the Podcast Index.
  2. Audio Transcription: Hugging Face models convert the audio into text.
  3. Summarization: The transcribed text is summarized using Hugging Face models.
  4. Translation: The summary is translated into French using Hugging Face translation models.
  5. Image Generation: An image is generated from the summarization using AI tools.
  6. Audio Creation: ElevenLabs generates audio from the summarized content.
  7. Q&A: Users can ask questions about the episode, and LangChain coordinates the response process.
  8. Authentication: ClerkJs handles user login and account management.

Running in Production

To deploy the app:

  1. Build the app for production:

    npm run build
  2. Start the production server:

    npm start

Contributing

Feel free to open issues or submit pull requests to improve the project. Contributions are welcome!

License

This project is licensed under the MIT License.

About

This project takes podcast episodes from the Podcast Index, converts the audio into text, summarizes the content, generates an image based on the summary, translates the summary into French, and allows users to ask questions about the episode. Additionally, ElevenLabs is used for audio generation.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published