Skip to content

The GitHub repository Information_Summarizer by Prathamesh Patil is a versatile tool built using Django, React, and Tailwind CSS. It allows users to generate summaries from various sources like plain text, PDFs, DOCX files, video transcripts, and website URLs, utilizing an advanced NLP model—Trained T5-base, trained on the abisee/cnn_dailymail 3.0.

Notifications You must be signed in to change notification settings

prathamesh-patil-5090/Information_Summarizer

Repository files navigation

Information Summarizer

Information Summarizer is a versatile tool built using Django, React, and Tailwind CSS. It allows users to generate summaries from various sources like plain text, PDFs, DOCX files, video transcripts, and website URLs, using advanced NLP model - Trained T5-base, which trained on abisee/cnn_dailymail , 3.0.0. dataset of hugging face.

Table of Contents

Description

Information Summarizer is designed to streamline the summarization process for various content types, leveraging AI models to produce concise, meaningful summaries. Users can upload text files, PDFs, and input URLs or video links to generate summaries.

Features

  • Summarize various content types:
    • Plain text
    • PDF files
    • Video transcripts from YouTube
    • Web URLs
  • User authentication (login and signup)
  • PDF and text file upload and summary generation
  • Video transcription and summarization using the youtube-transcript-api
  • Summarization models - Trained T5-base, which trained on abisee/cnn_dailymail , 3.0.0. dataset of hugging face

Technologies Used

  • Frontend: React, Tailwind CSS
  • Backend: Django, Django REST Framework
  • Database: SQLite
  • OCR and Summarization: Trained T5-base, which trained on abisee/cnn_dailymail , 3.0.0. dataset of hugging face
  • Other Libraries: youtube-transcript-api for video transcription

Requirements

  • Python 3.12
  • Django (version compatible with your Python version)
  • Required Python libraries (see requirements.txt)

Installation

  1. Clone the Repository
    git clone https://github.com/patilprathamesh07/Information_Summarizer.git
    cd Information_Summarizer
  2. Install Python Dependencies
pip install -r requirements.txt

All set to start the program :

cd Information_Summarizer
python manage.py migrate
python manage.py runserver

Usage

  1. Login and Signup: Access authentication features at the login page.
  2. Upload Content: Go to the upload page to submit text, PDF, url , or video links.
  3. Summary Generation: Choose the content type and receive a summarized version on the result page.

Project Structure

Information_Summarizer/
├── info_summarizer/
│   ├── __init__.py
│   ├── asgi.py
│   ├── settings.py
│   ├── urls.py
│   ├── wsgi.py
├── pdfs/
├── transcripts/
├── summarizer/
│   ├── __init__.py
│   ├── admin.py
│   ├── apps.py
│   ├── models.py
│   ├── tests.py
│   ├── views.py
│   └── templates/
│       ├── login.html
│       ├── upload.html
│       └── result.html
├── README.md
├── db.sqlite3
├── manage.py
└── requirements.txt

Contribution Guidelines

  1. Fork the repository.
  2. Create a feature branch (git checkout -b feature/AmazingFeature).
  3. Commit your changes (git commit -m 'Add some AmazingFeature').
  4. Push to the branch (git push origin feature/AmazingFeature).
  5. Open a pull request.

License

This project is licensed under the MIT License.

Screenshots:

image

image

image

image

About

The GitHub repository Information_Summarizer by Prathamesh Patil is a versatile tool built using Django, React, and Tailwind CSS. It allows users to generate summaries from various sources like plain text, PDFs, DOCX files, video transcripts, and website URLs, utilizing an advanced NLP model—Trained T5-base, trained on the abisee/cnn_dailymail 3.0.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published