Skip to content

This project leverages advanced AI models to generate captions for images and translate them into regional languages (Kannada and Hindi). Additionally, it offers text-to-speech conversion, making it accessible to a wider audience, specially those with visual impairments.

Notifications You must be signed in to change notification settings

LavanyaAN21/Depiction-of-image-features-with-audio-to-aid-visually-impaired-person

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 

Repository files navigation

Depiction of image features with audio to aid visually impaired person

About

This project leverages advanced AI models to generate captions for images and translate them into regional languages (Kannada and Hindi). Additionally, it offers text-to-speech conversion, making it accessible to a wider audience, specially those with visual impairments.

Key Features

Image Captioning: Generate meaningful captions based on the content of images. Language Translation: Translate captions from English to Kannada and Hindi. Speech Conversion: Convert captions to audio files using gTTS for ease of access. Multi-modal Application: Supports both visual and auditory outputs for different use cases.

Use Cases

Accessibility Aid: Helps visually impaired users by describing images via audio. Language Learning Tool: Supports language translation for educational purposes. Interactive Learning: Enhances digital learning tools with multi-language support.

The goal of this project is to:

  1. Generate meaningful captions for images.
  2. Translate captions into regional languages (English,Kannada & Hindi).
  3. Convert captions to audio for accessibility.

This tool can be useful in various applications such as:

  • Assisting visually impaired individuals with image descriptions.
  • Learning language translations through images.
  • Enhancing interactive educational tools.

Outputs

Screenshot 2024-10-16 183155 Screenshot 2024-10-16 183215

About

This project leverages advanced AI models to generate captions for images and translate them into regional languages (Kannada and Hindi). Additionally, it offers text-to-speech conversion, making it accessible to a wider audience, specially those with visual impairments.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages