This project uses AI models to generate captions for images and translate them into regional languages (Kannada and Hindi). It also offers text-to-speech conversion, making the captions accessible to a wider audience, especially people with visual impairments.
- Image Captioning: Generate meaningful captions based on the content of images.
- Language Translation: Translate captions from English to Kannada and Hindi.
- Speech Conversion: Convert captions to audio files using gTTS for ease of access.
- Multi-modal Application: Supports both visual and auditory outputs for different use cases.
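The captioning, translation, and speech steps listed above can be sketched roughly as follows. This is a minimal illustration, not the project's actual code: the README does not name the captioning or translation models, so `captioner` and `translator` are hypothetical callables you supply, and `describe_image`, `save_speech`, and the output file names are assumptions made for this example. Only the gTTS call (`gTTS(text=..., lang=...).save(path)`) reflects the library named above.

```python
from typing import Callable, Dict

# gTTS language codes for English and the two target languages.
LANGUAGE_CODES = {"english": "en", "kannada": "kn", "hindi": "hi"}


def save_speech(text: str, lang_code: str, path: str) -> str:
    """Write `text` to an MP3 file with gTTS (requires network access)."""
    # Imported lazily so the rest of the sketch runs without gTTS installed.
    from gtts import gTTS

    gTTS(text=text, lang=lang_code).save(path)
    return path


def describe_image(
    image_path: str,
    captioner: Callable[[str], str],        # hypothetical: image path -> English caption
    translator: Callable[[str, str], str],  # hypothetical: (text, language code) -> translation
    synthesizer: Callable[[str, str, str], str] = save_speech,
    out_prefix: str = "caption",            # assumed naming scheme for audio files
) -> Dict[str, Dict[str, str]]:
    """Caption an image, translate the caption, and save one audio file per language."""
    caption = captioner(image_path)
    results = {}
    for name, code in LANGUAGE_CODES.items():
        # The caption is already in English, so only the other languages are translated.
        text = caption if code == "en" else translator(caption, code)
        audio_path = synthesizer(text, code, f"{out_prefix}_{code}.mp3")
        results[name] = {"text": text, "audio": audio_path}
    return results
```

Passing the models in as callables keeps the sketch independent of any particular captioning or translation backend; each one can be a thin wrapper around whichever model the project actually uses.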
- Accessibility Aid: Helps visually impaired users by describing images via audio.
- Language Learning Tool: Supports language translation for educational purposes.
- Interactive Learning: Enhances digital learning tools with multi-language support.
The goals of this project are to:
- Generate meaningful captions for images.
- Translate captions from English into regional languages (Kannada and Hindi).
- Convert captions to audio for accessibility.
- Assist visually impaired individuals with image descriptions.
- Help users learn translations through images.
- Enhance interactive educational tools.