Proposal: T5-Bootcamp-DeepLearning-project

Final project - SDAIA Academy - Bootcamp Data Science

Introduction:

The project, "Eye for Blind," aims to create a deep learning model that can explain the content of an image in the form of speech through caption generation with the attention mechanism on the Flickr8K data set

Goal:

It aims to use text to speech conversion in order to showcase our result in an audio format, thus, allowing us to recognize the objects and explain them accordingly in an audiblemanner.

Future work:

create an application to help blind people explain the pictures accordingly in an audible manner.

Dataset:

8091 Images
40455 Captions

Dataset sourec:

from Kaggle website [Kaggle]

Algorithms:

Inception-v3 model
CCN Model.
Attention Model.
RNN Model.
Greedy Search
Beam Search
Gtts

Tools:

Softwares:

VScode
mp3
Trello
Jupyter
Github
PowerPoint
Zoom

Languages & Libarry

Python
Pandas
numpy
seaborn
plotly
sklearn
PIL
tqdm
Adam
InceptionV3

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
Output		Output
Eye for blind.pptx		Eye for blind.pptx
MVP.ipynb		MVP.ipynb
README.md		README.md
captions.txt		captions.txt
finalcode.ipynb		finalcode.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Proposal: T5-Bootcamp-DeepLearning-project

Introduction:

Goal:

Future work:

Dataset:

Dataset sourec:

Algorithms:

Tools:

Softwares:

Languages & Libarry

Team Members

About

Releases

Packages

Languages

EngrRaghad/T5-Bootcamp-DeepLearning-project

Folders and files

Latest commit

History

Repository files navigation

Proposal: T5-Bootcamp-DeepLearning-project

Introduction:

Goal:

Future work:

Dataset:

Dataset sourec:

Algorithms:

Tools:

Softwares:

Languages & Libarry

Team Members

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages