Image Captioning

This repository contains my implementation of an image captioning model. The model takes an image as input and generates a descriptive English caption.

Project Overview

I experimented with several model architectures, including CNN-LSTMs and CNN-Transformers.
The project uses the MSCOCO2017 dataset. I initially used Flickr30k, but my captioning results were much better on MSCOCO2017, most likely because it has more data.

Results

The model achieved a maximum BLEU-4 score of 11.0.

Original Caption: polar bear swimming in the water by wall

Generated Caption: polar bear swimming by large wave
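
To make the BLEU-4 metric concrete, here is a minimal sketch of unsmoothed sentence-level BLEU applied to the caption pair above. It is not necessarily how the notebooks compute the score: the helper names (`ngrams`, `modified_precision`, `sentence_bleu`), the whitespace tokenization, and the use of a single reference are all assumptions made for illustration.

```python
import math
from collections import Counter


def ngrams(tokens, n):
    """Return a Counter of all n-grams (as tuples) in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def modified_precision(candidate, reference, n):
    """Clipped n-gram precision of a candidate against a single reference."""
    cand_counts = ngrams(candidate, n)
    ref_counts = ngrams(reference, n)
    total = sum(cand_counts.values())
    if total == 0:
        return 0.0
    # Each candidate n-gram is credited at most as often as it occurs in the reference.
    clipped = sum(min(count, ref_counts[gram]) for gram, count in cand_counts.items())
    return clipped / total


def sentence_bleu(candidate, reference, max_n=4):
    """Unsmoothed BLEU with uniform weights and a brevity penalty.

    Returns (score, precisions), with the score on a 0-1 scale.
    """
    precisions = [modified_precision(candidate, reference, n) for n in range(1, max_n + 1)]
    if min(precisions) == 0.0:
        # The geometric mean is zero as soon as any precision is zero.
        return 0.0, precisions
    log_mean = sum(math.log(p) for p in precisions) / max_n
    c, r = len(candidate), len(reference)
    brevity_penalty = 1.0 if c > r else math.exp(1 - r / c)
    return brevity_penalty * math.exp(log_mean), precisions


# Assumption: simple whitespace tokenization of the example captions.
reference = "polar bear swimming in the water by wall".split()
candidate = "polar bear swimming by large wave".split()

score, precisions = sentence_bleu(candidate, reference)
print(precisions)  # 1-gram to 4-gram precisions: 4/6, 2/5, 1/4, 0/3
print(score)       # 0.0, because no 4-gram of the candidate appears in the reference

score_2, _ = sentence_bleu(candidate, reference, max_n=2)
print(score_2)     # BLEU-2 for the same pair, roughly 0.37
```

For this pair the generated caption shares no 4-gram with the reference, so the unsmoothed sentence-level BLEU-4 is zero even though the caption is reasonable. This is why BLEU is usually aggregated over a whole evaluation set or computed with smoothing. The 11.0 figure above is presumably such an aggregate on a 0-100 scale, though the README does not say how it was computed.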

Inspiration

This project was inspired by the following papers:

