Build and experiment scalable Image Captioning techniques.
Data preparation and loading is complete for the following Image captioning datasets
- Flickr8k
- Google Conceptual Captions
- The main dataset is available for download at
https://ai.google.com/research/ConceptualCaptions/download
Steps to download images:
- Download the training split from the dataset page
- Set the
GOOGLE_CAPTIONS_FILE
url path inpreprocessing\GoogleCaptions.py
. - Choose the number of samples to download by setting
NUM_SAMPLES
- Run the script using
python3 GoocleCaptions.py
- The images would be downloaded in the set directory
- The keras data loaders are written for
Flickr8k
andGoogle Conceptual Captions
atdatasets\flickr8k
anddatasets\googlecc.py
. - Currently, there is no separate usage of Validation set and a part of the training data is used for Validation.