The goal of this project is to building some milestone Deep Convolutional Neural Networks from scratch and using them for classification.
There are two dataset being used for that experiment. MNIST and CIFAR10
- MNIST is a dataset of 60,000 28x28 grayscale images of the 10 digits, along with a test set of 10,000 images. More info can be found at the MNIST homepage
- CIFAR10 is a dataset of 50,000 32x32 color training images and 10,000 test images, labeled over 10 categories.
I've carried out two tasks:
-
Creating Googlenet and Resnet34 architectures from scratch and training on MNIST dataset.
-
Performed transfer learning. VGG16 model have been initialized with the weights based on a training on ImageNet dataset without including the top layer. Then new fully-connected layer with random initialization, with the correct number of output. Just new fully-connected layer has been trained.
Model | Accuracy |
---|---|
1. GoogleNet model | 98.07% |
2. ResNet34 model | 98.23% |
3. VGG16 model | 65.10% |
VGG16 model accuracy can also be pulled up by increasing the number of epoch.