Skip to content

Latest commit

 

History

History
59 lines (54 loc) · 2.16 KB

File metadata and controls

59 lines (54 loc) · 2.16 KB

Next level deep learning from scratch

This repo includes code for my Next level deep learning from scratch.

Requirements

  • Python 3.x
  • Numpy
  • matplotlib
  • CuPy (optional)

Features and capablities

A network built with this library has the following features and capablities:

  • Simple, keras-like usage form
  • GPU support through CuPy
  • Construct multilayer neural networks
  • Choice of many popular activation functions (sigmoid, tanh, relu, lrelu, elu and softmax)
  • Choice of loss function (softmax_cross_entropy or MSE) or add your own loss.
  • Choice of different optimizers (SGD, AdaGrad or RMSProp) or add your own optimizer
  • Choice of different regularizers (l1 or l2) or add your own regularizer
  • Batch normalization
  • Mini-batch
  • Dropout
  • Produce validation predictions for each epoch

Training sample

A full model training and hyperparameter optimization code and runs can be found in Training sample.ipynb. Dataset used is fashion mnist with dimensionality reduced data and can be found in .\input folder.
The final model achieves 89.96% test accuracy.

Sample code

# import deep learning class
from DeepLearning import NN, Layers, Utils, Losses, Optimizers, Regularizers, Metrics

# read training (100 samples), validation and test data
# .
# .
# .

# define model hyperparameters
clf = NN(Losses.softmax_cross_entropy,
         Optimizers.RMSProp,
         regularizer=Regularizers.l1,
         reg_lambda=0.01)
clf.add(Layers.Dense(128, input_dim=100))
clf.add(Layers.Dense(96, activation='lrelu', batch_norm=(0.99, 0.001, 1e-5)))
clf.add(Layers.Dense(64, activation='lrelu', batch_norm=(0.99, 0.001, 1e-5)))
clf.add(Layers.Dense(10, activation='softmax'))

# training network
result = clf.fit(x_train,
                 y_train,
                 learning_rate=0.01,
                 batch_size=128,
                 epochs=epochs,
                 print_freq=1,
                 gamma=0.0,
                 decay=0.9,
                 validation=(x_val, y_val))
                 
loss, acc, val_loss, val_acc = result['Loss'], result['Accuracy'], result['Val Loss'], result['Val Accuracy']

# make predictions
y_pred = clf.predict(x_test)