A deep learning accelerator ASIC design that classifies images from the MNIST handwritten digit dataset.
Source: Wikipedia - MNIST database
Design implementation for Tiny Tapeout.
Thanks to the Columbus IEEE Joint Chapter of the Solid-State Circuits and Circuits and Systems Societies!
Example:
Input images from the MNIST dataset are preprocessed by a Raspberry Pi and transmitted to the ASIC. The images in MNIST are 28x28 grayscale images. As part of the preprocessing step, each image is reduced to a 14x14 black/white image to cut the amount of data that must be transmitted to the ASIC and to reduce the complexity of the neural network. Since the images are 14x14 (196 pixels), an 8-pin interface (ui_in) is used that transmits 7 pixels at a time over 28 clock cycles per image. The remaining bit, the most significant bit (MSB), is an active-low signal pulled low to start transmitting a new image.
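This transmission format lends itself to a simple host-side packer. Below is a minimal Python sketch (a hypothetical helper, not the project's actual code); the row-major pixel order and the bit ordering within each 7-pixel word are assumptions:

```python
# Hypothetical host-side packer: a 14x14 binary image (196 pixels) becomes
# 28 bytes for ui_in, 7 pixels per clock, with bit 7 (active-low) pulled
# low only on the first word to signal the start of a new image.
import numpy as np

def pack_image(image: np.ndarray) -> list[int]:
    """Flatten a 14x14 0/1 image into 28 bytes for the ui_in bus."""
    assert image.shape == (14, 14)
    bits = image.flatten()                 # 196 pixels, row-major (assumed)
    words = []
    for i in range(28):                    # 28 clock cycles per image
        word = 0
        for b in bits[i * 7:(i + 1) * 7]:  # pixels packed into bits [6:0]
            word = (word << 1) | int(b)
        if i != 0:
            word |= 0x80                   # MSB high = not a new image
        # the first word leaves the MSB low (active-low start-of-image)
        words.append(word)
    return words
```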
A preprocessing Python script (utility.py) is provided to convert the standard MNIST images into the reduced data format used in this project. The script trains the network, tests it, converts the PyTorch implementation into Verilog, and generates cocotb unit tests directly from the MNIST dataset.
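The exact reduction used by utility.py is not detailed here; one plausible sketch, assuming 2x2 average pooling followed by a fixed threshold:

```python
# Illustrative preprocessing sketch (utility.py's actual method may differ):
# reduce a 28x28 grayscale MNIST image to 14x14 black/white via 2x2
# average pooling and a threshold.
import numpy as np

def reduce_image(img28: np.ndarray, threshold: int = 128) -> np.ndarray:
    """Return a 14x14 array of 0/1 pixels from a 28x28 uint8 image."""
    assert img28.shape == (28, 28)
    pooled = img28.reshape(14, 2, 14, 2).mean(axis=(1, 3))  # 2x2 block means
    return (pooled >= threshold).astype(np.uint8)           # binarize
```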
The network is based on the PyTorch MNIST example:
```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class Net(nn.Module):
    def __init__(self):
        super(Net, self).__init__()
        self.conv1 = nn.Conv2d(1, 16, 3, 1)   # original example: 1, 32, 3, 1
        self.conv2 = nn.Conv2d(16, 32, 3, 1)  # original example: 32, 64, 3, 1
        self.dropout1 = nn.Dropout(0.25)
        self.dropout2 = nn.Dropout(0.5)
        self.fc1 = nn.Linear(800, 128)        # original example: 9216
        self.fc2 = nn.Linear(128, 10)

    def forward(self, x):
        x = self.conv1(x)
        x = F.relu(x)
        x = self.conv2(x)
        x = F.relu(x)
        x = F.max_pool2d(x, 2)
        x = self.dropout1(x)
        x = torch.flatten(x, 1)
        x = self.fc1(x)
        x = F.relu(x)
        x = self.dropout2(x)
        x = self.fc2(x)
        output = F.log_softmax(x, dim=1)
        return output
```
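The fc1 input width of 800 follows from the 14x14 input: each 3x3 convolution trims 2 pixels (14 → 12 → 10), the 2x2 max pool halves that to 5x5, and 32 channels × 5 × 5 = 800. A quick sanity check of the shapes:

```python
# Sanity-check the layer dimensions for a 14x14 single-channel input.
net = Net()
net.eval()                     # disable dropout for a deterministic pass
x = torch.zeros(1, 1, 14, 14)  # batch of one 14x14 image
with torch.no_grad():
    out = net(x)
print(out.shape)               # torch.Size([1, 10]): one log-probability per digit
```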
The network is implemented in Verilog as a main file (project.v) with three supporting files: readimage.v, neuralnetwork.v, and decoder.v.
Goal: compare the chip's classification results against an identical Python-based neural network implementation.
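A cocotb-style test can make that comparison directly. The sketch below is illustrative only: the clock period, wait time, weights file, test-image file, and output encoding on uo_out are all assumptions, and the tests generated by utility.py may differ. Signal names ui_in, uo_out, and clk follow the Tiny Tapeout template, and the helpers pack_image and reduce_image are the sketches shown earlier.

```python
# Sketch of a chip-vs-model comparison test (assumptions noted inline).
import cocotb
import numpy as np
import torch
from cocotb.clock import Clock
from cocotb.triggers import ClockCycles

@cocotb.test()
async def compare_chip_vs_model(dut):
    cocotb.start_soon(Clock(dut.clk, 10, units="us").start())
    img28 = np.load("sample_digit.npy")              # hypothetical 28x28 test image
    img14 = reduce_image(img28)                      # helper sketched earlier

    # Reference result from the identical PyTorch network.
    model = Net()
    model.load_state_dict(torch.load("weights.pt"))  # hypothetical weights file
    model.eval()
    with torch.no_grad():
        x = torch.tensor(img14, dtype=torch.float32).reshape(1, 1, 14, 14)
        expected = model(x).argmax(dim=1).item()

    # Drive the image into the ASIC, 7 pixels per clock for 28 clocks.
    for word in pack_image(img14):                   # helper sketched earlier
        dut.ui_in.value = word
        await ClockCycles(dut.clk, 1)
    await ClockCycles(dut.clk, 200)                  # assumed compute latency

    assert int(dut.uo_out.value) == expected         # chip matches the model
```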