Skip to content

Latest commit

 

History

History
64 lines (38 loc) · 3.89 KB

README.md

File metadata and controls

64 lines (38 loc) · 3.89 KB

MxNetR for the Class Activation Mapping (CAM)

Background

This project propose a simple example to expose the implicit attention of Convolutional Neural Networks on the image. The paper is published at CVPR'16. The basic idea is based on global average pooling (GAP) layer in the last of network. The framework of the Class Activation Mapping (CAM) is as below:

F1

As illustrated in above figure, global average pooling (GAP) outputs the spatial average of the feature map of each unit at the last convolutional layer. A weighted sum of these values is used to generate the final output. Similarly, we compute a weighted sum of the feature maps of the last convolutional layer to obtain our class activation maps. We describe this more formally below for the case of softmax. The same technique can be applied to regression and other losses.

Fortunately, some of popular networks such as DenseNet, SqueezeNet, ResNet already use global average pooling (GAP) at the end, so we can directly use pre-trained model to generate the class activation mapping (CAM) without any modification. Here is a sample script to generate class activation mapping (CAM) by a pre-trained DenseNet-169.

Note: If you cannot find the 'densenet-imagenet-169-0-0125.params' file in 'model' directory, you can download it from here. This pre-trained parameter is contributed by bruinxiong/densenet.mxnet. After download, you can put this file in the 'model' directory.

Requirements

You can install the package directly in the R console.

cran <- getOption("repos")
cran["dmlc"] <- "https://s3-us-west-2.amazonaws.com/apache-mxnet/R/CRAN/"
options(repos = cran)
install.packages("mxnet")

You can install the package directly in the R console.

## try http:// if https:// URLs are not supported
source("https://bioconductor.org/biocLite.R")
biocLite("EBImage")

You can install the package directly in the R console.

install.packages(c('magrittr', 'imager'))

Start to try

The example code can be found from 'code/Class-Activation-Mapping.R', and the demo output is shown as following.

F2

The function was been coded in 'code/Single fucntion.R', it also support chinese labels.

F3

It's worth noting that denseNet can be applied to image with any size after a minor modification. The function was been coded in 'code/Single function (support any size).R'. However, because the limitation about architecture, the image size must be resized to multiple of 32. Moreover, this modification will decrease the accuracy.

F4

Other pre-trained MxNet Network

MXNet Model Zoo provides number of pre-trained model by imagenet. You can download them to modify this script. Among Inception v3 w/BatchNorm, ResidualNet152, ResNext101-64x4d can be directly used to generate the class activation mapping (CAM).