Team ROBUSTNET: BENREKIA Mohamed Ali, TADJER Amina, ZAGHRINI Eloïse - December 20, 2021
ABSTRACT
In recent years, many weaknesses have been demonstrated in classification neural networks. Even in highly accurate, deep models, simple perturbations of the input data that are undetectable by humans can make the accuracy drop significantly. To remedy this problem, many defense methods have been proposed to improve model robustness. In this project, we study three different attacks, FGSM, PGD and C&W, and their effectiveness. We then test and compare different defense methods against those attacks: Adversarial Training, Defensive Distillation, Randomized Networks, and Model Ensemble. For each method, we explore different parameters and visualize the results.
Keywords: FGSM, PGD, C&W, Defensive distillation, Randomized Networks, Network Ensemble
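For reference, FGSM perturbs each input by a single step of size epsilon in the direction of the sign of the loss gradient. Below is a minimal sketch in PyTorch, not the exact code used in our notebooks; it assumes a standard classifier, cross-entropy loss, and inputs normalized to [0, 1]:

```python
import torch
import torch.nn.functional as F

def fgsm_attack(model, x, y, epsilon):
    """One-step FGSM: move x in the direction of the sign of the loss gradient."""
    x_adv = x.clone().detach().requires_grad_(True)
    loss = F.cross_entropy(model(x_adv), y)
    loss.backward()
    # Step of size epsilon along the gradient sign, then clip to the valid pixel range.
    x_adv = x_adv + epsilon * x_adv.grad.sign()
    return x_adv.clamp(0.0, 1.0).detach()
```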
This is a basic code repository for Assignment 3.
The repository contains a basic model and a basic training and testing procedure. It will work on the testing platform (but it will not perform well against adversarial examples). The goal of the project is to train a new model that is as robust as possible.
Install python dependencies with pip:
$ pip install -r requirements.txt
Test the basic model:
$ ./model.py
Testing with model from 'models/default_model.pth'.
Downloading https://www.cs.toronto.edu/~kriz/cifar-10-python.tar.gz to ./data/cifar-10-python.tar.gz
100.0%
Extracting ./data/cifar-10-python.tar.gz to ./data/
Model natural accuracy (test): 53.07
(Re)train the basic model:
Note: the default training procedure does nothing, so you will have to implement it yourself in order to obtain good results.
$ ./model.py --force-train
Training model
models/default_model.pth
Files already downloaded and verified
Starting training
[1, 500] loss: 0.576
[1, 1000] loss: 0.575
...
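Since the default training procedure is a stub, here is a minimal sketch of a plain training loop you could start from. Names such as `net` and `train_loader`, and the hyperparameters, are illustrative assumptions rather than part of the provided code; adversarial training would additionally replace `inputs` with attacked versions before the forward pass:

```python
import torch.nn as nn
import torch.optim as optim

def train(net, train_loader, device, epochs=10, lr=0.01):
    """Standard cross-entropy training loop, printing a running loss every 500 batches."""
    net.train()
    criterion = nn.CrossEntropyLoss()
    optimizer = optim.SGD(net.parameters(), lr=lr, momentum=0.9)
    for epoch in range(epochs):
        running_loss = 0.0
        for i, (inputs, labels) in enumerate(train_loader):
            inputs, labels = inputs.to(device), labels.to(device)
            optimizer.zero_grad()
            loss = criterion(net(inputs), labels)
            loss.backward()
            optimizer.step()
            running_loss += loss.item()
            if (i + 1) % 500 == 0:
                print(f'[{epoch + 1}, {i + 1}] loss: {running_loss / 500:.3f}')
                running_loss = 0.0
```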
Train/test the basic model and store the weights to a different file:
$ ./model.py --model-file models/mymodel.pth
...
Load the project module and test it as closely as possible to how it will be tested on the testing platform:
$ ./test_project.py
Even safer: do it from a different directory:
$ mkdir tmp
$ cd tmp
$ ../test_project.py ../
You can modify anything inside this git repository. It will work as long as:
- it contains a `model.py` file in the root directory;
- the `model.py` file contains a class called `Net` derived from `torch.nn.Module`;
- the `Net` class has a function called `load_for_testing()` that initializes the model for testing (typically by setting the weights properly). The default `load_for_testing()` loads and stores weights from a model file, so you will also need to make sure the repository contains a model file that can be loaded into the `Net` architecture using `Net.load(model_file)` (a minimal interface sketch follows this list);
- you may modify this `README.md` file.
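For illustration, here is a minimal sketch of the interface described above. The placeholder architecture, the `project_dir` argument, and the default weight-file path are assumptions made for the example, not requirements of the platform:

```python
import torch
import torch.nn as nn

class Net(nn.Module):
    def __init__(self):
        super().__init__()
        # Placeholder architecture; any torch.nn.Module that classifies CIFAR-10 works.
        self.layers = nn.Sequential(
            nn.Conv2d(3, 32, 3, padding=1), nn.ReLU(),
            nn.MaxPool2d(2),
            nn.Flatten(),
            nn.Linear(32 * 16 * 16, 10),
        )

    def forward(self, x):
        return self.layers(x)

    def load(self, model_file):
        # Restore weights previously saved with torch.save(self.state_dict(), model_file).
        self.load_state_dict(torch.load(model_file, map_location='cpu'))

    def load_for_testing(self, project_dir='./'):
        # Called before evaluation: load the weights that should be tested.
        self.load(project_dir + 'models/default_model.pth')
```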
When you have made improvements to your version of the git repository:
- Add and commit every new/modified file to the git repository, including your model files in `models/` (check with `git status`). DO NOT CHECK THE DATA IN PLEASE!!!!
- Run `test_project.py` and verify that the default model file used by `load_for_testing()` is the model file that you actually want to use for testing on the platform.
- Push your last change.
Note: If you want to avoid any problems, it is a good idea to make a local copy of your repository (with `git clone <repos> <repos-copy>`) and to test the project inside this local copy.
Good luck!
We added some notebooks to demonstrate our work (FGSM, PGD, Adversarial Training, Randomized Networks, Carlini & Wagner, distillation defense):
- A3_main.ipynb: Demonstrates and visualizes how the basic, distilled, and randomized models perform against FGSM and PGD.
- C&W_Vs_Basic_model.ipynb: Tests the C&W attack against the basic model.
- C&W_Vs_Distilled_model.ipynb: Tests the C&W attack against the distilled model.
- Démonstration_FGSM,PGD.ipynb: Visualizes images after FGSM and PGD attacks, and plots accuracy vs. epsilon.
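As a companion to the notebooks, here is a minimal PGD sketch (iterated FGSM with projection onto an L-infinity ball). The step size, iteration count, and clipping range are illustrative defaults, not necessarily the values used in the notebooks:

```python
import torch
import torch.nn.functional as F

def pgd_attack(model, x, y, epsilon, alpha=0.01, num_iter=10):
    """Projected Gradient Descent: repeated gradient-sign steps, each followed by
    a projection back into the L-infinity ball of radius epsilon around x."""
    x_adv = x.clone().detach()
    for _ in range(num_iter):
        x_adv.requires_grad_(True)
        loss = F.cross_entropy(model(x_adv), y)
        grad = torch.autograd.grad(loss, x_adv)[0]
        x_adv = x_adv.detach() + alpha * grad.sign()
        # Project onto the epsilon-ball around the clean image and the valid pixel range.
        x_adv = torch.min(torch.max(x_adv, x - epsilon), x + epsilon).clamp(0.0, 1.0)
    return x_adv
```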