TL;DR

Methods

# SGD (single-thread)
$ python src/baseline_main.py --model=cnn --dataset=cifar --epochs=200 --verbose=0 --local_bs=50 --gpu=0

# FedAvg
$ python src/federated_main.py --model=cnn --dataset=cifar --iid=1 --epochs=1000 --num_users=21 --byzantines=0 --frac=0.1 --verbose=0 --local_bs=50 --gpu=0

# FedAsync
$ python src/fedAsync_main.py --model=cnn --dataset=cifar --iid=1 --epochs=1000 --num_users=21 --byzantines=0 --frac=0.1 --stale=4 --alpha=0.6 --verbose=0 --local_bs=50 --gpu=0

# BRAIN
$ python src/brain_main.py --model=cnn --dataset=cifar --iid=1 --epochs=1000 --num_users=21 --byzantines=0 --score_byzantines=0 --frac=0.1 --stale=4 --diff=0.55 --window=4 --threshold=0.0 --verbose=0 --local_bs=50 --gpu=0

Visualization

Do multiple run (ex. 100) to draw dots well.

# Acc & Loss
$ python src/visual.py

# Combined Graph
$ python src/visual_combine.py

# Model Update DAG
$ python src/cache.py

Test

$ pytest src/test_cache.py

References

CNN Model Implementation: cifar10-airbench

Federated-Learning (PyTorch)

Implementation of the vanilla federated learning paper : Communication-Efficient Learning of Deep Networks from Decentralized Data.

Experiments are produced on MNIST, Fashion MNIST and CIFAR10 (both IID and non-IID). In case of non-IID, the data amongst the users can be split equally or unequally.

Since the purpose of these experiments are to illustrate the effectiveness of the federated learning paradigm, only simple models such as MLP and CNN are used.

Requirments

Install all the packages from requirments.txt

Python3
Pytorch
Torchvision

Data

Download train and test datasets manually or they will be automatically downloaded from torchvision datasets.
Experiments are run on Mnist, Fashion Mnist and Cifar.
To use your own dataset: Move your dataset to data directory and write a wrapper on pytorch dataset class.

Running the experiments

The baseline experiment trains the model in the conventional way.

To run the baseline experiment with MNIST on MLP using CPU:

python src/baseline_main.py --model=mlp --dataset=mnist --epochs=10

Or to run it on GPU (eg: if gpu:0 is available):

python src/baseline_main.py --model=mlp --dataset=mnist --gpu=0 --epochs=10

Federated experiment involves training a global model using many local models.

To run the federated experiment with CIFAR on CNN (IID):

python src/federated_main.py --model=cnn --dataset=cifar --gpu=0 --iid=1 --epochs=10

To run the same experiment under non-IID condition:

python src/federated_main.py --model=cnn --dataset=cifar --gpu=0 --iid=0 --epochs=10

You can change the default values of other parameters to simulate different conditions. Refer to the options section.

Options

The default values for various paramters parsed to the experiment are given in options.py. Details are given some of those parameters:

--dataset: Default: 'mnist'. Options: 'mnist', 'fmnist', 'cifar'
--model: Default: 'mlp'. Options: 'mlp', 'cnn'
--gpu: Default: None (runs on CPU). Can also be set to the specific gpu id.
--epochs: Number of rounds of training.
--lr: Learning rate set to 0.01 by default.
--verbose: Detailed log outputs. Activated by default, set to 0 to deactivate.
--seed: Random Seed. Default set to 1.

Federated Parameters

--iid: Distribution of data amongst users. Default set to IID. Set to 0 for non-IID.
--num_users:Number of users. Default is 100.
--frac: Fraction of users to be used for federated updates. Default is 0.1.
--local_ep: Number of local training epochs in each user. Default is 10.
--local_bs: Batch size of local updates in each user. Default is 10.
--unequal: Used in non-iid setting. Option to split the data amongst users equally or unequally. Default set to 0 for equal splits. Set to 1 for unequal splits.

Results on MNIST

Baseline Experiment:

The experiment involves training a single model in the conventional way.

Parameters:

Optimizer: : SGD
Learning Rate: 0.01

Table 1: Test accuracy after training for 10 epochs:

Model	Test Acc
MLP	92.71%
CNN	98.42%

Federated Experiment:

The experiment involves training a global model in the federated setting.

Federated parameters (default values):

Fraction of users (C): 0.1
Local Batch size (B): 10
Local Epochs (E): 10
Optimizer : SGD
Learning Rate : 0.01

Table 2: Test accuracy after training for 10 global epochs with:

Model	IID	Non-IID (equal)
MLP	88.38%	73.49%
CNN	97.28%	75.94%

Name		Name	Last commit message	Last commit date
Latest commit History 165 Commits
data		data
save/objects		save/objects
save_heatmap/objects		save_heatmap/objects
save_simul/objects		save_simul/objects
simulate		simulate
src		src
visualization		visualization
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt
simul.sh		simul.sh
simul_clean.sh		simul_clean.sh
simul_heatmap.sh		simul_heatmap.sh
test.sh		test.sh
time.sh		time.sh
visual.sh		visual.sh
visual_clean.sh		visual_clean.sh
watch.sh		watch.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

TL;DR

Methods

Visualization

Test

References

Federated-Learning (PyTorch)

Requirments

Data

Running the experiments

Options

Federated Parameters

Results on MNIST

Baseline Experiment:

Federated Experiment:

Further Readings

Papers:

Blog Posts:

About

Releases

Packages

Languages

License

BRAIN-chain/BRAIN-eval-training

Folders and files

Latest commit

History

Repository files navigation

TL;DR

Methods

Visualization

Test

References

Federated-Learning (PyTorch)

Requirments

Data

Running the experiments

Options

Federated Parameters

Results on MNIST

Baseline Experiment:

Federated Experiment:

Further Readings

Papers:

Blog Posts:

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages