GradSim Improvements #837
Comments
I think differential privacy libraries such as Opacus may do what we need. This blog post details how they perform efficient per-sample gradient computation.
Example of using Opacus to compute per-sample gradients:

!pip install -q opacus

from opacus.grad_sample import GradSampleModule
import torch
import torch.nn as nn
import torch.nn.functional as F

class MNISTConvNet(nn.Module):
    def __init__(self):
        super(MNISTConvNet, self).__init__()
        self.conv1 = nn.Conv2d(1, 10, 5)
        self.pool1 = nn.MaxPool2d(2, 2)
        self.conv2 = nn.Conv2d(10, 20, 5)
        self.pool2 = nn.MaxPool2d(2, 2)
        self.fc1 = nn.Linear(320, 50)
        self.fc2 = nn.Linear(50, 10)

    def forward(self, input):
        x = self.pool1(F.relu(self.conv1(input)))
        x = self.pool2(F.relu(self.conv2(x)))
        x = x.view(x.size(0), -1)
        x = F.relu(self.fc1(x))
        x = self.fc2(x)  # return raw logits for CrossEntropyLoss
        return x

# Wrapping the model makes each parameter accumulate a .grad_sample attribute
# holding one gradient per instance in the batch.
net = GradSampleModule(MNISTConvNet())

input = torch.randn((2, 1, 28, 28))
target = torch.randint(0, 10, (2,))  # valid class indices (randn cast to long can be negative)
loss_fn = nn.CrossEntropyLoss()

out = net(input)
err = loss_fn(out, target)
err.backward()

for p in net.parameters():
    print(p.grad_sample.shape)  # (batch_size, *parameter_shape)

Note that a limitation of this approach is that Opacus only overrides a finite set of torch module types, so models built from unsupported layers may not work out of the box.
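If we went down this route, it might be worth checking up front which layers Opacus can handle. A minimal sketch, assuming Opacus 1.x and its ModuleValidator utility (not something we currently depend on):

from opacus.validators import ModuleValidator

model = MNISTConvNet()

# With strict=False this returns a list of incompatibility errors instead of raising;
# an empty list means every submodule has a per-sample gradient implementation.
errors = ModuleValidator.validate(model, strict=False)
print(errors)

# fix() attempts to swap unsupported layers for compatible ones
# (e.g. BatchNorm -> GroupNorm) and returns the modified model.
model = ModuleValidator.fix(model)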
@jklaise & @RobertSamoilescu, tagging you for discussion; any thoughts?
I think the dash onsite demonstrated that the GradSim method is slow for large models. This is because PyTorch and TensorFlow don't currently let you compute gradients per instance within a batch, which gradient similarity requires. We can compute them ahead of time by storing the gradients, but this becomes infeasible for large models. Partial solutions include: a) using a subset of the model weights, such as the final layer, to decrease the memory overhead, or b) reducing the dataset you're comparing against using something like ProtoSelect. Both of these are user-level interventions. I think our focus should be on figuring out how to batch the gradient computations (a rough sketch of one possibility follows).
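For reference, here is a rough sketch of what batched per-sample gradient computation could look like on the PyTorch side, assuming a recent PyTorch with the torch.func API (formerly functorch). This is an illustration of the idea rather than a proposal for the implementation here, and it reuses the MNISTConvNet from the Opacus example above:

import torch
import torch.nn.functional as F
from torch.func import functional_call, grad, vmap

model = MNISTConvNet()
data = torch.randn(32, 1, 28, 28)
targets = torch.randint(0, 10, (32,))

# Detach parameters/buffers so they can be passed explicitly to a functional call.
params = {k: v.detach() for k, v in model.named_parameters()}
buffers = {k: v.detach() for k, v in model.named_buffers()}

def compute_loss(params, buffers, sample, target):
    # Treat a single sample as a batch of one so the model's shapes work out.
    preds = functional_call(model, (params, buffers), (sample.unsqueeze(0),))
    return F.cross_entropy(preds, target.unsqueeze(0))

# grad differentiates w.r.t. the params dict; vmap maps the resulting function
# over the batch dimension, giving one gradient per instance without a Python loop.
per_sample_grad_fn = vmap(grad(compute_loss), in_dims=(None, None, 0, 0))
per_sample_grads = per_sample_grad_fn(params, buffers, data, targets)

for name, g in per_sample_grads.items():
    print(name, g.shape)  # leading dimension is the batch size

The vmap call vectorizes the single-example gradient function, so the per-instance gradients come out of one batched call; restricting params to the final layer would combine this with option a) above to cut the memory overhead further.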