Merge util.metrics.py from CerebNet and FastSurferCNN into one file #590
base: dev
Conversation
taha-abdullah
commented
Oct 8, 2024
- Deleted CerebNet.utils.metrics.py and merged all of its functionality into FastSurferCNN.utils.metrics.py
- Refactored usage across files
- Changed the dice score implementation to numpy (tried MONAI and Ignite; neither worked out)
- Added support for multiple images in test_images
- Nested parameterization (subjects and test files) in test_images and test_stats
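The numpy-based dice score mentioned above could look roughly like this (a minimal sketch; the actual function name and signature in FastSurferCNN.utils.metrics.py may differ):

```python
import numpy as np

def dice_score(pred, gt, labels):
    """Per-label Dice between two integer label maps (hypothetical helper)."""
    scores = {}
    for lab in labels:
        p = pred == lab
        g = gt == lab
        denom = p.sum() + g.sum()
        # Convention: Dice is 1.0 when the label is absent from both maps.
        scores[lab] = 2.0 * np.logical_and(p, g).sum() / denom if denom > 0 else 1.0
    return scores
```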
Force-pushed from fe5772b to 80884ad (Compare)
We should do another iteration on metrics.
.github/workflows/quicktest.yaml
Outdated
I do not understand why these changes are required. I think the previous version (before your changes here) was better. It has the whole pipeline grouped into one job.
I guess the "on" variable should be uncommented :)
I commented out the triggers on pull requests to dev and stable. These should only be added at the very end, when everything works (if at all). Until then, the workflow should be triggered manually for testing.
Even later, given the computational overhead, I think it makes sense to trigger this only for pull requests that actually touch any of the files involved in testing, and not just modify doc strings etc.
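For reference, a manual-only trigger with the pull-request events left commented out could be sketched like this (the exact path filters are assumptions, not part of the PR):

```yaml
on:
  workflow_dispatch:   # manual runs while the pipeline is under development
  # Re-enable once everything works (if at all), restricted to relevant files:
  # pull_request:
  #   branches: [dev, stable]
  #   paths:
  #     - "FastSurferCNN/**"
  #     - "CerebNet/**"
  #     - ".github/workflows/quicktest.yaml"
```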
Commented out the trigger on pull requests and changed it back to the last iteration.
@m-reuter One major disadvantage of this is that such workflows can only be triggered for branches that are part of the repository, i.e. we can no longer run them on all pull requests when we want to. At least I have not found a way to trigger them. The best solution is probably to always trigger them, but require a label on the PR that needs to be set to actually run this workflow.
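The label-gated approach could look roughly like this (the label name `run-quicktest` and the job layout are assumptions for illustration):

```yaml
on:
  pull_request:
    types: [opened, synchronize, reopened, labeled]

jobs:
  quicktest:
    # Only run when a maintainer has set the opt-in label on the PR.
    if: contains(github.event.pull_request.labels.*.name, 'run-quicktest')
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
```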
FastSurferCNN/utils/metrics.py
Outdated
device : str or torch.device, optional
    Device specification in case of distributed computation usage.
    In most cases it can be defined as "cuda:local_rank", or "cuda"
    if `torch.cuda.set_device(local_rank)` has already been called. By default, if a
    distributed process group is initialized and available, the device is set to `cuda`.
It seems to me that the metrics functionality of this class was previously implemented here and is now lost. Was this ever really used before? We should double-check. It might have been used for multi-GPU support or something.
It also seems to me that FastSurfer still uses this (so at a minimum, it needs to be adapted there):
FastSurfer/FastSurferCNN/utils/meters.py
Line 75 in 462b00d
self.dice_score = DiceScore(cfg.MODEL.NUM_CLASSES, device=device)
This device variable is being passed through the pipeline, including utils/train.py, but it only seems to be self.device, which is initialized in __init__:
FastSurfer/FastSurferCNN/train.py
Line 79 in 5bd139e
self.device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
In the end, you will need to test the train script with a minimal dataset to make sure it does not get broken. I will find such a dataset for you, but it can also be generated with generate_dataset.py.
)
self.union[i, j] = self.union[i, j] + torch.sum(gt) + torch.sum(pred)

def update(self, output):
It seems to me like update always calls _update_union_intersection.
for i, c1 in enumerate(self.class_ids):
    gt = (labels_batch == c1).float()
    for j, c2 in enumerate(self.class_ids):
        pred = (batch_output == c2).float()
        self.intersection[i, j] = self.intersection[i, j] + torch.sum(
            torch.mul(gt, pred)
        )
        self.union[i, j] = self.union[i, j] + torch.sum(gt) + torch.sum(pred)
Also, it seems like the original implementation had a condition here that would only compute the full confusion matrix if explicitly required. Most of the time, the diagonal of the confusion matrix is enough (see the previous comment), and always computing the full confusion matrix is extremely (prohibitively) expensive computationally. So we should only compute the full confusion matrix if it is explicitly requested.
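A flag-gated update along these lines might look as follows (a sketch using numpy for a dependency-light illustration; the real class operates on torch tensors, and the class, attribute, and flag names here are assumptions):

```python
import numpy as np

class DiceCounts:
    """Accumulates per-class intersection/union counts (hypothetical sketch)."""

    def __init__(self, class_ids, compute_full_cnf=False):
        self.class_ids = list(class_ids)
        self.compute_full_cnf = compute_full_cnf
        n = len(self.class_ids)
        self.intersection = np.zeros((n, n))
        self.union = np.zeros((n, n))

    def update(self, batch_output, labels_batch):
        for i, c1 in enumerate(self.class_ids):
            gt = labels_batch == c1
            # O(C) diagonal-only pass by default; the full O(C^2)
            # confusion matrix only when explicitly requested.
            cols = enumerate(self.class_ids) if self.compute_full_cnf else [(i, c1)]
            for j, c2 in cols:
                pred = batch_output == c2
                self.intersection[i, j] += np.sum(gt & pred)
                self.union[i, j] += np.sum(gt) + np.sum(pred)
```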
def comput_dice_cnf(self):
This would be the confusion matrix :)
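Given the accumulated counts, the per-class Dice reduces to 2·intersection / union on the diagonal; a minimal sketch, assuming the intersection/union matrices from the snippet above (the PR's actual method is named comput_dice_cnf and may also return the confusion matrix):

```python
import numpy as np

def compute_dice(intersection, union, eps=1e-8):
    """Per-class Dice from accumulated counts: 2*|A∩B| / (|A| + |B|).
    Hypothetical helper; eps guards against division by zero."""
    return 2.0 * np.diag(intersection) / (np.diag(union) + eps)
```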
Also added test configuration files for new filetypes; rebased pytest-feture onto merge-metrics; added support for multiple images in test_images.py
…ferCNN.utils.metrics.py - Deleting CerebNet.utils.metrics.py - refactoring usage across files - Changing back to numpy dice score implementation - removing CerebNet.utils.metrics.py from docs
- quicktest.yaml: commenting out trigger on pull request - FastSurferCNN.utils.metrics.py: docstring changes