Labeling Neural Representations with Inverse Recognition

Accepted at 37th Conference on Neural Information Processing Systems (NeurIPS 2023)

Inverse Recognition (INVERT) is a method designed to enhance our understanding of the representations learned by Deep Neural Networks (DNNs). It aims to bridge the gap between these complex, hierarchical data representations and human-understandable concepts. Unlike existing global explainability methods, INVERT is more scalable and less reliant on resources such as segmentation masks. It also offers an interpretable metric that measures the alignment between the representation and its explanation, providing a degree of statistical significance.

You can it via pip as shown below:

! git clone https://github.com/lapalap/invert.git --quiet
! pip install git+file:///content/invert --quiet

You can get started with the following colab notebook.

@article{bykov2023labeling,
  title={Labeling Neural Representations with Inverse Recognition},
  author={Bykov, Kirill and Kopf, Laura and Nakajima, Shinichi and Kloft, Marius and H{\"o}hne, Marina M-C},
  journal={arXiv preprint arXiv:2311.13594},
  year={2023}
}

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Labeling Neural Representations with Inverse Recognition

Accepted at 37th Conference on Neural Information Processing Systems (NeurIPS 2023)

Files

README.md

Latest commit

History

README.md

File metadata and controls

Labeling Neural Representations with Inverse Recognition

Accepted at 37th Conference on Neural Information Processing Systems (NeurIPS 2023)