Full-Gradient Representation for Neural Network Visualization

We introduce a new tool for interpreting neural net responses, namely full-gradients, which decomposes the neural net response into input sensitivity and per-neuron sensitivity components. This is the first proposed representation that satisfies two key properties, completeness and weak dependence, which provably cannot both be satisfied by any saliency map-based interpretability method. For convolutional nets, we also propose an approximate saliency map representation, called FullGrad, obtained by aggregating the full-gradient components. We experimentally evaluate the usefulness of FullGrad in explaining model behavior with two quantitative tests: pixel perturbation and remove-and-retrain. Our experiments reveal that FullGrad explains model behavior correctly and more comprehensively than competing methods in the literature. Visual inspection also shows that the resulting saliency maps are sharper and more tightly confined to object regions.
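To make the decomposition concrete, below is a minimal PyTorch sketch of a FullGrad-style saliency map: the input-gradient term (input times its gradient) is combined with per-neuron bias terms (bias times the gradient at each layer's output), each post-processed by an absolute-value-and-rescale step and upsampled to input resolution. This is an illustrative sketch under stated assumptions, not the authors' released implementation: it only handles explicit conv/batch-norm biases (the paper additionally accounts for implicit batch-norm biases), and the names `fullgrad_saliency` and `_postprocess` are ours.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

def _postprocess(t):
    # psi in the paper: absolute value, then per-example min-max rescaling.
    t = t.abs()
    flat = t.flatten(1)
    t_min = flat.min(dim=1)[0].view(-1, 1, 1, 1)
    t_max = flat.max(dim=1)[0].view(-1, 1, 1, 1)
    return (t - t_min) / (t_max - t_min + 1e-12)

def fullgrad_saliency(model, x, target_class):
    model.eval()
    feature_grads, biases = [], []

    def backward_hook(module, grad_in, grad_out):
        # grad_out[0] is the gradient w.r.t. the layer's (biased) output;
        # since the bias is broadcast over spatial positions, this is also
        # the gradient w.r.t. the bias at each position.
        feature_grads.append(grad_out[0])
        biases.append(module.bias.detach())

    hooks = [m.register_full_backward_hook(backward_hook)
             for m in model.modules()
             if isinstance(m, (nn.Conv2d, nn.BatchNorm2d))
             and m.bias is not None]

    x = x.clone().requires_grad_(True)
    score = model(x)[:, target_class].sum()
    input_grad, = torch.autograd.grad(score, x)
    for h in hooks:
        h.remove()

    # Input-gradient component: psi(grad_x f(x) * x), summed over channels.
    saliency = _postprocess(input_grad * x).sum(dim=1, keepdim=True)

    # Per-neuron (bias) components: psi(f^b(x) * b), upsampled and summed.
    for g, b in zip(feature_grads, biases):
        term = _postprocess(g * b.view(1, -1, 1, 1)).sum(dim=1, keepdim=True)
        saliency += F.interpolate(term, size=x.shape[-2:],
                                  mode="bilinear", align_corners=False)
    return saliency
```

Summing the rescaled per-channel maps and bilinearly upsampling the bias terms to input resolution mirrors the aggregation step described above; other channel reductions are possible and change only the approximation, not the underlying full-gradient decomposition.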
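The pixel-perturbation test mentioned above can be sketched in the same spirit: remove the pixels a saliency map deems least important and check that the model's output barely changes. The zero baseline and the removal fraction below are our assumptions for illustration, not the paper's exact protocol.

```python
def pixel_perturbation(model, x, saliency, target_class, frac=0.1):
    # Zero out the `frac` least-salient pixels (black baseline is an
    # assumption) and measure the absolute change in the target-class
    # score; a faithful saliency map yields a small change.
    k = int(frac * saliency[0].numel())
    idx = saliency.flatten(1).argsort(dim=1)[:, :k]   # least salient first
    mask = torch.ones_like(saliency).flatten(1)
    mask.scatter_(1, idx, 0.0)
    mask = mask.view_as(saliency)                     # (N, 1, H, W)
    with torch.no_grad():
        orig = model(x)[:, target_class]
        pert = model(x * mask)[:, target_class]
    return (orig - pert).abs()
```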


Presented at: Advances in Neural Information Processing Systems (NeurIPS), 2019