Integrating feature attribution methods into the loss function of deep learning classifiers
Callanan, James, Garcia-Cabrera, CarlesORCID: 0000-0001-8139-9647, Belton, NiamhORCID: 0000-0003-4949-4745, Roshchupkin, Gennady and Curran, KathleenORCID: 0000-0003-0095-9337
(2022)
Integrating feature attribution methods into the loss function of deep learning classifiers.
In: 24th Irish Machine Vision and Image Processing Conference, 31 Aug - 2 Sept 2022, Belfast.
ISBN 978-0-9934207-7-1
Feature attribution methods are typically used post-training to judge if a deep learning classifier is using meaningful concepts in an input image when making classifications. In this study, we propose using feature attribution methods to give a classifier automated feedback throughout the training process via a novel loss function. We call such a loss function, a heatmap loss function. Heatmap loss functions enable us to incentivize a model to rely on relevant sections of the input image when making classifications.
Two groups of models were trained, one group with a heatmap loss function and the other using categorical cross entropy (CCE). Models trained with the heatmap loss function were capable of achieving equivalent classification accuracies on a test dataset of synthesised cardiac MRIs. Moreover, HiResCAM heatmaps suggest that these models relied to a greater extent on regions of the input image within the heart.
A further experiment demonstrated how heatmap loss functions can be used to prevent deep learning classifiers from using non-causal concepts that disproportionately co-occur with certain classes when making classifications. This suggests that heatmap loss functions could be used to prevent models from learning dataset biases by directing where the model should be looking when making classifications.
Item Type:
Conference or Workshop Item (Paper)
Event Type:
Conference
Refereed:
Yes
Uncontrolled Keywords:
Loss function; Dataset bias; Grad-CAM; HiResCAM; Deep learning