Towards Black-Box Explainability With Gaussian Discriminant Knowledge Distillation

Haselhoff, Anselm; Kronenberger, Jan; Kuppers, Fabian; Schneider, Jonas

Anselm Haselhoff, Jan Kronenberger, Fabian Kuppers, Jonas Schneider; Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, 2021, pp. 21-28

Abstract

In this paper, we propose a method for post-hoc explainability of black-box models. The key component of the semantic and quantitative local explanation is a knowledge distillation (KD) process which is used to mimic the teacher's behavior by means of an explainable generative model. Therefore, we introduce a Concept Probability Density Encoder (CPDE) in conjunction with a Gaussian Discriminant Decoder (GDD) to describe the contribution of high-level concepts (e.g. object parts, color, shape). We argue that our objective function encourages both, an explanation given by a set of likelihood ratios and a measure to describe how far the explainer deviates from the training data distribution of the concepts. The method can leverage any pre-trained concept classifier that admits concept scores (e.g. logits) or probabilities. We demonstrate the effectiveness of the proposed method in the context of object detection utilizing the DensePose dataset.

Related Material

[pdf]

[bibtex]

@InProceedings{Haselhoff_2021_CVPR, author = {Haselhoff, Anselm and Kronenberger, Jan and Kuppers, Fabian and Schneider, Jonas}, title = {Towards Black-Box Explainability With Gaussian Discriminant Knowledge Distillation}, booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops}, month = {June}, year = {2021}, pages = {21-28} }