Contextual semantic interpretability

Marcos, Diego; Lobry, Sylvain; Fong, Ruth; Courty, Nicolas; Flamary, Remi; Tuia, Devis

doi:10.1007/978-3-030-69538-5_22

Marcos, Diego; Lobry, Sylvain; Fong, Ruth; Courty, Nicolas; Flamary, Remi; Tuia, Devis

2020

Download

Formats

Format
BibTeX
MARC
MARCXML
DublinCore
EndNote
NLM
RefWorks
RIS

Files

Abstract

Convolutional neural networks (CNN) are known to learn an image representation that captures concepts relevant to the task, but do so in an implicit way that hampers model interpretability. However, one could argue that such a representation is hidden in the neurons and can be made explicit by teaching the model to recognize semantically interpretable attributes that are present in the scene. We call such an intermediate layer a \emph{semantic bottleneck}. Once the attributes are learned, they can be re-combined to reach the final decision and provide both an accurate prediction and an explicit reasoning behind the CNN decision. In this paper, we look into semantic bottlenecks that capture context: we want attributes to be in groups of a few meaningful elements and participate jointly to the final decision. We use a two-layer semantic bottleneck that gathers attributes into interpretable, sparse groups, allowing them contribute differently to the final output depending on the context. We test our contextual semantic interpretable bottleneck (CSIB) on the task of landscape scenicness estimation and train the semantic interpretable bottleneck using an auxiliary database (SUN Attributes). Our model yields in predictions as accurate as a non-interpretable baseline when applied to a real-world test set of Flickr images, all while providing clear and interpretable explanations for each prediction.

Details

Title Contextual semantic interpretability

Author(s) Marcos, Diego ; Lobry, Sylvain ; Fong, Ruth ; Courty, Nicolas ; Flamary, Remi ; Tuia, Devis

Published in Proceedings of the 15th Asian Conference on Computer Vision

Pagination 17

Series Lecture Notes in Computer Science, 12625

Conference Asian Conference on Computer Vision (ACCV), Kyoto, Japan (held online), November 30, 2020

Date 2020

Publisher Springer-Verlag, Berlin, Heidelberg

ISBN 978-3-030695-31-6

Keywords

Deep learning; Interpretable deep learning; Lanscape scenicness; Ecosystem services

DOI https://doi.org/10.1007/978-3-030-69538-5_22

Other identifier(s) View record in ArXiv

Laboratories ECEO

Record Appears in Scientific production and competences > ENAC - School of Architecture, Civil and Environmental Engineering > IIE - Environmental Engineering Institute > ECEO - Environmental Computational Science and Earth Observation Laboratory
Peer-reviewed publications
Work outside EPFL
Conference Papers

Record creation date 2021-02-18

Actions

Preview

Select file: