The Effect of the Loss on Generalization: Empirical Study on Synthetic Lung Nodule Data

Vasileios Baltatzis*, Loïc Le Folgoc, Sam Ellis, Octavio E.Martinez Manzanera, Kyriaki Margarita Bintsi, Arjun Nair, Sujal Desai, Ben Glocker, Julia A. Schnabel

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference paperpeer-review

1 Citation (Scopus)

Abstract

Convolutional Neural Networks (CNNs) are widely used for image classification in a variety of fields, including medical imaging. While most studies deploy cross-entropy as the loss function in such tasks, a growing number of approaches have turned to a family of contrastive learning-based losses. Even though performance metrics such as accuracy, sensitivity and specificity are regularly used for the evaluation of CNN classifiers, the features that these classifiers actually learn are rarely identified and their effect on the classification performance on out-of-distribution test samples is insufficiently explored. In this paper, motivated by the real-world task of lung nodule classification, we investigate the features that a CNN learns when trained and tested on different distributions of a synthetic dataset with controlled modes of variation. We show that different loss functions lead to different features being learned and consequently affect the generalization ability of the classifier on unseen data. This study provides some important insights into the design of deep learning solutions for medical imaging tasks.

Original languageEnglish
Title of host publicationInterpretability of Machine Intelligence in Medical Image Computing, and Topological Data Analysis and Its Applications for Medical Data - 4th International Workshop, iMIMIC 2021, and 1st International Workshop, TDA4MedicalData 2021, Held in Conjunction with MICCAI 2021, Proceedings
EditorsMauricio Reyes, Pedro Henriques Abreu, Jaime Cardoso, Mustafa Hajij, Ghada Zamzmi, Paul Rahul, Lokendra Thakur
PublisherSpringer Science and Business Media Deutschland GmbH
Pages56-64
Number of pages9
ISBN (Print)9783030874438
DOIs
Publication statusPublished - 2021
Event4th International Workshop on Interpretability of Machine Intelligence in Medical Image Computing, iMIMIC 2020 and 1st International Workshop on Topological Data Analysis and Its Applications for Medical Data, TDA4MedicalData 2021 held in conjunction with 24th International Conference on Medical Image Computing and Computer Assisted Intervention, MICCAI 2021 - Strasbourg, France
Duration: 27 Sept 202127 Sept 2021

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume12929 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference4th International Workshop on Interpretability of Machine Intelligence in Medical Image Computing, iMIMIC 2020 and 1st International Workshop on Topological Data Analysis and Its Applications for Medical Data, TDA4MedicalData 2021 held in conjunction with 24th International Conference on Medical Image Computing and Computer Assisted Intervention, MICCAI 2021
Country/TerritoryFrance
CityStrasbourg
Period27/09/202127/09/2021

Keywords

  • Contrastive learning
  • Distribution shift
  • Interpretability

Fingerprint

Dive into the research topics of 'The Effect of the Loss on Generalization: Empirical Study on Synthetic Lung Nodule Data'. Together they form a unique fingerprint.

Cite this