Automated ranking of chest x-ray radiological finding severity in a binary label setting

Matthew Macpherson, Keerthini Muthuswamy, Ashik Amlani, Vicky Goh, Giovanni Montana

Research output: Chapter in Book/Report/Conference proceedingConference paperpeer-review

Abstract

Machine learning has demonstrated the ability to match or exceed human performance in detecting a range of abnormalities in chest x-rays. However, current models largely operate within a binary classification paradigm using fixed decision thresholds, whereas many clinical findings can be more usefully described on a scale of severity which a skilled radiologist will incorporate into a more nuanced report. This limitation is due, in part, to the difficulty and expense of manually annotating fine-grained labels for training and test images versus the relative ease of automatically extracting binary labels from the associated free text reports using NLP algorithms. In this paper we examine the ability of models trained with only binary training data to give useful abnormality severity information from their raw outputs. We assess performance using manually ranked test sets for each of five findings: cardiomegaly, consolidation, paratracheal hilar changes, pleural effusion and subcutaneous emphysema. We find the raw model output predicts human-assessed severity ranking with Spearman’s rank coefficients between 0.563 - 0.848. Using patient age as an additional variable with full ground truth ranking available, we compare a binary classifier output against a fully supervised RankNet model, quantifying the increase in training data required for equivalent performance. We show that model performance is improved using a semi-supervised approach supplementing a smaller set of fully supervised images with a larger set of binary labelled images.

Original languageEnglish
Title of host publicationAutomated ranking of chest x-ray radiological finding severity in a binary label setting
Pages949-963
Number of pages15
Volume250
Publication statusPublished - 2024
Event7th International Conference on Medical Imaging with Deep Learning, MIDL 2024 - Paris, France
Duration: 3 Jul 20245 Jul 2024

Publication series

NameProceedings of Machine Learning Research
ISSN (Print)2640-3498

Conference

Conference7th International Conference on Medical Imaging with Deep Learning, MIDL 2024
Country/TerritoryFrance
CityParis
Period3/07/20245/07/2024

Keywords

  • Chest x-ray
  • ranking
  • severity assessment
  • weakly supervised

Fingerprint

Dive into the research topics of 'Automated ranking of chest x-ray radiological finding severity in a binary label setting'. Together they form a unique fingerprint.

Cite this