Regional performance variation in external validation of four prediction models for severity of COVID-19 at hospital admission: An observational multi-centre cohort study

Kristin E. Wickstrom, Valeria Vitelli, Ewan Carr, Aleksander R. Holten, Rebecca Bendayan, Andrew H. Reiner, Daniel Bean, Tom Searle, Anthony Shek, Zeljko Kraljevic, James Teo, Richard Dobson, Kristian Tonby, Alvaro Kohn-Luque, Erik K. Amundsen

Research output: Contribution to journalArticlepeer-review

1 Citation (Scopus)

Abstract

BACKGROUND: Prediction models should be externally validated to assess their performance before implementation. Several prediction models for coronavirus disease-19 (COVID-19) have been published. This observational cohort study aimed to validate published models of severity for hospitalized patients with COVID-19 using clinical and laboratory predictors.

METHODS: Prediction models fitting relevant inclusion criteria were chosen for validation. The outcome was either mortality or a composite outcome of mortality and ICU admission (severe disease). 1295 patients admitted with symptoms of COVID-19 at Kings Cross Hospital (KCH) in London, United Kingdom, and 307 patients at Oslo University Hospital (OUH) in Oslo, Norway were included. The performance of the models was assessed in terms of discrimination and calibration.

RESULTS: We identified two models for prediction of mortality (referred to as Xie and Zhang1) and two models for prediction of severe disease (Allenbach and Zhang2). The performance of the models was variable. For prediction of mortality Xie had good discrimination at OUH with an area under the receiver-operating characteristic (AUROC) 0.87 [95% confidence interval (CI) 0.79-0.95] and acceptable discrimination at KCH, AUROC 0.79 [0.76-0.82]. In prediction of severe disease, Allenbach had acceptable discrimination (OUH AUROC 0.81 [0.74-0.88] and KCH AUROC 0.72 [0.68-0.75]). The Zhang models had moderate to poor discrimination. Initial calibration was poor for all models but improved with recalibration.

CONCLUSIONS: The performance of the four prediction models was variable. The Xie model had the best discrimination for mortality, while the Allenbach model had acceptable results for prediction of severe disease.

Original languageEnglish
Article numbere0255748
Pages (from-to)e0255748
JournalPLoS ONE
Volume16
Issue number8
DOIs
Publication statusPublished - 25 Aug 2021

Fingerprint

Dive into the research topics of 'Regional performance variation in external validation of four prediction models for severity of COVID-19 at hospital admission: An observational multi-centre cohort study'. Together they form a unique fingerprint.

Cite this