On Classifying Sepsis Heterogeneity in the ICU: Insight Using Machine Learning

Zina Ibrahim, Honghan Wu, Ahmed Hamoud, Lukas Stappen, Richard James Butler Dobson, Andrea Agarossi

Research output: Contribution to journalArticlepeer-review

42 Citations (Scopus)
46 Downloads (Pure)

Abstract

Current machine learning models aiming to predict sepsis from Electronic Health Records (EHR) do not account for the heterogeneity of the condition, despite its emerging importance in prognosis and treatment. This work demonstrates the added value of stratifying the types of organ dysfunction observed in patients who develop sepsis in the ICU in improving the ability to recognise patients at risk of sepsis from their EHR data. Using an ICU dataset of 13,728 records, we identify clinically significant sepsis subpopulations with distinct organ dysfunction patterns. Classification experiments using Random Forest, Gradient Boost Trees and Support Vector Machines, aiming to distinguish patients who develop sepsis in the ICU from those who do not, show that features selected using sepsis subpopulations as background knowledge yield a superior performance regardless of the classification model used. Our findings can steer machine learning efforts towards more personalised models for complex conditions including sepsis.
Original languageEnglish
JournalJournal of the American Medical Informatics Association : JAMIA
DOIs
Publication statusAccepted/In press - 24 Nov 2019

Fingerprint

Dive into the research topics of 'On Classifying Sepsis Heterogeneity in the ICU: Insight Using Machine Learning'. Together they form a unique fingerprint.

Cite this