Multi-omic Data Integration and Feature Selection for Survival-Based Patient Stratification via Supervised Concrete Autoencoders

Pedro Henrique da Costa Avelar*, Roman Laddach, Sophia N. Karagiannis, Min Wu, Sophia Tsoka

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference paperpeer-review

Abstract

Cancer is a complex disease with significant social and economic impact. Advancements in high-throughput molecular assays and the reduced cost for performing high-quality multi-omic measurements have fuelled insights through machine learning. Previous studies have shown promise on using multiple omic layers to predict survival and stratify cancer patients. In this paper, we develop and report a Supervised Autoencoder (SAE) model for survival-based multi-omic integration, which improves upon previous work, as well as a Concrete Supervised Autoencoder model (CSAE) which uses feature selection to jointly reconstruct the input features as well as to predict survival. Our results show that our models either outperform or are on par with some of the most commonly used baselines, while either providing a better survival separation (SAE) or being more interpretable (CSAE). Feature selection stability analysis on our models shows a power-law relationship with features commonly associated with survival. The code for this project is available at: https://github.com/phcavelar/coxae.

Original languageEnglish
Title of host publicationMachine Learning, Optimization, and Data Science - 8th International Workshop, LOD 2022, Revised Selected Papers
EditorsGiuseppe Nicosia, Giovanni Giuffrida, Varun Ojha, Emanuele La Malfa, Gabriele La Malfa, Panos Pardalos, Giuseppe Di Fatta, Renato Umeton
PublisherSpringer Science and Business Media Deutschland GmbH
Pages47-61
Number of pages15
ISBN (Print)9783031258909
DOIs
Publication statusPublished - 2023
Event8th International Conference on Machine Learning, Optimization, and Data Science, LOD 2022, held in conjunction with the 2nd Advanced Course and Symposium on Artificial Intelligence and Neuroscience, ACAIN 2022 - Certosa di Pontignano, Italy
Duration: 18 Sept 202222 Sept 2022

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume13811 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference8th International Conference on Machine Learning, Optimization, and Data Science, LOD 2022, held in conjunction with the 2nd Advanced Course and Symposium on Artificial Intelligence and Neuroscience, ACAIN 2022
Country/TerritoryItaly
CityCertosa di Pontignano
Period18/09/202222/09/2022

Keywords

  • Concrete autoencoders
  • Multi-omic integration
  • Supervised autoencoders
  • Survival prediction
  • Survival stratification

Fingerprint

Dive into the research topics of 'Multi-omic Data Integration and Feature Selection for Survival-Based Patient Stratification via Supervised Concrete Autoencoders'. Together they form a unique fingerprint.

Cite this