Suspect screening of large numbers of emerging contaminants in environmental waters using artificial neural networks for chromatographic retention time prediction and high resolution mass spectrometry data analysis

Research output: Contribution to journal › Article › peer-review

107 Citations (Scopus)
268 Downloads (Pure)

Abstract

The recent development of broad-scope high resolution mass spectrometry (HRMS) screening methods has greatly improved the capability to identify new compounds in environmental samples. However, positive identifications at the ng/L concentration level rely on analytical reference standards for chromatographic retention time (tR) and mass spectral comparisons. Chromatographic tR prediction can increase confidence in suspect screening efforts for new compounds in the environment, especially when standards are not available, but reliable methods are lacking. The current work focuses on the development of artificial neural networks (ANNs) for tR prediction in gradient reversed-phase liquid chromatography and their application, along with HRMS data, to suspect screening of wastewater and environmental surface water samples. Based on a tR dataset of >500 compounds, an optimized 4-layer back-propagation multi-layer perceptron model predicted the tR of 85% of all compounds to within 2 min of their measured tR for the training (n=344) and verification (n=100) datasets. To evaluate the ability of the ANN to generalize to new data, the model was further tested using 100 randomly selected compounds and achieved 95% prediction accuracy within the 2 min elution interval. Given the increasing concern about the presence of drug metabolites and other transformation products (TPs) in the aquatic environment, the model was applied along with HRMS data for preliminary identification of pharmaceutically-related compounds in real samples. Examples of compounds for which reference standards were subsequently acquired and the identifications later confirmed are also presented.
To our knowledge, this work presents, for the first time, the successful application of an accurate retention time predictor and HRMS data-mining, using the largest number of compounds, to preliminarily identify new or emerging contaminants in wastewater and surface waters.
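
The accuracy figures in the abstract are the share of compounds whose predicted tR falls within 2 min of the measured tR. The sketch below shows how such a figure could be computed, and how the same tolerance could be used to check suspect candidates against an observed peak. It is an illustration only, not the authors' implementation: the function names, the `Suspect` class, and all retention times are hypothetical, and the ANN itself is not reproduced.

```python
from dataclasses import dataclass


@dataclass
class Suspect:
    name: str            # hypothetical candidate label
    predicted_tr: float  # predicted retention time, in minutes


def fraction_within_window(measured, predicted, window=2.0):
    """Share of compounds whose predicted tR is within `window` minutes of the measured tR."""
    if len(measured) != len(predicted) or not measured:
        raise ValueError("need two non-empty sequences of equal length")
    hits = sum(abs(m - p) <= window for m, p in zip(measured, predicted))
    return hits / len(measured)


def plausible_suspects(observed_tr, suspects, window=2.0):
    """Keep suspects whose predicted tR is consistent with an observed peak."""
    return [s for s in suspects if abs(s.predicted_tr - observed_tr) <= window]


# Made-up retention times (minutes), for illustration only.
measured = [3.1, 7.4, 10.2, 12.8]
predicted = [3.9, 6.8, 13.0, 12.1]
accuracy = fraction_within_window(measured, predicted)

# Made-up candidates checked against a hypothetical peak at 6.0 min.
candidates = [Suspect("candidate A", 5.2), Suspect("candidate B", 9.6)]
kept = plausible_suspects(observed_tr=6.0, suspects=candidates)
```

In this toy data, three of the four predictions fall inside the 2 min window, and only the first candidate remains plausible for the 6.0 min peak. In practice a tR match of this kind would only raise confidence alongside the HRMS evidence; as the abstract notes, confirmation still relies on reference standards.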
Original language: English
Article number: 18268
Pages (from-to): 934-941
Number of pages: 8
Journal: Science of the Total Environment
Volume: 538
Early online date: 28 Sept 2015
Publication status: Published - 15 Dec 2015

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 6 - Clean Water and Sanitation

Keywords

  • Artificial neural networks
  • Retention time prediction
  • Screening of emerging contaminants
  • Time-of-flight high resolution mass spectrometry
