RetroSnake: a Modular Pipeline for Detection of Human Endogenous Retroviruses in Genome Sequencing data
*Corresponding author for this work
Research output: Contribution to journal › Article › peer-review
Human Endogenous Retroviruses (HERVs) integrated into the genome of vertebrates as a result of ancient exogenous infections and currently comprise ∼8% of our genome. The members of the most recently acquired HERV family, HERV-Ks, still retain potential to produce viral molecules and have been linked to a wide range of diseases including cancer and neurodegeneration. Although a range of tools for HERV discovery in NGS data exist, most of them lack wet-lab validation and they do not cover all steps of the analysis. Here we describe RetroSnake, an end-to-end, modular, computationally efficient and customisable pipeline for the discovery of HERVs in short-read NGS data. RetroSnake is based on an extensively wet-lab validated protocol, it covers all steps of the analysis from raw data to the generation of annotated results presented as an interactive html file, and it is easy to use by life scientists without substantial computational training.