Mapping uniquely occurring short sequences derived from high throughput technologies to a reference genome

P Antoniou, J Daykin, Costas Iliopoulos, D Kourie, L Mouchard, Solon Pissis

Research output: Chapter in Book/Report/Conference proceedingConference paper

16 Citations (Scopus)

Abstract

Novel high throughput sequencing technology methods have redefined the way genome sequencing is performed. They are able to produce tens of millions of short sequences (reads) in a single experiment and with a much lower cost than previous sequencing methods. Due to this massive amount of data generated by the above systems, efficient algorithms for mapping short sequences to a reference genome are in great demand. In this paper, we present a practical algorithm for addressing the problem of efficiently mapping uniquely occuring short reads to a reference genome. This requires the classification of these short reads into unique and duplicate matches. In particular, we define and solve the Massive Exact Unique Pattern Matching problem in genomes.
Original languageEnglish
Title of host publicationITAB 2009
Subtitle of host publication9th International Conference on Information Technology and Applications in Biomedicine, 2009
Place of PublicationPiscataway, N.J.
PublisherIEEE
PagesN/A
Number of pages4
VolumeN/A
EditionN/A
ISBN (Print)9781424453795
DOIs
Publication statusPublished - 2009

Fingerprint

Dive into the research topics of 'Mapping uniquely occurring short sequences derived from high throughput technologies to a reference genome'. Together they form a unique fingerprint.

Cite this