GapsMis: flexible sequence alignment with a bounded number of gaps

Research output: Chapter in Book/Report/Conference proceedingConference paper

5 Citations (Scopus)

Abstract

Motivation: Recent developments in next-generation sequencing technologies have renewed interest in pairwise sequence alignment techniques, particularly so for the application of re-sequencing---the assembly of a genome directed by a reference sequence. After the fast alignment between a factor of the reference sequence and the high-quality fragment of a short read, an important problem is to find the best possible alignment between a succeeding factor of the reference sequence and the remaining low-quality part of the read; allowing a number of mismatches and the insertion of gaps in the alignment.

Results: We present GapsMis, a tool for pairwise global and semi-global sequence alignment with a variable, but bounded, number of gaps. It is based on a new algorithm, which computes a different version of the traditional dynamic programming matrix. Millions of pairwise sequence alignments, performed under realistic conditions based on the properties of real full-length genomes, show that GapsMis can increase the accuracy of extending short-read alignments end-to-end compared to more traditional approaches.
Original languageEnglish
Title of host publicationProceedings of the International Conference on Bioinformatics, Computational Biology and Biomedical Informatics
Place of PublicationNew York
PublisherACM
Pages402-411
Number of pages10
ISBN (Print)9781450324342
DOIs
Publication statusPublished - 2013

Fingerprint

Dive into the research topics of 'GapsMis: flexible sequence alignment with a bounded number of gaps'. Together they form a unique fingerprint.

Cite this