Linear-Time Computation of Prefix Table for Weighted Strings

Research output: Chapter in Book/Report/Conference proceedingConference paperpeer-review

5 Citations (Scopus)

Abstract

The prefix table of a string is one of the most fundamental data structures of algorithms on strings: it determines the longest factor at each position of the string that matches a prefix of the string. It can be computed in time linear with respect to the size of the string, and hence it can be used efficiently for locating patterns or for regularity searching in strings. A weighted string is a string in which a set of letters may occur at each position with respective occurrence probabilities. Weighted strings, also known as position weight matrices, naturally arise in many biological contexts; for example, they provide a method to realise approximation among occurrences of the same DNA segment. In this article, given a weighted string x of length n and a constant cumulative weight threshold 1 / z, defined as the minimal probability of occurrence of factors in x, we present an O(n)-time algorithm for computing the prefix table of x.
Original languageEnglish
Title of host publicationCombinatorics on Words
Subtitle of host publication10th International Conference, WORDS 2015, Kiel, Germany, September 14-17, 2015, Proceedings
EditorsFlorin Manea, Dirk Nowotka
PublisherSpringer International Publishing
Pages73-84
Number of pages12
ISBN (Electronic) 978-3-319-23660-5
ISBN (Print)978-3-319-23659-9
DOIs
Publication statusPublished - 27 Aug 2015

Publication series

NameLecture Notes in Computer Science
PublisherSpringer International Publishing
Volume9304
ISSN (Print)0302-9743

Fingerprint

Dive into the research topics of 'Linear-Time Computation of Prefix Table for Weighted Strings'. Together they form a unique fingerprint.

Cite this