TY - CHAP
T1 - Efficient computation of clustered-clumps in degenerate strings
AU - Iliopoulos, Costas
AU - Kundu, Ritu
AU - Mohamed, Manal
PY - 2016/9/2
Y1 - 2016/9/2
N2 - Given a finite set of patterns, a clustered-clump is a maximal overlapping set of occurrences of such patterns. Several solutions have been presented for identifying clustered-clumps based on statistical, probabilistic, and most recently, formal language theory techniques. Here, motivated by applications in molecular biology and computer vision, we present efficient algorithms, using String Algorithm techniques, to identify clustered-clumps in a given text. The proposed algorithms compute in O(n + m) time the occurrences of all clusteredclumps for a given set of degenerate patterns P and/or degenerate text T of total lengths m and n, respectively; such that the total number of non-solid symbols in P and T is bounded by a fixed positive integer d.
AB - Given a finite set of patterns, a clustered-clump is a maximal overlapping set of occurrences of such patterns. Several solutions have been presented for identifying clustered-clumps based on statistical, probabilistic, and most recently, formal language theory techniques. Here, motivated by applications in molecular biology and computer vision, we present efficient algorithms, using String Algorithm techniques, to identify clustered-clumps in a given text. The proposed algorithms compute in O(n + m) time the occurrences of all clusteredclumps for a given set of degenerate patterns P and/or degenerate text T of total lengths m and n, respectively; such that the total number of non-solid symbols in P and T is bounded by a fixed positive integer d.
KW - Clustered-clump
KW - Conservative degenerate string
KW - Overlapping occurrences
KW - Pattern
UR - http://www.scopus.com/inward/record.url?scp=84988521326&partnerID=8YFLogxK
U2 - 10.1007/978-3-319-44944-9_45
DO - 10.1007/978-3-319-44944-9_45
M3 - Conference paper
AN - SCOPUS:84988521326
SN - 9783319449432
VL - 475
T3 - IFIP Advances in Information and Communication Technology
SP - 510
EP - 519
BT - IFIP Advances in Information and Communication Technology
PB - Springer New York LLC
T2 - 12th IFIP WG 12.5 International Conference and Workshops on Artificial Intelligence Applications and Innovations, AIAI 2016
Y2 - 16 September 2016 through 18 September 2016
ER -