# King's College London

## Off-line and on-line algorithms for closed string factorization

Research output: Contribution to journalArticlepeer-review

Original language English Theoretical Computer Science 30 Oct 2018 https://doi.org/10.1016/j.tcs.2018.10.033 24 Oct 2018 30 Oct 2018

### Documents

• Off_line_and_on_line_algorithms_ALZAMEL_Firstonline30October2018_GREEN_AAM_CC_BY_NC_ND_.pdf, 304 KB, application/pdf

Version:Accepted author manuscript

Licence:CC BY-NC-ND

## Abstract

A string X=X[1..n], n>1, is said to be closed if it has a nonempty proper prefix that is also a suffix, but that otherwise occurs nowhere else in X; for n=1, every X is closed. Closed strings were introduced by Fici in [1] as objects of combinatorial interest. Recently Badkobeh et al.[2] described a variety of algorithms to factor a given string into closed factors. In particular, they studied the Longest Closed Factorization (LCF) problem, which greedily computes the decomposition X=X1X2⋯Xk, where X1 is the longest closed prefix of X, X2 the longest closed prefix of X with prefix X1 removed, and so on. In this paper we present an O(log⁡n) amortized per character algorithm to compute LCF on-line, where n is the length of the string. We also introduce the Minimum Closed Factorization (MCF) problem, which identifies the minimum number of closed factors that cover X. We first describe an off-line O(nlog2⁡n)-time algorithm to compute MCF(X), then we present an on-line algorithm for the same problem. In fact, we show that MCF(X) can be computed in O(Llog⁡n) time from MCF(X′), where X′=X[1..n−1], and L is the largest integer such that the suffix X[n−L+1..n] is a substring of X′.