In several areas, for example in bioinformatics and in AI planning, the Shortest Common Superstring problem (SCS) and variants thereof have been successfully applied for string comparison. In this paper we consider two variants of SCS recently introduced, namely Restricted Common Superstring (RCS) and Swapped Common Superstring (SRCS). In RCS we are given a set S of strings and a multiset M of symbols, and we look for an ordering Mo of M such that the number of input strings which are substrings of Mo is maximized. In SRCS we are given a set S of strings and a text T, and we look for a swap ordering To of T (an ordering of T obtained by swapping only some pairs of adjacent symbols) such that the number of input strings which are substrings of To is maximized. In this paper we propose a multivariate algorithmic analysis of the complexity of the two problems, aiming at determining how different parameters influence the complexity of the two problems. We consider as interesting parameters the size of the solutions (that is the number of input strings contained in the computed superstring), the maximum length of the given input strings, the size of the alphabet over which the input strings range. First, we give two fixed-parameter algorithms, where the parameter is the size of the solution, for SRCS and lRCS (the RCS problem restricted to strings of length bounded by a parameter ℓ). Furthermore, we complement these results by showing that SRCS and lRCS do not admit a polynomial kernel unless NP⊆coNP/Poly. Then, we show that SRCS is APX-hard even when the input strings have length bounded by a constant (equal to 10) or are over a binary alphabet.
Restricted and swap common superstring: a multivariate algorithmic perspective
DONDI, Riccardo;
2015-01-01
Abstract
In several areas, for example in bioinformatics and in AI planning, the Shortest Common Superstring problem (SCS) and variants thereof have been successfully applied for string comparison. In this paper we consider two variants of SCS recently introduced, namely Restricted Common Superstring (RCS) and Swapped Common Superstring (SRCS). In RCS we are given a set S of strings and a multiset M of symbols, and we look for an ordering Mo of M such that the number of input strings which are substrings of Mo is maximized. In SRCS we are given a set S of strings and a text T, and we look for a swap ordering To of T (an ordering of T obtained by swapping only some pairs of adjacent symbols) such that the number of input strings which are substrings of To is maximized. In this paper we propose a multivariate algorithmic analysis of the complexity of the two problems, aiming at determining how different parameters influence the complexity of the two problems. We consider as interesting parameters the size of the solutions (that is the number of input strings contained in the computed superstring), the maximum length of the given input strings, the size of the alphabet over which the input strings range. First, we give two fixed-parameter algorithms, where the parameter is the size of the solution, for SRCS and lRCS (the RCS problem restricted to strings of length bounded by a parameter ℓ). Furthermore, we complement these results by showing that SRCS and lRCS do not admit a polynomial kernel unless NP⊆coNP/Poly. Then, we show that SRCS is APX-hard even when the input strings have length bounded by a constant (equal to 10) or are over a binary alphabet.File | Dimensione del file | Formato | |
---|---|---|---|
algorithmica2015.pdf
Solo gestori di archivio
Versione:
publisher's version - versione editoriale
Licenza:
Licenza default Aisberg
Dimensione del file
620.74 kB
Formato
Adobe PDF
|
620.74 kB | Adobe PDF | Visualizza/Apri |
Pubblicazioni consigliate
Aisberg ©2008 Servizi bibliotecari, Università degli studi di Bergamo | Terms of use/Condizioni di utilizzo