descriptionPublicationkeyboard_double_arrow_right Article 05 Jul 2022 Finland English Publisher:Springer Science and Business Media LLCJournal:Algorithmica, volume 84, pages 3,008-3,033 (issn: 0178-4617, eissn: 1432-0541,

Copyright policy )Funded by:AKA | Foundations of Safe and C..., EC | SAFEBIO, UKRI | Digitilisation of medicin... +1 projects

Authors: Nicola Rizzo; Alexandru I. Tomescu; Alberto Policriti;

doi: 10.1007/s00453-022-00989-x

handle: 10138/349646 , 11390/1229284

Solving String Problems on Graphs Using the Labeled Direct Product

- Summary
- Subjects
- Related research
  (2)
- Metrics

Abstract

AbstractSuffix trees are an important data structure at the core of optimal solutions to many fundamental string problems, such as exact pattern matching, longest common substring, matching statistics, and longest repeated substring. Recent lines of research focused on extending some of these problems to vertex-labeled graphs, either by using efficient ad-hoc approaches which do not generalize to all input graphs, or by indexing difficult graphs and having worst-case exponential complexities. In the absence of an ubiquitous and polynomial tool like the suffix tree for labeled graphs, we introduce the labeled direct product of two graphs as a general tool for obtaining optimal algorithms in the worst case: we obtain conceptually simpler algorithms for the quadratic problems of string matching () and longest common substring () in labeled graphs. Our algorithms run in time linear in the size of the labeled product graph, which may be smaller than quadratic for some inputs, and their run-time is predictable, because the size of the labeled direct product graph can be precomputed efficiently. We also solve on graphs containing cycles, which was left as an open problem by Shimohira et al. in 2011. To show the power of the labeled product graph, we also apply it to solve the matching statistics () and the longest repeated string () problems in labeled graphs. Moreover, we show that our (worst-case quadratic) algorithms are also optimal, conditioned on the Orthogonal Vectors Hypothesis. Finally, we complete the complexity picture around by studying it on undirected graphs.

Country

Finland

Related Organizations

University of Helsinki
Finland
University of Udine
Italy

Keywords

FINITE AUTOMATA, COMPLEXITY, Computer and information sciences, Fine-grained complexity, Graph algorithm, SUFFIX TREE, Longest common substring, String algorithm, AMBIGUITY, Longest repeated substring, Motif discovery

2 Research products, page 1 of 1

The Labeled Direct Product Optimally Solves String Problems on Graphs
2021HasVersion
The Labeled Direct Product Optimally Solves String Problems on Graphs
2021IsAmongTopNSimilarDocuments

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	3
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Top 10%
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average