
XPath is the standard language for addressing parts of an XML document. We present a sound and complete decision procedure for containment of XPath queries. The considered XPath fragment covers most of the language features used in practice. Specifically, we show how XPath queries can be translated into equivalent formulas in monadic second-order logic. Using this translation, we construct an optimized logical formulation of the containment problem, which is decided using tree automata. When the containment relation does not hold between two XPath expressions, a counter-example XML tree is generated. We provide a complexity analysis together with practical experiments that illustrate the efficiency of the decision procedure for realistic scenarios.
[INFO.INFO-TT]Computer Science [cs]/Document and Text Processing, [INFO.INFO-PL]Computer Science [cs]/Programming Languages [cs.PL], [INFO.INFO-WB] Computer Science [cs]/Web, [INFO.INFO-WB]Computer Science [cs]/Web, [INFO.INFO-TT] Computer Science [cs]/Document and Text Processing, 004, [INFO.INFO-PL] Computer Science [cs]/Programming Languages [cs.PL]
[INFO.INFO-TT]Computer Science [cs]/Document and Text Processing, [INFO.INFO-PL]Computer Science [cs]/Programming Languages [cs.PL], [INFO.INFO-WB] Computer Science [cs]/Web, [INFO.INFO-WB]Computer Science [cs]/Web, [INFO.INFO-TT] Computer Science [cs]/Document and Text Processing, 004, [INFO.INFO-PL] Computer Science [cs]/Programming Languages [cs.PL]
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 12 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Top 10% | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Top 10% |
