
Threshold queries are an important class of queries that only require computing or counting answers up to a specified threshold value. To the best of our knowledge, threshold queries have been largely disregarded in the research literature, which is surprising considering how common they are in practice. In this paper, we present a deep theoretical analysis of threshold query evaluation and show that thresholds can be used to significantly improve the asymptotic bounds of state-of-the-art query evaluation algorithms. We also empirically show that threshold queries are significant in practice. In surprising contrast to conventional wisdom, we found important scenarios in real-world data sets in which users are interested in computing the results of queries up to a certain threshold, independent of a ranking function that orders the query results.
FOS: Computer and information sciences, Theory and algorithms for application domains, Database query processing, Database theory, Databases (cs.DB), Database management system engines, Computer Science - Databases, Query languages, Information systems, [INFO.INFO-DB] Computer Science [cs]/Databases [cs.DB], Data management systems, Theory of computation
FOS: Computer and information sciences, Theory and algorithms for application domains, Database query processing, Database theory, Databases (cs.DB), Database management system engines, Computer Science - Databases, Query languages, Information systems, [INFO.INFO-DB] Computer Science [cs]/Databases [cs.DB], Data management systems, Theory of computation
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 8 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 10% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Top 10% |
