Multi-query Optimization for Distributed Similarity Query Processing

Publication type: Article, Conference object
Date: 01 Jun 2008
Publisher: IEEE
Journal: 2008 The 28th International Conference on Distributed Computing Systems

Authors: Yi Zhuang; Qing Li; Lei Chen

doi: 10.1109/icdcs.2008.58

Abstract

This paper considers multi-query optimization for distributed similarity query processing, which attempts to exploit the dependencies in the derivation of a query evaluation plan. To the best of our knowledge, this is the first work investigating a multi-query optimization technique for distributed similarity query processing (MDSQ). Four steps are incorporated in our MDSQ algorithm. First, when a number of query requests (i.e., m query vectors and m radii) are simultaneously submitted by users, a cost-based dynamic query scheduling (DQS) procedure is invoked to quickly and effectively identify the correlation among the query spheres (requests). After that, an index-based vector set reduction is performed at the data node level in parallel. Finally, a refinement process of the candidate vectors is conducted to get the answer set. The proposed method includes a cost-based dynamic query scheduling procedure, a Start-Distance (SD)-based load balancing scheme, and an index-based vector set reduction algorithm. The experimental results validate the efficiency and effectiveness of the algorithm in minimizing the response time and increasing the parallelism of I/O and CPU.
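
The abstract does not give the algorithm's details, so the following is only a minimal single-machine sketch of the general ideas it names, not the authors' MDSQ method. It assumes Euclidean distance, treats two query spheres as correlated when they intersect, and substitutes a simple pivot-based triangle-inequality filter for the index-based vector set reduction. The names `overlapping_pairs`, `multi_range_query`, and `pivot` are hypothetical and introduced here for illustration.

```python
import math
from itertools import combinations


def overlapping_pairs(queries):
    """Return index pairs (i, j) whose query spheres intersect.

    queries is a list of (center, radius) tuples. Two spheres intersect
    when the distance between their centers is at most the sum of radii.
    This is an assumed, simplified notion of query correlation.
    """
    pairs = []
    for (i, (ci, ri)), (j, (cj, rj)) in combinations(enumerate(queries), 2):
        if math.dist(ci, cj) <= ri + rj:
            pairs.append((i, j))
    return pairs


def multi_range_query(data, queries, pivot):
    """Answer several range queries in one shared scan over data.

    Filter step: by the triangle inequality,
    |d(v, pivot) - d(q, pivot)| <= d(v, q), so a vector whose lower
    bound already exceeds the radius cannot be an answer.
    Refinement step: compute the exact distance for surviving candidates.
    """
    data_to_pivot = [math.dist(v, pivot) for v in data]
    query_to_pivot = [math.dist(c, pivot) for c, _ in queries]
    results = [[] for _ in queries]
    for vid, v in enumerate(data):
        for qid, (center, radius) in enumerate(queries):
            if abs(data_to_pivot[vid] - query_to_pivot[qid]) > radius:
                continue  # pruned without computing the exact distance
            if math.dist(v, center) <= radius:
                results[qid].append(vid)
    return results


data = [(0, 0), (1, 0), (5, 5), (6, 5)]
queries = [((0, 0), 1.5), ((1, 1), 1.5), ((5, 5), 1.0)]
print(overlapping_pairs(queries))
print(multi_range_query(data, queries, pivot=(0, 0)))
```

In this sketch the pivot distances are computed once and reused by every query, which is the kind of shared work a multi-query approach tries to exploit. The scheduling, load balancing, and distribution across data nodes described in the abstract are not modeled.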

Related Organizations

- University of Hong Kong, China (People's Republic of)
- The Hong Kong University of Science and Technology (Guangzhou), China (People's Republic of)
- Hong Kong University of Science and Technology, Hong Kong
- Zhejiang Ocean University, China (People's Republic of)
- City University of Hong Kong, China (People's Republic of)

Impact by BIP!

- Selected citations: 6. These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
- Popularity: Average. This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
- Influence: Top 10%. This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
- Impulse: Average. This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.