Similarity Grid for Searching in Metric Spaces

descriptionPublicationkeyboard_double_arrow_right Part of book or chapter of book , Article , Conference object , Other literature type 01 Jan 2005 Italy English Publisher:Springer Berlin Heidelberg

Authors: Batko M; Gennaro C; Zezula P;

doi: 10.1007/11549819_3

handle: 20.500.14243/39993

Similarity Grid for Searching in Metric Spaces

- Summary
- Subjects
- Metrics

Abstract

Similarity search in metric spaces represents an important paradigm for content-based retrieval of many applications. Existing centralized search structures can speed-up retrieval, but they do not scale up to large volume of data because the response time is linearly increasing with the size of the searched file. The proposed GHT* index is a scalable and distributed structure. By exploiting parallelism in a dynamic network of computers, the GHT* achieves practically constant search time for similarity range queries in data-sets of arbitrary size. The structure also scales well with respect to the growing volume of retrieved data. Moreover, a small amount of replicated routing information on each server increases logarithmically. At the same time, the potential for interquery parallelism is increasing with the growing data-sets because the relative number of servers utilized by individual queries is decreasing. All these properties are verified by experiments on a prototype system using real-life data-sets.

Country

Italy

Related Organizations

National Research Council
Italy
National Research Council
Romania
Masaryk University
Czech Republic
Institute of Information Science and Technologies "A. Faedo"
Italy

Keywords

H.3.3 Information Search and Retrieval, H.3.4 Systems and Software

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	16
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Top 10%
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Top 10%