D 2006

M-Chord: A Scalable Distributed Similarity Search Structure

NOVÁK, David and Pavel ZEZULA

Basic information

Original name

M-Chord: A Scalable Distributed Similarity Search Structure

Name in Czech

M-Chord: Škálovatelná distribuovaná struktura pro podobnostní vyhledávání

Authors

NOVÁK, David (203 Czech Republic, guarantor, belonging to the institution) and Pavel ZEZULA (203 Czech Republic)

Edition

New York, NY, USA, InfoScale '06: Proceedings of the 1st international conference on Scalable information systems, p. 1-10, 10 pp. 2006

Publisher

ACM Press

Other information

Language

English

Type of outcome

Stať ve sborníku

Field of Study

10201 Computer sciences, information science, bioinformatics

Country of publisher

United States of America

Confidentiality degree

není předmětem státního či obchodního tajemství

Publication form

printed version "print"

References:

RIV identification code

RIV/00216224:14330/06:00015364

Organization unit

Faculty of Informatics

ISBN

1-59593-428-6

Keywords in English

distributed data structures; peer-to-peer; similarity search; indexing
Změněno: 17/9/2013 08:56, RNDr. David Novák, Ph.D.

Abstract

V originále

The need for a retrieval based not on the attribute values but on the very data content has recently led to rise of the metric-based similarity search. The computational complexity of such a retrieval and large volumes of processed data call for distributed processing which allows to achieve scalability. In this paper, we propose M-Chord, a distributed data structure for metric-based similarity search. The structure takes advantage of the idea of a vector index method iDistance in order to transform the issue of similarity searching into the problem of interval search in one dimension. The proposed peer-to-peer organization, based on the Chord protocol, distributes the storage space and parallelizes the execution of similarity queries. Promising features of the structure are validated by experiments on the prototype implementation and two real-life datasets.

In Czech

Clanek popisuje novy distribuovany system pro podobnostni vyhledavani v metrickych prostorech. System je zalozeny na peer-to-peer paradigmatu a vyuziva transformacni metodu iDistance a navigacni protokol Chord.

Links

GD102/05/H050, research and development project
Name: Integrovaný přístup k výchově studentů DSP v oblasti paralelních a distribuovaných systémů
Investor: Czech Science Foundation, Integrated approach to education of PhD students in the area of parallel and distributed systems
1ET100300419, research and development project
Name: Inteligentní modely, algoritmy, metody a nástroje pro vytváření sémantického webu
Investor: Academy of Sciences of the Czech Republic, Intelligent Models, Algorithms, Methods and Tools for the Semantic Web (realization)