Detailed publication record

In the original

Processing large volumes of diverse data requires index structures that can organize the data efficiently in secondary memory. Methods based on so-called pivot permutations have become popular for this purpose because of their strong query performance. They localize each data object by ordering a set of preselected anchor objects (pivots) by their distances to that object, so no coordinate system is needed to partition the data. This makes them a generic solution for unstructured and high-dimensional data. In principle, pivot permutations implement a recursive Voronoi tessellation. However, because the anchors are fixed and preselected, such partitioning cannot adapt to the data distribution and leads to very unbalanced cells. In this paper, we address this issue and propose a novel schema called BM-index. It exploits weighted Voronoi partitioning to create pivot permutations that adapt to the data distribution. Secondary memory is then accessed efficiently in comparison with existing disk-oriented structures, such as M-index. We present an algorithm to balance the data partitions, and we show its correctness. In experiments on CoPhIR, a real-life image collection, we show superior performance in terms of I/O costs when evaluating k-nearest neighbor queries.
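The two ideas named in the abstract, pivot permutations and weighted Voronoi partitioning, can be illustrated with a minimal sketch. It is not the BM-index algorithm: the function names, the Euclidean distance, the sample pivots, and above all the additive form of the weighting are assumptions made for illustration, since the abstract does not say how the weights enter the distance.

```python
import math


def euclidean(a, b):
    """Euclidean distance; stands in for any metric distance function."""
    return math.dist(a, b)


def pivot_permutation(obj, pivots, dist=euclidean):
    """Return pivot indices ordered by increasing distance to obj.

    Ties are broken by pivot index so the result is deterministic.
    """
    return sorted(range(len(pivots)), key=lambda i: (dist(obj, pivots[i]), i))


def weighted_cell(obj, pivots, weights, dist=euclidean):
    """Return the index of the pivot whose weighted cell contains obj.

    ASSUMPTION: additive weighting, i.e. the cell is chosen by minimizing
    dist(obj, pivot_i) + weights[i]. A larger weight shrinks that cell.
    """
    return min(
        range(len(pivots)),
        key=lambda i: (dist(obj, pivots[i]) + weights[i], i),
    )


# Hypothetical pivots and query object in the plane.
pivots = [(0.0, 0.0), (10.0, 0.0), (0.0, 10.0)]
obj = (6.0, 1.0)

permutation = pivot_permutation(obj, pivots)
plain_cell = weighted_cell(obj, pivots, [0.0, 0.0, 0.0])
shifted_cell = weighted_cell(obj, pivots, [0.0, 3.0, 0.0])
```

With all weights at zero, the selected cell is simply the first element of the pivot permutation, which is the ordinary Voronoi cell. Raising the weight of one pivot moves boundary objects into neighboring cells, which is the kind of lever a balancing procedure could use to even out cell sizes.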

BM-index: Balanced Metric Space Index based on Weighted Voronoi Partitioning
