ANTOL, Matej and Vlastislav DOHNAL. BM-index: Balanced Metric Space Index based on Weighted Voronoi Partitioning. In Lecture Notes in Computer Science. Advances in Databases and Information Systems, 23th East European Conference, ADBIS 2019. LNCS 11695. Cham: Springer International Publishing, 2019, p. 337-353. ISBN 978-3-030-28729-0. Available from: https://dx.doi.org/10.1007/978-3-030-28730-6_21.
Other formats:   BibTeX LaTeX RIS
Basic information
Original name BM-index: Balanced Metric Space Index based on Weighted Voronoi Partitioning
Authors ANTOL, Matej (703 Slovakia, belonging to the institution) and Vlastislav DOHNAL (203 Czech Republic, guarantor, belonging to the institution).
Edition LNCS 11695. Cham, Advances in Databases and Information Systems, 23th East European Conference, ADBIS 2019, p. 337-353, 17 pp. 2019.
Publisher Springer International Publishing
Other information
Original language English
Type of outcome Proceedings paper
Field of Study 10201 Computer sciences, information science, bioinformatics
Country of publisher Netherlands
Confidentiality degree is not subject to a state or trade secret
Publication form printed version "print"
WWW URL
Impact factor Impact factor: 0.402 in 2005
RIV identification code RIV/00216224:14330/19:00109747
Organization unit Faculty of Informatics
ISBN 978-3-030-28729-0
ISSN 0302-9743
Doi http://dx.doi.org/10.1007/978-3-030-28730-6_21
UT WoS 000558104700024
Keywords in English Indexing structure;k-nearest neighbor query;Approximate search;Metric space;Voronoi partitioning
Tags DISA, firank_B
Tags International impact, Reviewed
Changed by Changed by: doc. RNDr. Vlastislav Dohnal, Ph.D., učo 2952. Changed: 30/9/2020 12:42.
Abstract
Processing large volumes of various data needs index structures that can efficiently organize them on secondary memory. Methods based on so-called pivot permutations have become popular in addressing these requirements because of their tremendous querying performance. They localize data objects by ordering preselected anchor objects by their distances to the data objects, and so no coordinate system is exploited to partition the data. This represents a generic solution for unstructured and high-dimensional data. In principle, pivot permutations implement recursive Voronoi tessellation. Also, due to the fixed preselected anchors, such partitioning cannot adapt to the data distribution and leads to very unbalanced cells. In this paper, we address this issue and propose a novel schema called BM-index. It exploits weighted Voronoi partitioning to create pivot permutations that adapt to data distribution. Secondary memory is then accessed efficiently with respect to the existing disk-oriented structures, such as M-index. We present an algorithm to balance the data partitions, and we show its correctness. In experiments on a real-life image collection CoPhIR, we show superior performance in I/O costs when evaluating k-nearest neighbors queries.
Links
EF16_019/0000822, research and development projectName: Centrum excelence pro kyberkriminalitu, kyberbezpečnost a ochranu kritických informačních infrastruktur
MUNI/A/1411/2019, interní kód MUName: Aplikovaný výzkum: softwarové architektury kritických infrastruktur, bezpečnost počítačových systémů, zpracování přirozeného jazyka a jazykové inženýrství, vizualizaci velkých dat a rozšířená realita.
Investor: Masaryk University, Category A
PrintDisplayed: 17/7/2024 05:36