V originále
In order to speedup retrieval in large collections of data, index structures partition the data into subsets so that query requests can be evaluated without examining the entire collection. As the complexity of modern data types grows, metric spaces have become a popular paradigm for similarity retrieval. We propose a new index structure, called D-Index, that combines a novel clustering technique and the pivot-based distance searching strategy to speed up execution of similarity range and nearest neighbor queries for large files with objects stored in disk memories. We have qualitatively analyzed D-Index and verified its properties on actual implementation. We have also compared D-Index with other index structures and demonstrated its superiority on several real-life data sets. Contrary to tree organizations, the D-Index structure is suitable for dynamic environments with a high rate of delete/insert operations.
Česky
Navrhujeme novou indexovací strukturu D-Index, která kombinuje novou shlukovací techniku a pivotovací filtrování s cílem urychlit provádění podobnostních dotazů. Vlastnosti D-Indexu jsou experimentálně ověřeny a struktura je porovnávána s jinými řešeními pro metrické prostory.