MÍČ, Vladimír and Pavel ZEZULA. Data-dependent Metric Filtering. Information systems. 2022, vol. 108, 12.5.2022, p. "101980", 21 pp. ISSN 0306-4379. Available from: https://dx.doi.org/10.1016/j.is.2021.101980.
Other formats:   BibTeX LaTeX RIS
Basic information
Original name Data-dependent Metric Filtering
Authors MÍČ, Vladimír (203 Czech Republic, belonging to the institution) and Pavel ZEZULA (203 Czech Republic, guarantor, belonging to the institution).
Edition Information systems, 2022, 0306-4379.
Other information
Original language English
Type of outcome Article in a journal
Field of Study 10200 1.2 Computer and information sciences
Country of publisher Netherlands
Confidentiality degree is not subject to a state or trade secret
WWW URL
Impact factor Impact factor: 3.700
RIV identification code RIV/00216224:14330/22:00125539
Organization unit Faculty of Informatics
Doi http://dx.doi.org/10.1016/j.is.2021.101980
Keywords in English Metric Space Searching;Similarity Search;Metric Filtering;Data Dependent Filtering
Tags best, DISA
Tags International impact, Reviewed
Changed by Changed by: RNDr. Vladimír Míč, Ph.D., učo 359890. Changed: 12/5/2022 12:50.
Abstract
Filtering is a fundamental strategy of metric similarity indexes to minimise the number of computed distances. Given a triplet of objects for which distances of two pairs are known, the lower and upper bounds on the third distance can be determined using the triangle inequality property. Obviously, tightness of the bounds is crucial for efficiency reasons — the more precise the estimation, the more distance computations can be avoided, and the more efficient the search is. We show that it is not necessary to consider arbitrary angles in triangles formed by pairwise distances of three objects, as specific range of possible angles is data dependent. When considering realistic ranges of angles, the bounds on distances can be much more tight and filtering much more effective. We formalise the problem of the data dependent estimation of bounds on distances and deeply analyse limited angles in triangles of distances. We justify the potential of the data dependent metric filtering both, analytically and experimentally, executing many distance estimations on several real-life datasets.
Links
EF16_019/0000822, research and development projectName: Centrum excelence pro kyberkriminalitu, kyberbezpečnost a ochranu kritických informačních infrastruktur
PrintDisplayed: 27/4/2024 10:51