NÁLEPA, Filip, Michal BATKO and Pavel ZEZULA. Cache and Priority Queue Based Approximation Technique for a Stream of Similarity Search Queries. In Christian Beecks, Felix Borutta, Peer Kröger, Thomas Seidl. Similarity Search and Applications : 10th International Conference, SISAP 2017, Munich, Germany, October 4-6, 2017, Proceedings. Cham: Springer, Cham, 2017, p. 17-33. ISBN 978-3-319-68473-4. Available from: https://dx.doi.org/10.1007/978-3-319-68474-1_2.
Other formats:   BibTeX LaTeX RIS
Basic information
Original name Cache and Priority Queue Based Approximation Technique for a Stream of Similarity Search Queries
Authors NÁLEPA, Filip (203 Czech Republic, guarantor, belonging to the institution), Michal BATKO (203 Czech Republic, belonging to the institution) and Pavel ZEZULA (203 Czech Republic, belonging to the institution).
Edition Cham, Similarity Search and Applications : 10th International Conference, SISAP 2017, Munich, Germany, October 4-6, 2017, Proceedings, p. 17-33, 17 pp. 2017.
Publisher Springer, Cham
Other information
Original language English
Type of outcome Proceedings paper
Field of Study 10201 Computer sciences, information science, bioinformatics
Country of publisher Switzerland
Confidentiality degree is not subject to a state or trade secret
Publication form printed version "print"
Impact factor Impact factor: 0.402 in 2005
RIV identification code RIV/00216224:14330/17:00095026
Organization unit Faculty of Informatics
ISBN 978-3-319-68473-4
ISSN 0302-9743
Doi http://dx.doi.org/10.1007/978-3-319-68474-1_2
UT WoS 000616693000002
Keywords in English approximate similarity search; stream of kNN queries
Tags DISA, firank_B
Tags International impact, Reviewed
Changed by Changed by: RNDr. Filip Nálepa, Ph.D., učo 359760. Changed: 23/5/2018 11:10.
Abstract
Content-based similarity search techniques have been employed in a variety of today applications. In our work, we aim at the scenario when the similarity search is applied in the context of stream processing. In particular, there is a stream of query objects which need to be evaluated. Our goal is to be able to cope with the rate of incoming query objects (i.e., to reach sufficient throughput) and, at the same time, to preserve the quality of the obtained results at high levels. We propose an approximation technique for the similarity search which combines the probability of an indexed object to be a part of a query result and the time needed to examine the object. We are able to achieve better trade-off between the efficiency (processing time) and the quality (precision) of the similarity search compared to traditional priority queue based approximation techniques.
Links
GA16-18889S, research and development projectName: Analytika pro velká nestrukturovaná data (Acronym: Big Data Analytics for Unstructured Data)
Investor: Czech Science Foundation
PrintDisplayed: 27/4/2024 15:17