Cache and Priority Queue Based Approximation Technique for a
Stream of Similarity Search Queries

NÁLEPA, Filip, Michal BATKO and Pavel ZEZULA. Cache and Priority Queue Based Approximation Technique for a Stream of Similarity Search Queries. In Christian Beecks, Felix Borutta, Peer Kröger, Thomas Seidl. Similarity Search and Applications : 10th International Conference, SISAP 2017, Munich, Germany, October 4-6, 2017, Proceedings. Cham: Springer, Cham, 2017, p. 17-33. ISBN 978-3-319-68473-4. Available from: https://dx.doi.org/10.1007/978-3-319-68474-1_2.

Other formats: BibTeX LaTeX RIS

Basic information
Original name	Cache and Priority Queue Based Approximation Technique for a Stream of Similarity Search Queries
Authors	NÁLEPA, Filip (203 Czech Republic, guarantor, belonging to the institution), Michal BATKO (203 Czech Republic, belonging to the institution) and Pavel ZEZULA (203 Czech Republic, belonging to the institution).
Edition	Cham, Similarity Search and Applications : 10th International Conference, SISAP 2017, Munich, Germany, October 4-6, 2017, Proceedings, p. 17-33, 17 pp. 2017.
Publisher	Springer, Cham

Other information
Original language	English
Type of outcome	Proceedings paper
Field of Study	10201 Computer sciences, information science, bioinformatics
Country of publisher	Switzerland
Confidentiality degree	is not subject to a state or trade secret
Publication form	printed version "print"
Impact factor	Impact factor: 0.402 in 2005
RIV identification code	RIV/00216224:14330/17:00095026
Organization unit	Faculty of Informatics
ISBN	978-3-319-68473-4
ISSN	0302-9743
Doi	http://dx.doi.org/10.1007/978-3-319-68474-1_2
UT WoS	000616693000002
Keywords in English	approximate similarity search; stream of kNN queries
Tags	DISA, firank_B
Tags	International impact, Reviewed
Changed by	Changed by: RNDr. Filip Nálepa, Ph.D., učo 359760. Changed: 23/5/2018 11:10.

Abstract

Content-based similarity search techniques have been employed in a variety of today applications. In our work, we aim at the scenario when the similarity search is applied in the context of stream processing. In particular, there is a stream of query objects which need to be evaluated. Our goal is to be able to cope with the rate of incoming query objects (i.e., to reach sufficient throughput) and, at the same time, to preserve the quality of the obtained results at high levels. We propose an approximation technique for the similarity search which combines the probability of an indexed object to be a part of a query result and the time needed to examine the object. We are able to achieve better trade-off between the efficiency (processing time) and the quality (precision) of the similarity search compared to traditional priority queue based approximation techniques.

Links
GA16-18889S, research and development project	Name: Analytika pro velká nestrukturovaná data (Acronym: Big Data Analytics for Unstructured Data)
GA16-18889S, research and development project	Investor: Czech Science Foundation

PrintDisplayed: 27/4/2024 15:17

Cache and Priority Queue Based Approximation Technique for a Stream of Similarity Search Queries

Other applications