NÁLEPA, Filip, Michal BATKO and Pavel ZEZULA. Enhancing Similarity Search Throughput by Dynamic Query Reordering. In Hartmann, Sven and Ma, Hui. Database and Expert Systems Applications: 27th International Conference, DEXA 2016, Porto, Portugal, September 5-8, 2016, Proceedings, Part II. Cham: Springer International Publishing, 2016, p. 185-200. ISBN 978-3-319-44405-5. Available from: https://dx.doi.org/10.1007/978-3-319-44406-2_14.
Other formats:   BibTeX LaTeX RIS
Basic information
Original name Enhancing Similarity Search Throughput by Dynamic Query Reordering
Authors NÁLEPA, Filip (203 Czech Republic, guarantor, belonging to the institution), Michal BATKO (203 Czech Republic, belonging to the institution) and Pavel ZEZULA (203 Czech Republic, belonging to the institution).
Edition Cham, Database and Expert Systems Applications: 27th International Conference, DEXA 2016, Porto, Portugal, September 5-8, 2016, Proceedings, Part II, p. 185-200, 16 pp. 2016.
Publisher Springer International Publishing
Other information
Original language English
Type of outcome Proceedings paper
Field of Study 10201 Computer sciences, information science, bioinformatics
Country of publisher Portugal
Confidentiality degree is not subject to a state or trade secret
Publication form printed version "print"
Impact factor Impact factor: 0.402 in 2005
RIV identification code RIV/00216224:14330/16:00088102
Organization unit Faculty of Informatics
ISBN 978-3-319-44405-5
ISSN 0302-9743
Doi http://dx.doi.org/10.1007/978-3-319-44406-2_14
UT WoS 000389020200014
Keywords in English Stream processing; Similarity Search
Tags DISA, firank_B
Tags International impact, Reviewed
Changed by Changed by: RNDr. Pavel Šmerk, Ph.D., učo 3880. Changed: 13/5/2020 19:24.
Abstract
A lot of multimedia data are being created nowadays, which can only be searched by content since no searching metadata are available for them. To make the content search efficient, similarity indexing structures based on the metric-space model can be used. In our work, we focus on a scenario where the similarity search is used in the context of stream processing. In particular, there is a potentially infinite sequence (stream) of query objects, and a query needs to be executed for each of them. The goal is to maximize the throughput of processed queries while maintaining an acceptable delay. We propose an approach based on dynamic reordering of the incoming queries combined with caching of recent results. We were able to achieve up to 3.7 times higher throughput compared to the base case when no reordering and caching is used.
Links
GA16-18889S, research and development projectName: Analytika pro velká nestrukturovaná data (Acronym: Big Data Analytics for Unstructured Data)
Investor: Czech Science Foundation
PrintDisplayed: 25/4/2024 03:03