NÁLEPA, Filip, Michal BATKO and Pavel ZEZULA. Towards High Similarity Search Throughput by Dynamic Query Reordering and Parallel Processing. In Mārīte Kirikova, Kjetil Nørvåg, George Angelos Papadopoulos. Advances in Databases and Information Systems : 21st European Conference, ADBIS 2017, Nicosia, Cyprus, September 24-27, 2017, Proceedings. Cham: Springer International Publishing, 2017, p. 262-277. ISBN 978-3-319-66916-8. Available from: https://dx.doi.org/10.1007/978-3-319-66917-5_18.
Other formats:   BibTeX LaTeX RIS
Basic information
Original name Towards High Similarity Search Throughput by Dynamic Query Reordering and Parallel Processing
Authors NÁLEPA, Filip (203 Czech Republic, guarantor, belonging to the institution), Michal BATKO (203 Czech Republic, belonging to the institution) and Pavel ZEZULA (203 Czech Republic, belonging to the institution).
Edition Cham, Advances in Databases and Information Systems : 21st European Conference, ADBIS 2017, Nicosia, Cyprus, September 24-27, 2017, Proceedings, p. 262-277, 16 pp. 2017.
Publisher Springer International Publishing
Other information
Original language English
Type of outcome Proceedings paper
Field of Study 10201 Computer sciences, information science, bioinformatics
Country of publisher Switzerland
Confidentiality degree is not subject to a state or trade secret
Publication form printed version "print"
Impact factor Impact factor: 0.402 in 2005
RIV identification code RIV/00216224:14330/17:00094944
Organization unit Faculty of Informatics
ISBN 978-3-319-66916-8
ISSN 0302-9743
Doi http://dx.doi.org/10.1007/978-3-319-66917-5_18
UT WoS 000463611400018
Keywords in English stream processing; similarity search; parallel processing
Tags DISA, firank_B
Tags International impact, Reviewed
Changed by Changed by: RNDr. Pavel Šmerk, Ph.D., učo 3880. Changed: 13/5/2020 19:24.
Abstract
Current era of digital data explosion calls for employment of content-based similarity search techniques since traditional searchable metadata like annotations are not always available. In our work, we focus on a scenario where the similarity search is used in the context of stream processing, which is one of the suitable approaches to deal with huge amounts of data. Our goal is to maximize the throughput of processed queries while a slight delay is acceptable. We extend our previously published technique that dynamically reorders the incoming queries in order to use our caching mechanism more effectively. The extension lies in adoption of a parallel computing environment which allows us to process multiple queries simultaneously.
Links
GA16-18889S, research and development projectName: Analytika pro velká nestrukturovaná data (Acronym: Big Data Analytics for Unstructured Data)
Investor: Czech Science Foundation
PrintDisplayed: 16/7/2024 14:53