Towards High Similarity Search Throughput by Dynamic Query
Reordering and Parallel Processing

NÁLEPA, Filip, Michal BATKO and Pavel ZEZULA. Towards High Similarity Search Throughput by Dynamic Query Reordering and Parallel Processing. In Mārīte Kirikova, Kjetil Nørvåg, George Angelos Papadopoulos. Advances in Databases and Information Systems : 21st European Conference, ADBIS 2017, Nicosia, Cyprus, September 24-27, 2017, Proceedings. Cham: Springer International Publishing, 2017, p. 262-277. ISBN 978-3-319-66916-8. Available from: https://dx.doi.org/10.1007/978-3-319-66917-5_18.

Other formats: BibTeX LaTeX RIS

Basic information
Original name	Towards High Similarity Search Throughput by Dynamic Query Reordering and Parallel Processing
Authors	NÁLEPA, Filip (203 Czech Republic, guarantor, belonging to the institution), Michal BATKO (203 Czech Republic, belonging to the institution) and Pavel ZEZULA (203 Czech Republic, belonging to the institution).
Edition	Cham, Advances in Databases and Information Systems : 21st European Conference, ADBIS 2017, Nicosia, Cyprus, September 24-27, 2017, Proceedings, p. 262-277, 16 pp. 2017.
Publisher	Springer International Publishing

Other information
Original language	English
Type of outcome	Proceedings paper
Field of Study	10201 Computer sciences, information science, bioinformatics
Country of publisher	Switzerland
Confidentiality degree	is not subject to a state or trade secret
Publication form	printed version "print"
Impact factor	Impact factor: 0.402 in 2005
RIV identification code	RIV/00216224:14330/17:00094944
Organization unit	Faculty of Informatics
ISBN	978-3-319-66916-8
ISSN	0302-9743
Doi	http://dx.doi.org/10.1007/978-3-319-66917-5_18
UT WoS	000463611400018
Keywords in English	stream processing; similarity search; parallel processing
Tags	DISA, firank_B
Tags	International impact, Reviewed
Changed by	Changed by: RNDr. Pavel Šmerk, Ph.D., učo 3880. Changed: 13/5/2020 19:24.

Abstract

Current era of digital data explosion calls for employment of content-based similarity search techniques since traditional searchable metadata like annotations are not always available. In our work, we focus on a scenario where the similarity search is used in the context of stream processing, which is one of the suitable approaches to deal with huge amounts of data. Our goal is to maximize the throughput of processed queries while a slight delay is acceptable. We extend our previously published technique that dynamically reorders the incoming queries in order to use our caching mechanism more effectively. The extension lies in adoption of a parallel computing environment which allows us to process multiple queries simultaneously.

Links
GA16-18889S, research and development project	Name: Analytika pro velká nestrukturovaná data (Acronym: Big Data Analytics for Unstructured Data)
GA16-18889S, research and development project	Investor: Czech Science Foundation

PrintDisplayed: 16/7/2024 14:53

Towards High Similarity Search Throughput by Dynamic Query Reordering and Parallel Processing

Other applications