D 2018

Continuous Time-Dependent kNN Join by Binary Sketches

NÁLEPA, Filip, Michal BATKO and Pavel ZEZULA

Basic information

Original name

Continuous Time-Dependent kNN Join by Binary Sketches

Authors

NÁLEPA, Filip (203 Czech Republic, guarantor, belonging to the institution), Michal BATKO (203 Czech Republic, belonging to the institution) and Pavel ZEZULA (203 Czech Republic, belonging to the institution)

Edition

New York, IDEAS 2018 : 22nd International Database Engineering & Applications Symposium, June 18-20, 2018, Villa San Giovanni, Italy, p. 64-73, 10 pp. 2018

Publisher

ACM

Other information

Language

English

Type of outcome

Stať ve sborníku

Field of Study

10201 Computer sciences, information science, bioinformatics

Country of publisher

United States of America

Confidentiality degree

není předmětem státního či obchodního tajemství

Publication form

electronic version available online

RIV identification code

RIV/00216224:14330/18:00100951

Organization unit

Faculty of Informatics

ISBN

978-1-4503-6527-7

Keywords in English

continuous kNN similarity join; time-dependent similarity; binary sketches

Tags

International impact, Reviewed
Změněno: 30/4/2019 07:40, RNDr. Pavel Šmerk, Ph.D.

Abstract

V originále

An important functionality of current social applications is real-time recommendation, which is responsible for suggesting relevant published data to the users based on their preferences. By representing the users and the published data in a metric space, each user can be recommended with their k nearest neighbors among the published data. We consider the scenario when the relevance of a published data item to a user decreases as the data gets older, i.e., a time-dependent distance function is applied. We define the problem as the continuous time-dependent kNN join and provide a solution to a broad range of time-dependent functions. In addition, we propose a binary sketch-based approximation technique used to speed up the join evaluation by replacing expensive metric distance computations with cheap Hamming distances.

Links

GA16-18889S, research and development project
Name: Analytika pro velká nestrukturovaná data (Acronym: Big Data Analytics for Unstructured Data)
Investor: Czech Science Foundation