R 2013

PPP-Codes: Similarity Search Index

NOVÁK, David

Basic information

Original name

PPP-Codes: Similarity Search Index

Name in Czech

Podobnostní index PPP-Codes

Authors

NOVÁK, David (203 Czech Republic, guarantor, belonging to the institution)

Edition

2013

Other information

Language

English

Type of outcome

Software

Field of Study

10201 Computer sciences, information science, bioinformatics

Country of publisher

Czech Republic

Confidentiality degree

Not subject to state or trade secret

References:

RIV identification code

RIV/00216224:14330/13:00065750

Organization unit

Faculty of Informatics

Keywords (in Czech)

PPP-Codes; podobnostní vyhledávání; metrický prostor; index

Keywords in English

PPP-Codes; similarity search; metric space; index

Technical parameters

Use of the software is subject to the terms of the GNU GPL license. Contact person: David Novák, Faculty of Informatics, Masaryk University, Botanická 68a, Brno, 602 00, david.novak@fi.muni.cz, tel. 549495062

Tags

International impact
Changed: 17/7/2014 14:48, RNDr. David Novák, Ph.D.

Abstract

In the original

Many current applications, such as biometric systems, need to organize data according to the mutual similarity of data objects. A typical general strategy for retrieving the objects most similar to a given example is to access a candidate set of objects and then refine it; the overall search cost (and search time) then typically correlates with the candidate set size. The PPP-Codes index provides a generic approach that combines several independent indexes by aggregating their candidate sets in such a way that the resulting candidate set can be one or two orders of magnitude smaller while preserving answer quality. This reduction comes at the expense of a computationally more expensive ranking algorithm, but our experiments on various datasets indicate that the overall gain can be significant, especially for data types with large objects or an expensive similarity function, such as those found in biometric systems.
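
The aggregate-then-refine strategy described in the abstract can be illustrated with a minimal sketch. This record does not state the aggregation rule, so the median-rank merge below is an assumption, and the function names aggregate_candidates and refine are hypothetical and not part of the PPP-Codes software. Each independent index is assumed to return an ordered list of object identifiers, best match first.

```python
from statistics import median


def aggregate_candidates(rankings, candidate_size):
    """Merge per-index rankings into one small candidate set.

    Assumption: objects are ordered by their median rank across indexes;
    the actual PPP-Codes rule may differ.
    """
    # An object missing from a ranking gets a rank one past the longest list.
    penalty = max(len(r) for r in rankings) + 1
    positions = [
        {obj: rank for rank, obj in enumerate(r, start=1)} for r in rankings
    ]
    all_objects = set().union(*rankings)

    def aggregated_rank(obj):
        return median(pos.get(obj, penalty) for pos in positions)

    # Ties are broken by object identifier to keep the result deterministic.
    ordered = sorted(all_objects, key=lambda o: (aggregated_rank(o), o))
    return ordered[:candidate_size]


def refine(query, candidates, data, distance, k):
    """Evaluate the (possibly expensive) distance only on the candidates."""
    return sorted(candidates, key=lambda o: distance(query, data[o]))[:k]


# Three hypothetical indexes, each returning its own ranking.
rankings = [
    ["a", "b", "c", "d"],
    ["b", "a", "d", "e"],
    ["b", "c", "a", "f"],
]
candidates = aggregate_candidates(rankings, candidate_size=3)

# Toy data: objects are numbers and the distance is the absolute difference.
data = {"a": 1.0, "b": 5.0, "c": 2.0}
answer = refine(0.0, candidates, data, lambda q, x: abs(q - x), k=2)
```

In this sketch the distance function is called only for the three aggregated candidates rather than for the six objects returned by the individual indexes, which is where a smaller candidate set saves time when objects are large or the similarity function is expensive.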

Links

VG20122015073, research and development project
Name: Efektivní vyhledávání v rozsáhlých biometrických datech (in English: Efficient Searching in Large Biometric Data; Acronym: EFBIO)
Investor: Ministry of the Interior of the CR