J 2017

Speeding up the multimedia feature extraction: a comparative study on the big data approach

MERA PÉREZ, David, Michal BATKO and Pavel ZEZULA

Basic information

Original name

Speeding up the multimedia feature extraction: a comparative study on the big data approach

Authors

MERA PÉREZ, David (724 Spain), Michal BATKO (203 Czech Republic, guarantor, belonging to the institution) and Pavel ZEZULA (203 Czech Republic, belonging to the institution)

Edition

Multimedia Tools and Applications, Springer, 2017, 1380-7501

Other information

Language

English

Type of outcome

Článek v odborném periodiku

Field of Study

10201 Computer sciences, information science, bioinformatics

Country of publisher

United States of America

Confidentiality degree

není předmětem státního či obchodního tajemství

References:

Impact factor

Impact factor: 1.541

RIV identification code

RIV/00216224:14330/17:00094702

Organization unit

Faculty of Informatics

UT WoS

000397278400062

Keywords in English

Big data;Image feature extraction;Map Reduce;Apache Storm;Apache Spark;Grid computing

Tags

International impact, Reviewed
Změněno: 27/4/2018 10:32, RNDr. Pavel Šmerk, Ph.D.

Abstract

V originále

The current explosion of multimedia data is significantly increasing the amount of potential knowledge. However, to get to the actual information requires to apply novel content-based techniques which in turn require time consuming extraction of indexable features from the raw data. In order to deal with large datasets, this task needs to be parallelized. However, there are multiple approaches to choose from, each with its own benefits and drawbacks. There are also several parameters that must be taken into consideration, for example the amount of available resources, the size of the data and their availability. In this paper, we empirically evaluate and compare approaches based on Apache Hadoop, Apache Storm, Apache Spark, and Grid computing, employed to distribute the extraction task over an outsourced and distributed infrastructure.

Links

GBP103/12/G084, research and development project
Name: Centrum pro multi-modální interpretaci dat velkého rozsahu
Investor: Czech Science Foundation