A 2011

Employing Subsequence Matching in Audio Data Processing

VOLNÝ, Petr, David NOVÁK and Pavel ZEZULA

Basic information

Original name

Employing Subsequence Matching in Audio Data Processing

Authors

VOLNÝ, Petr (203 Czech Republic, guarantor, belonging to the institution), David NOVÁK (203 Czech Republic, belonging to the institution) and Pavel ZEZULA (203 Czech Republic, belonging to the institution)

Edition

Brno, FIMU-RS-2011-04, 2011

Publisher

Faculty of Informatics, Masaryk University, Brno

Other information

Language

English

Type of outcome

Audiovizuální tvorba

Field of Study

10201 Computer sciences, information science, bioinformatics

Country of publisher

Czech Republic

Confidentiality degree

není předmětem státního či obchodního tajemství

References:

RIV identification code

RIV/00216224:14330/11:00049994

Organization unit

Faculty of Informatics

Keywords in English

audio retrieval; subsequnce matching; similarity search; time series

Tags

Změněno: 18/4/2012 00:11, RNDr. Pavel Šmerk, Ph.D.

Abstract

V originále

We overview current problems of audio retrieval and time-series subsequence matching. We discuss the usage of subsequence matching approaches in audio data processing, especially in automatic speech recognition (ASR) area and we aim at improving performance of the retrieval process. To overcome the problems known from the time-series area like the occurrence of implementation bias and data bias we present a Subsequence Matching Framework as a tool for fast prototyping, building, and testing similarity search subsequence matching applications. The framework is build on top of MESSIF (Metric Similarity Search Implementation Framework) and thus the subsequence matching algorithms can exploit advanced similarity indexes in order to significantly increase their query processing performance. To prove our concept we provide a design of query-by-example spoken term detection type of application with the usage of phonetic posteriograms and subsequence matching approach.

Links

GPP202/10/P220, research and development project
Name: Podobnostní vyhledávání s konstantní škálovatelností (Acronym: SIM-SCALE)
Investor: Czech Science Foundation
VF20102014004, research and development project
Name: Multimediální analýza (Acronym: Multimediální analýza)
Investor: Ministry of the Interior of the CR