VOLNÝ, Petr, David NOVÁK and Pavel ZEZULA. Employing Subsequence Matching in Audio Data Processing. Brno: Faculty of Informatics, Masaryk University, Brno, 2011. FIMU-RS-2011-04.
Basic information
Original name Employing Subsequence Matching in Audio Data Processing
Authors VOLNÝ, Petr (203 Czech Republic, guarantor, belonging to the institution), David NOVÁK (203 Czech Republic, belonging to the institution) and Pavel ZEZULA (203 Czech Republic, belonging to the institution).
Edition Brno, FIMU-RS-2011-04, 2011.
Publisher Faculty of Informatics, Masaryk University, Brno
Other information
Original language English
Type of outcome Audiovisual works
Field of Study 10201 Computer sciences, information science, bioinformatics
Country of publisher Czech Republic
Confidentiality degree is not subject to a state or trade secret
WWW full text
RIV identification code RIV/00216224:14330/11:00049994
Organization unit Faculty of Informatics
Keywords in English audio retrieval; subsequence matching; similarity search; time series
Tags DISA
Changed by RNDr. Pavel Šmerk, Ph.D., university ID (učo) 3880. Changed: 18/4/2012 00:11.
Abstract
We give an overview of current problems in audio retrieval and time-series subsequence matching. We discuss the use of subsequence matching approaches in audio data processing, especially in the area of automatic speech recognition (ASR), and we aim to improve the performance of the retrieval process. To overcome problems known from the time-series area, such as implementation bias and data bias, we present a Subsequence Matching Framework as a tool for fast prototyping, building, and testing of similarity search subsequence matching applications. The framework is built on top of MESSIF (Metric Similarity Search Implementation Framework), so subsequence matching algorithms can exploit advanced similarity indexes to significantly increase their query processing performance. To prove our concept, we provide a design of a query-by-example spoken term detection application that uses phonetic posteriorgrams and a subsequence matching approach.
Links
GPP202/10/P220, research and development project. Name: Podobnostní vyhledávání s konstantní škálovatelností (Similarity Search with Constant Scalability; Acronym: SIM-SCALE)
Investor: Czech Science Foundation
VF20102014004, research and development project. Name: Multimediální analýza (Multimedia Analysis; Acronym: Multimediální analýza)
Investor: Ministry of the Interior of the CR