SVEC, Jan, Josef V. PSUTKA, Jan TRMAL, Lubos SMIDL, Pavel IRCING and Jan SEDMIDUBSKÝ. ON THE USE OF GRAPHEME MODELS FOR SEARCHING IN LARGE SPOKEN ARCHIVES. In 43rd IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2018). Neuveden: IEEE Computer Society, 2018, p. 6259-6263. ISBN 978-1-5386-4658-8. Available from: https://dx.doi.org/10.1109/ICASSP.2018.8461774.
Other formats:   BibTeX LaTeX RIS
Basic information
Original name ON THE USE OF GRAPHEME MODELS FOR SEARCHING IN LARGE SPOKEN ARCHIVES
Authors SVEC, Jan (203 Czech Republic), Josef V. PSUTKA (203 Czech Republic), Jan TRMAL (203 Czech Republic), Lubos SMIDL (203 Czech Republic), Pavel IRCING (203 Czech Republic) and Jan SEDMIDUBSKÝ (203 Czech Republic, guarantor, belonging to the institution).
Edition Neuveden, 43rd IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2018), p. 6259-6263, 5 pp. 2018.
Publisher IEEE Computer Society
Other information
Original language English
Type of outcome Proceedings paper
Field of Study 10201 Computer sciences, information science, bioinformatics
Confidentiality degree is not subject to a state or trade secret
Publication form storage medium (CD, DVD, flash disk)
RIV identification code RIV/00216224:14330/18:00100802
Organization unit Faculty of Informatics
ISBN 978-1-5386-4658-8
ISSN 1520-6149
Doi http://dx.doi.org/10.1109/ICASSP.2018.8461774
UT WoS 000446384606084
Keywords in English spoken term detection; speech indexing; grapheme-based speech recognition; keyword search
Tags DISA, firank_A
Tags International impact, Reviewed
Changed by Changed by: RNDr. Pavel Šmerk, Ph.D., učo 3880. Changed: 13/5/2020 19:39.
Abstract
This paper explores the possibility to use grapheme-based word and sub-word models in the task of spoken term detection (STD). The usage of grapheme models eliminates the need for expert-prepared pronunciation lexicons (which are often far from complete) and/or trainable grapheme-to-phoneme (G2P) algorithms that are frequently rather inaccurate, especially for rare words (words coming from a different language). Moreover, the G2P conversion of the search terms that need to be performed on-line can substantially increase the response time of the STD system. Our results show that using various grapheme-based models, we can achieve STD performance (measured in terms of ATWV) comparable with phoneme-based models but without the additional burden of G2P conversion.
Links
GBP103/12/G084, research and development projectName: Centrum pro multi-modální interpretaci dat velkého rozsahu
Investor: Czech Science Foundation
PrintDisplayed: 28/7/2024 09:16