2024
Speech production under stress for machine learning : multimodal dataset of 79 cases and 8 signals
PEŠÁN, Jan; Vojtěch JUŘÍK; Alexandra RUŽIČKOVÁ; Vojtěch SVOBODA; Oto JANOUŠEK et. al.Basic information
Original name
Speech production under stress for machine learning : multimodal dataset of 79 cases and 8 signals
Authors
PEŠÁN, Jan; Vojtěch JUŘÍK; Alexandra RUŽIČKOVÁ ORCID; Vojtěch SVOBODA; Oto JANOUŠEK; Andrea NĚMCOVÁ; Hana BOJANOVSKÁ; Jasmína ALDABAGHOVÁ; Filip KYSLÍK ORCID; Kateřina VODIČKOVÁ; Adéla SODOMOVÁ; Patrik BARTYS ORCID; Peter CHUDÝ and Jan ČERNOCKÝ
Edition
Scientific Data, London, Springer Nature, 2024, 2052-4463
Other information
Language
English
Type of outcome
Article in a journal
Field of Study
50101 Psychology
Country of publisher
United Kingdom of Great Britain and Northern Ireland
Confidentiality degree
is not subject to a state or trade secret
References:
Impact factor
Impact factor: 6.900
RIV identification code
RIV/00216224:14210/24:00137626
Organization unit
Faculty of Arts
UT WoS
001353330000007
EID Scopus
2-s2.0-85209350842
Keywords in English
Speech production; stress; machine learning; dataset
Tags
Tags
International impact, Reviewed
Changed: 11/3/2025 08:47, Mgr. Pavla Martinková
Abstract
In the original language
Early identification of cognitive or physical overload is critical in fields where human decision making matters when preventing threats to safety and property. Pilots, drivers, surgeons, and operators of nuclear plants are among those affected by this challenge, as acute stress can impair their cognition. In this context, the significance of paralinguistic automatic speech processing increases for early stress detection. The intensity, intonation, and cadence of an utterance are examples of paralinguistic traits that determine the meaning of a sentence and are often lost in the verbatim transcript. To address this issue, tools are being developed to recognize paralinguistic traits effectively. However, a data bottleneck still exists in the training of paralinguistic speech traits, and the lack of high-quality reference data for the training of artificial systems persists. Regarding this, we present an original empirical dataset collected using the BESST experimental protocol for capturing speech signals under induced stress. With this data, our aim is to promote the development of pre-emptive intervention systems based on stress estimation from speech.
Links
| MUNI/A/1519/2023, interní kód MU |
|