J 2024

Speech production under stress for machine learning : multimodal dataset of 79 cases and 8 signals

PEŠÁN, Jan; Vojtěch JUŘÍK; Alexandra RUŽIČKOVÁ; Vojtěch SVOBODA; Oto JANOUŠEK et. al.

Basic information

Original name

Speech production under stress for machine learning : multimodal dataset of 79 cases and 8 signals

Authors

PEŠÁN, Jan; Vojtěch JUŘÍK; Alexandra RUŽIČKOVÁ ORCID; Vojtěch SVOBODA; Oto JANOUŠEK; Andrea NĚMCOVÁ; Hana BOJANOVSKÁ; Jasmína ALDABAGHOVÁ; Filip KYSLÍK ORCID; Kateřina VODIČKOVÁ; Adéla SODOMOVÁ; Patrik BARTYS ORCID; Peter CHUDÝ and Jan ČERNOCKÝ

Edition

Scientific Data, London, Springer Nature, 2024, 2052-4463

Other information

Language

English

Type of outcome

Article in a journal

Field of Study

50101 Psychology

Country of publisher

United Kingdom of Great Britain and Northern Ireland

Confidentiality degree

is not subject to a state or trade secret

References:

Impact factor

Impact factor: 6.900

RIV identification code

RIV/00216224:14210/24:00137626

Organization unit

Faculty of Arts

UT WoS

001353330000007

EID Scopus

2-s2.0-85209350842

Keywords in English

Speech production; stress; machine learning; dataset

Tags

Tags

International impact, Reviewed
Changed: 11/3/2025 08:47, Mgr. Pavla Martinková

Abstract

In the original language

Early identification of cognitive or physical overload is critical in fields where human decision making matters when preventing threats to safety and property. Pilots, drivers, surgeons, and operators of nuclear plants are among those affected by this challenge, as acute stress can impair their cognition. In this context, the significance of paralinguistic automatic speech processing increases for early stress detection. The intensity, intonation, and cadence of an utterance are examples of paralinguistic traits that determine the meaning of a sentence and are often lost in the verbatim transcript. To address this issue, tools are being developed to recognize paralinguistic traits effectively. However, a data bottleneck still exists in the training of paralinguistic speech traits, and the lack of high-quality reference data for the training of artificial systems persists. Regarding this, we present an original empirical dataset collected using the BESST experimental protocol for capturing speech signals under induced stress. With this data, our aim is to promote the development of pre-emptive intervention systems based on stress estimation from speech.

Links

MUNI/A/1519/2023, interní kód MU
Name: Možnosti aplikačního využití poznatků základního psychologického výzkumu
Investor: Masaryk University