2025
Enhanced metabolomic predictions using concept drift analysis: identification and correction of confounding factors
SCHWARZEROVA, Jana; Dominika OLESOVA; Katerina JURECKOVA; Ales KVASNICKA; Ales KOSTOVAL et al.Základní údaje
Originální název
Enhanced metabolomic predictions using concept drift analysis: identification and correction of confounding factors
Autoři
SCHWARZEROVA, Jana; Dominika OLESOVA; Katerina JURECKOVA; Ales KVASNICKA; Ales KOSTOVAL; David FRIEDECKY; Jiri SEKORA; Jitka POMENKOVA; Valentýna PROVAZNÍK ORCID; Lubomír POPELÍNSKÝ a Wolfram WECKWERTH
Vydání
Bioinformatics Advances, Oxford, Oxford University Press, 2025, 2635-0041
Další údaje
Jazyk
angličtina
Typ výsledku
Článek v odborném periodiku
Obor
10201 Computer sciences, information science, bioinformatics
Stát vydavatele
Velká Británie a Severní Irsko
Utajení
není předmětem státního či obchodního tajemství
Odkazy
Impakt faktor
Impact factor: 2.800 v roce 2024
Označené pro přenos do RIV
Ano
Kód RIV
RIV/00216224:14110/25:00143831
Organizační jednotka
Lékařská fakulta
UT WoS
EID Scopus
Klíčová slova anglicky
metabolomics; concept drift analysis; predictive modeling; confounding factors; model correction
Štítky
Příznaky
Mezinárodní význam, Recenzováno
Změněno: 2. 3. 2026 13:36, Mgr. Tereza Miškechová
Anotace
V originále
Motivation The increasing use of big data and optimized prediction methods in metabolomics requires techniques aligned with biological assumptions to improve early symptom diagnosis. One major challenge in predictive data analysis is handling confounding factors-variables influencing predictions but not directly included in the analysis.Results Detecting and correcting confounding factors enhances prediction accuracy, reducing false negatives that contribute to diagnostic errors. This study reviews concept drift detection methods in metabolomic predictions and selects the most appropriate ones. We introduce a new implementation of concept drift analysis in predictive classifiers using metabolomics data. Known confounding factors were confirmed, validating our approach and aligning it with conventional methods. Additionally, we identified potential confounding factors that may influence biomarker analysis, which could introduce bias and impact model performance.Availability and implementation Based on biological assumptions supported by detected concept drift, these confounding factors were incorporated into correction of prediction algorithms to enhance their accuracy. The proposed methodology has been implemented in Semi-Automated Pipeline using Concept Drift Analysis for improving Metabolomic Predictions (SAPCDAMP), an open-source workflow available at https://github.com/JanaSchwarzerova/SAPCDAMP.