J 2025

Enhanced metabolomic predictions using concept drift analysis: identification and correction of confounding factors

SCHWARZEROVA, Jana; Dominika OLESOVA; Katerina JURECKOVA; Ales KVASNICKA; Ales KOSTOVAL et al.

Základní údaje

Originální název

Enhanced metabolomic predictions using concept drift analysis: identification and correction of confounding factors

Autoři

SCHWARZEROVA, Jana; Dominika OLESOVA; Katerina JURECKOVA; Ales KVASNICKA; Ales KOSTOVAL; David FRIEDECKY; Jiri SEKORA; Jitka POMENKOVA; Valentýna PROVAZNÍK ORCID; Lubomír POPELÍNSKÝ a Wolfram WECKWERTH

Vydání

Bioinformatics Advances, Oxford, Oxford University Press, 2025, 2635-0041

Další údaje

Jazyk

angličtina

Typ výsledku

Článek v odborném periodiku

Obor

10201 Computer sciences, information science, bioinformatics

Stát vydavatele

Velká Británie a Severní Irsko

Utajení

není předmětem státního či obchodního tajemství

Odkazy

Impakt faktor

Impact factor: 2.800 v roce 2024

Označené pro přenos do RIV

Ano

Kód RIV

RIV/00216224:14110/25:00143831

Organizační jednotka

Lékařská fakulta

EID Scopus

Klíčová slova anglicky

metabolomics; concept drift analysis; predictive modeling; confounding factors; model correction

Štítky

Příznaky

Mezinárodní význam, Recenzováno
Změněno: 2. 3. 2026 13:36, Mgr. Tereza Miškechová

Anotace

V originále

Motivation The increasing use of big data and optimized prediction methods in metabolomics requires techniques aligned with biological assumptions to improve early symptom diagnosis. One major challenge in predictive data analysis is handling confounding factors-variables influencing predictions but not directly included in the analysis.Results Detecting and correcting confounding factors enhances prediction accuracy, reducing false negatives that contribute to diagnostic errors. This study reviews concept drift detection methods in metabolomic predictions and selects the most appropriate ones. We introduce a new implementation of concept drift analysis in predictive classifiers using metabolomics data. Known confounding factors were confirmed, validating our approach and aligning it with conventional methods. Additionally, we identified potential confounding factors that may influence biomarker analysis, which could introduce bias and impact model performance.Availability and implementation Based on biological assumptions supported by detected concept drift, these confounding factors were incorporated into correction of prediction algorithms to enhance their accuracy. The proposed methodology has been implemented in Semi-Automated Pipeline using Concept Drift Analysis for improving Metabolomic Predictions (SAPCDAMP), an open-source workflow available at https://github.com/JanaSchwarzerova/SAPCDAMP.