J 2024

DOME Registry: implementing community-wide recommendations for reporting supervised machine learning in biology

ATTAFI, Omar Abdelghani; Damiano CLEMENTEL; Konstantinos KYRITSIS; Emidio CAPRIOTTI; Gavin FARRELL et al.

Základní údaje

Originální název

DOME Registry: implementing community-wide recommendations for reporting supervised machine learning in biology

Autoři

ATTAFI, Omar Abdelghani; Damiano CLEMENTEL; Konstantinos KYRITSIS; Emidio CAPRIOTTI; Gavin FARRELL; Styliani-Christina FRAGKOULI; Leyla Jael CASTRO; Andras HATOS; Tom LENAERTS; Stanislav MAZURENKO; Soroush MOZAFFARI; Franco PRADELLI; Patrick RUCH; Castrense SAVOJARDO; Paola TURINA; Federico ZAMBELLI; Damiano PIOVESAN; Alexander Miguel MONZON; Fotis PSOMOPOULOS a Silvio C. E. TOSATTO

Vydání

GigaScience, Oxford University Press, 2024, 2047-217X

Další údaje

Jazyk

angličtina

Typ výsledku

Článek v odborném periodiku

Obor

10700 1.7 Other natural sciences

Stát vydavatele

Velká Británie a Severní Irsko

Utajení

není předmětem státního či obchodního tajemství

Odkazy

Impakt faktor

Impact factor: 3.900

Označené pro přenos do RIV

Ano

Kód RIV

RIV/00216224:14310/24:00138456

Organizační jednotka

Přírodovědecká fakulta

EID Scopus

Klíčová slova anglicky

machine learning; standards; transparency; reproducibility

Štítky

Příznaky

Mezinárodní význam, Recenzováno
Změněno: 28. 2. 2025 16:05, Mgr. Marie Novosadová Šípková, DiS.

Anotace

V originále

Supervised machine learning (ML) is used extensively in biology and deserves closer scrutiny. The Data Optimization Model Evaluation (DOME) recommendations aim to enhance the validation and reproducibility of ML research by establishing standards for key aspects such as data handling and processing, optimization, evaluation, and model interpretability. The recommendations help to ensure that key details are reported transparently by providing a structured set of questions. Here, we introduce the DOME registry (URL: registry.dome-ml.org), a database that allows scientists to manage and access comprehensive DOME-related information on published ML studies. The registry uses external resources like ORCID, APICURON, and the Data Stewardship Wizard to streamline the annotation process and ensure comprehensive documentation. By assigning unique identifiers and DOME scores to publications, the registry fosters a standardized evaluation of ML methods. Future plans include continuing to grow the registry through community curation, improving the DOME score definition and encouraging publishers to adopt DOME standards, and promoting transparency and reproducibility of ML in the life sciences.