a 2023

Onedata4Sci: Life-science experimental datasets management system

ROŠINEC, Adrián, Tomáš SVOBODA, Tomáš RAČEK, Josef HANDL, Jozef SABO et. al.

Základní údaje

Originální název

Onedata4Sci: Life-science experimental datasets management system

Vydání

3D-BioInfo | ISCB 3D-SIG | ELIXIR Czech Republic Structural Bioinformatics, 2023

Další údaje

Jazyk

angličtina

Typ výsledku

Konferenční abstrakt

Stát vydavatele

Česká republika

Utajení

není předmětem státního či obchodního tajemství

Odkazy

Organizační jednotka

Přírodovědecká fakulta

Klíčová slova anglicky

datasets management;FAIR principles;life-science data;Onedata

Příznaky

Mezinárodní význam
Změněno: 16. 11. 2023 12:34, Mgr. Ing. Tomáš Svoboda

Anotace

V originále

In many scientific disciplines, especially life-sciences, expensive equipment is shared nowadays (like cryoEM devices, optical microscopes, …). The users – scientists request specific experiments from facilities, which perform the experiments on their behalf. The outcome of such an experiment is a dataset, which can get quite large in many cases (tens of gigabytes to terabytes). Data are then processed in order to draw scientific conclusions from their interpretation, and the results are published. However, today more and more emphasis is being placed on sharing the primary data itself - not only for the purpose of verification of scientific findings, but also for the re-use of the dataset to be used in future research. Automatic/manual annotation with appropriate metadata, storage or archiving of the dataset, assignment of DOIs, and subsequent publication of the dataset in disciplinary metadata catalogues or data repositories are necessary. To address these challenges, we design and develop a system Onedata4Sci, that automates acquiring, sharing, and publishing of data produced by specialized scientific devices. The proposed solution automatically makes experimental data available to the scientific community in a predefined way. It is particularly useful for on-the-fly processing in local or distant data centers, real-time analysis, or archiving to permanent storage according to defined quality of service (e.g., data distribution). The solution includes a web-based system that can be used to manage emerging datasets and annotate them with metadata (automatically extracted from the data produced by the instruments or manually entered by users according to defined templates). The system makes it easy to automate the individual steps of dataset preparation, checking compliance with FAIR principles, and publishing the dataset to the scientific community. The development of the system is guided by FAIR principles and national EOSC-CZ activities.

Návaznosti

744R1/2023, interní kód MU
Název: Portálové řešení pro správu a zpracování life-science datových sad a jejich metadat ukládaných v systému Onedata
Investor: CESNET, Portálové řešení pro správu a zpracování life-science datových sad a jejich metadat ukládaných v systému Onedata