k 2024

Integration, Cataloguing and FAIRification of Biobank and Clinical Data

KACOVÁ, Radoslava, Zdenka DUDOVÁ, Tomáš HOUFEK, Radovan TOMÁŠIK, Roman HRSTKA et. al.

Základní údaje

Originální název

Integration, Cataloguing and FAIRification of Biobank and Clinical Data

Autoři

KACOVÁ, Radoslava (703 Slovensko, garant, domácí), Zdenka DUDOVÁ (203 Česká republika), Tomáš HOUFEK (203 Česká republika), Radovan TOMÁŠIK (703 Slovensko) a Roman HRSTKA (203 Česká republika)

Vydání

Europe Biobank Week Congress 2024, 2024

Další údaje

Jazyk

angličtina

Typ výsledku

Prezentace na konferencích

Obor

10200 1.2 Computer and information sciences

Stát vydavatele

Rakousko

Utajení

není předmětem státního či obchodního tajemství

Organizační jednotka

Fakulta informatiky

Klíčová slova anglicky

FAIRification; data management; sensitive data; data reusability

Příznaky

Mezinárodní význam
Změněno: 24. 5. 2024 10:40, Mgr. Radoslava Kacová

Anotace

V originále

Sequencing of somatic DNA generates a vast amount of data with the potential for secondary usage in research, however, sharing this data for research purposes is challenging. This is often due to inadequate description, management, and the absence of a platform for presenting its existence. Associated issues include high capacity demands for storing large volumes of data and ensuring their security and long-term sustainability. Data is not only exposed to the risk of complete loss due to the degradation of storage media, often stored unprofessionally, but their retrospective retrieval and linking to other data for the same patient are complicated due to inconsistently used identifiers and labels. This study aimed to design a method for managing sensitive hospital data and establish the foundations of an integration centre that ensures data compliance with the so-called FAIR principles. These principles ensure that data is findable, accessible, interoperable, and ultimately reusable. The output of the study is a data management proposal, within which a metadata catalogue was created as a unified platform for exposing metadata, and a cloud environment was designed for storing sensitive data using the SensitiveCloud service. The purpose of the poster is to inform about the newly established data pipeline, which can serve as inspiration for biobanks wishing to offer their requestors associated data (such as sequencing, radiological, etc.) alongside biological samples. These data have the potential to enrich research and provide interesting insights into the world of medicine.