PLASS, Markus, Rudolf WITTNER, Petr HOLUB, Francesca FREXIA, Cecilia MASCIA, Matej GALLO, Heimo MÜLLER and Jörg GEIGER. Provenance of specimen and data – A prerequisite for AI development in computational pathology. NEW BIOTECHNOLOGY. NETHERLANDS: ELSEVIER, 2023, vol. 78, DEC, p. 22-28. ISSN 1871-6784. Available from: https://dx.doi.org/10.1016/j.nbt.2023.09.006.
Other formats:   BibTeX LaTeX RIS
Basic information
Original name Provenance of specimen and data – A prerequisite for AI development in computational pathology
Authors PLASS, Markus, Rudolf WITTNER (703 Slovakia, belonging to the institution), Petr HOLUB (203 Czech Republic, guarantor, belonging to the institution), Francesca FREXIA, Cecilia MASCIA, Matej GALLO (703 Slovakia, belonging to the institution), Heimo MÜLLER and Jörg GEIGER.
Edition NEW BIOTECHNOLOGY, NETHERLANDS, ELSEVIER, 2023, 1871-6784.
Other information
Original language English
Type of outcome Article in a journal
Field of Study 10201 Computer sciences, information science, bioinformatics
Country of publisher Germany
Confidentiality degree is not subject to a state or trade secret
WWW URL
Impact factor Impact factor: 5.400 in 2022
RIV identification code RIV/00216224:14610/23:00131767
Organization unit Institute of Computer Science
Doi http://dx.doi.org/10.1016/j.nbt.2023.09.006
UT WoS 001084774500001
Keywords in English Artificial intelligence; Provenance; Biological material; Traceability
Tags International impact, Reviewed
Changed by Changed by: RNDr. Pavel Šmerk, Ph.D., učo 3880. Changed: 8/4/2024 16:28.
Abstract
AI development in biotechnology relies on high-quality data to train and validate algorithms. The FAIR principles (Findable, Accessible, Interoperable, and Reusable) and regulatory frameworks such as the In Vitro Diagnostic Regulation (IVDR) and the Medical Device Regulation (MDR) specify requirements on specimen and data provenance to ensure the quality and traceability of data used in AI development. In this paper, a framework is presented for recording and publishing provenance information to meet these requirements. The framework is based on the use of standardized models and protocols, such as the W3C PROV model and the ISO 23494 series, to capture and record provenance information at various stages of the data generation and analysis process. The framework and use case illustrate the role of provenance information in supporting the development of high-quality AI algorithms in biotechnology. Finally, the principles of the framework are illustrated in a simple computational pathology use case, showing how specimen and data provenance can be used in the development and documentation of an AI algorithm. The use case demonstrates the importance of managing and integrating distributed provenance information and highlights the complex task of considering factors such as semantic interoperability, confidentiality, and the verification of authenticity and integrity.
Links
MUNI/A/1339/2022, interní kód MUName: Rozvoj technik pro zpracování dat pro podporu vyhledávání, analýz a vizualizací rozsáhlých datových souborů s využitím umělé inteligence
Investor: Masaryk University, Development of data processing techniques to support search, analysis and visualization of large datasets using artificial intelligence
PrintDisplayed: 14/5/2024 15:52