WALSH, Brian, Sameh K MOHAMED and Vít NOVÁČEK. Biokg: A knowledge graph for relational learning on biological data. Online. In Mathieu d'Aquin, Stefan Dietze, Claudia Hauff, Edward Curry, Philippe Cudre Mauroux. Proceedings of the 29th ACM International Conference on Information & Knowledge Management. New YorkNYUnited States: Association for Computing Machinery, 2020, p. 3173-3180. ISBN 978-1-4503-6859-9. Available from: https://dx.doi.org/10.1145/3340531.3412776.
Other formats:   BibTeX LaTeX RIS
Basic information
Original name Biokg: A knowledge graph for relational learning on biological data
Authors WALSH, Brian, Sameh K MOHAMED and Vít NOVÁČEK.
Edition New YorkNYUnited States, Proceedings of the 29th ACM International Conference on Information & Knowledge Management, p. 3173-3180, 8 pp. 2020.
Publisher Association for Computing Machinery
Other information
Original language English
Type of outcome Proceedings paper
Field of Study 10201 Computer sciences, information science, bioinformatics
Country of publisher United States of America
Confidentiality degree is not subject to a state or trade secret
Publication form electronic version available online
WWW URL
Organization unit Faculty of Informatics
ISBN 978-1-4503-6859-9
Doi http://dx.doi.org/10.1145/3340531.3412776
UT WoS 000749561303020
Keywords in English knowledge graphs; biomedical knowledge integration
Tags International impact, Reviewed
Changed by Changed by: RNDr. Pavel Šmerk, Ph.D., učo 3880. Changed: 7/4/2024 23:34.
Abstract
Knowledge graphs became a popular means for modeling complex biological systems where they model the interactions between biological entities and their effects on the biological system. They also provide support for relational learning models which are known to provide highly scalable and accurate predictions of associations between biological entities. Despite the success of the combination of biological knowledge graph and relation learning models in biological predictive tasks, there is a lack of unified biological knowledge graph resources. This forced all current efforts and studies for applying a relational learning model on biological data to compile and build biological knowledge graphs from open biological databases. This process is often performed inconsistently across such efforts, especially in terms of choosing the original resources, aligning identifiers of the different databases, and assessing the quality of included data. To make relational learning on biomedical data more standardised and reproducible, we propose a new biological knowledge graph which provides a compilation of curated relational data from open biological databases in a unified format with common, interlinked identifiers. We also provide a new module for mapping identifiers and labels from different databases which can be used to align our knowledge graph with biological data from other heterogeneous sources. Finally, to illustrate the practical relevance of our work, we provide a set of benchmarks based on the presented data that can be used to train and assess the relational learning models in various tasks related to pathway and drug discovery.
PrintDisplayed: 1/9/2024 02:30