APPASAMY, Sri Devan, John BERRISFORD, Romana GÁBOROVÁ, Sreenath NAIR, Stephen ANYANGO, Sergei GRUDININ, Mandar DESHPANDE, David ARMSTRONG, Ivanna PIDRUCHNA, Joseph I. J. ELLAWAY, Grisell Díaz LEINES, Deepti GUPTA, Deborah HARRUS, Mihaly VARADI and Sameer VELANKAR. Annotating Macromolecular Complexes in the Protein Data Bank: Improving the FAIRness of Structure Data. Scientific Data. Nature Portfolio, 2023, vol. 10, No 1, p. 1-13. ISSN 2052-4463. Available from: https://dx.doi.org/10.1038/s41597-023-02778-9.
Other formats:   BibTeX LaTeX RIS
Basic information
Original name Annotating Macromolecular Complexes in the Protein Data Bank: Improving the FAIRness of Structure Data
Authors APPASAMY, Sri Devan (guarantor), John BERRISFORD, Romana GÁBOROVÁ (703 Slovakia, belonging to the institution), Sreenath NAIR, Stephen ANYANGO, Sergei GRUDININ, Mandar DESHPANDE, David ARMSTRONG, Ivanna PIDRUCHNA, Joseph I. J. ELLAWAY, Grisell Díaz LEINES, Deepti GUPTA, Deborah HARRUS, Mihaly VARADI and Sameer VELANKAR.
Edition Scientific Data, Nature Portfolio, 2023, 2052-4463.
Other information
Original language English
Type of outcome Article in a journal
Field of Study 10201 Computer sciences, information science, bioinformatics
Country of publisher Germany
Confidentiality degree is not subject to a state or trade secret
WWW URL
Impact factor Impact factor: 9.800 in 2022
RIV identification code RIV/00216224:14740/23:00132419
Organization unit Central European Institute of Technology
Doi http://dx.doi.org/10.1038/s41597-023-02778-9
UT WoS 001116661600010
Keywords in English CRYSTAL-STRUCTURE; 20S PROTEASOME; MECHANISM; PDB; ASSEMBLIES; RECEPTOR; REVEALS; RESOURCE; YEAST; STATE
Tags CF BDMA, rivok
Tags International impact, Reviewed
Changed by Changed by: Mgr. Eva Dubská, učo 77638. Changed: 5/4/2024 09:14.
Abstract
Macromolecular complexes are essential functional units in nearly all cellular processes, and their atomic-level understanding is critical for elucidating and modulating molecular mechanisms. The Protein Data Bank (PDB) serves as the global repository for experimentally determined structures of macromolecules. Structural data in the PDB offer valuable insights into the dynamics, conformation, and functional states of biological assemblies. However, the current annotation practices lack standardised naming conventions for assemblies in the PDB, complicating the identification of instances representing the same assembly. In this study, we introduce a method leveraging resources external to PDB, such as the Complex Portal, UniProt and Gene Ontology, to describe assemblies and contextualise them within their biological settings accurately. Employing the proposed approach, we assigned standard names to over 90% of unique assemblies in the PDB and provided persistent identifiers for each assembly. This standardisation of assembly data enhances the PDB, facilitating a deeper understanding of macromolecular complexes. Furthermore, the data standardisation improves the PDB’s FAIR attributes, fostering more effective basic and translational research and scientific education.
Links
LM2018131, research and development projectName: Česká národní infrastruktura pro biologická data (Acronym: ELIXIR-CZ)
Investor: Ministry of Education, Youth and Sports of the CR, Czech National Infrastructure for Biological Data
PrintDisplayed: 2/7/2024 10:48