PROCHÁZKA, David, Terézia SLANINÁKOVÁ, Jaroslav OĽHA, Adrián ROŠINEC, Katarína GREŠOVÁ, Miriama JÁNOŠOVÁ, Jakub ČILLÍK, Jana PORUBSKÁ, Radka SVOBODOVÁ, Vlastislav DOHNAL and Matej ANTOL. AlphaFind: discover structure similarity across the proteome in AlphaFold DB. Nucleic acids research. Oxford: Oxford University Press, 2024, volume not specified, May 15, p. 1-5. ISSN 0305-1048. Available from: https://dx.doi.org/10.1093/nar/gkae397.
Basic information
Original name AlphaFind: discover structure similarity across the proteome in AlphaFold DB
Authors PROCHÁZKA, David, Terézia SLANINÁKOVÁ, Jaroslav OĽHA, Adrián ROŠINEC, Katarína GREŠOVÁ, Miriama JÁNOŠOVÁ, Jakub ČILLÍK, Jana PORUBSKÁ, Radka SVOBODOVÁ, Vlastislav DOHNAL and Matej ANTOL.
Edition Nucleic acids research, Oxford, Oxford University Press, 2024, 0305-1048.
Other information
Original language English
Type of outcome Article in a journal
Field of Study 10201 Computer sciences, information science, bioinformatics
Country of publisher United Kingdom of Great Britain and Northern Ireland
Confidentiality degree is not subject to a state or trade secret
Impact factor 14.900 in 2022
Organization unit Faculty of Informatics
Doi http://dx.doi.org/10.1093/nar/gkae397
Keywords in English AlphaFind;Protein similarity search;Structure-based retrieval;Protein tertiary structure;AlphaFold DB;Machine learning
Tags AlphaFind, CODA Research Group, DISA, learned index, LMI, protein similarity search
Tags International impact, Reviewed
Changed by doc. RNDr. Vlastislav Dohnal, Ph.D., učo 2952. Changed: 18/5/2024 21:22.
Abstract
AlphaFind is a web-based search engine that provides fast structure-based retrieval in the entire set of AlphaFold DB structures. Unlike other protein processing tools, AlphaFind is focused entirely on tertiary structure, automatically extracting the main 3D features of each protein chain and using a machine learning model to find the most similar structures. This indexing approach and the 3D feature extraction method used by AlphaFind have both demonstrated remarkable scalability to large datasets as well as to large protein structures. The web application itself has been designed with a focus on clarity and ease of use. The searcher accepts any valid UniProt ID, Protein Data Bank ID or gene symbol as input, and returns a set of similar protein chains from AlphaFold DB, including various similarity metrics between the query and each of the retrieved results. In addition to the main search functionality, the application provides 3D visualizations of protein structure superpositions in order to allow researchers to instantly analyze the structural similarity of the retrieved results. The AlphaFind web application is available online for free and without any registration at https://alphafind.fi.muni.cz.
Links
GF23-07040K, research and development project
Name: Naučené indexy pro podobností hledání (Learned Indexing for Similarity Searching)
Investor: Czech Science Foundation, Learned Indexing for Similarity Searching, Lead Agency
LM2023055, research and development project
Name: Česká národní infrastruktura pro biologická data (Czech National Infrastructure for Biological Data)
Investor: Ministry of Education, Youth and Sports of the CR, ELIXIR-CZ: Czech National Infrastructure for Biological Data
90254, large research infrastructures
Name: e-INFRA CZ II