SILLITOE, Ian, Nicola BORDIN, Natalie DAWSON, Vaishali P. WAMAN, Paul ASHFORD, Harry M. SCHOLES, Camilla S.M. PANG, Laurel WOODRIDGE, Clemens RAUER, Neeladri SEN, Mahnaz ABBASIAN, Sean LE CORNU, Su Datt LAM, Karel BERKA, Ivana HUTAŘOVÁ VAŘEKOVÁ, Radka SVOBODOVÁ, Jon LEES and Christine A. ORENGO. CATH: increased structural coverage of functional space. Nucleic Acids Research. Oxford: Oxford University Press, 2021, vol. 49, D1, p. "D266"-"D273", 8 pp. ISSN 0305-1048. Available from: https://dx.doi.org/10.1093/nar/gkaa1079.
Other formats:   BibTeX LaTeX RIS
Basic information
Original name CATH: increased structural coverage of functional space
Authors SILLITOE, Ian, Nicola BORDIN, Natalie DAWSON, Vaishali P. WAMAN, Paul ASHFORD, Harry M. SCHOLES, Camilla S.M. PANG, Laurel WOODRIDGE, Clemens RAUER, Neeladri SEN, Mahnaz ABBASIAN, Sean LE CORNU, Su Datt LAM, Karel BERKA, Ivana HUTAŘOVÁ VAŘEKOVÁ (203 Czech Republic, belonging to the institution), Radka SVOBODOVÁ (203 Czech Republic, guarantor, belonging to the institution), Jon LEES and Christine A. ORENGO.
Edition Nucleic Acids Research, Oxford, Oxford University Press, 2021, 0305-1048.
Other information
Original language English
Type of outcome Article in a journal
Field of Study 10608 Biochemistry and molecular biology
Country of publisher United Kingdom of Great Britain and Northern Ireland
Confidentiality degree is not subject to a state or trade secret
WWW URL
Impact factor Impact factor: 19.160
RIV identification code RIV/00216224:14740/21:00120994
Organization unit Central European Institute of Technology
Doi http://dx.doi.org/10.1093/nar/gkaa1079
UT WoS 000608437800035
Keywords in English protein structures; CATH
Tags CF BioData, rivok
Tags International impact, Reviewed
Changed by Changed by: Mgr. Pavla Foltynová, Ph.D., učo 106624. Changed: 26/2/2022 14:29.
Abstract
CATH (https://www.cathdb.info) identifies domains in protein structures from wwPDB and classifies these into evolutionary superfamilies, thereby providing structural and functional annotations. There are two levels: CATH-B, a daily snapshot of the latest domain structures and superfamily assignments, and CATH+, with additional derived data, such as predicted sequence domains, and functionally coherent sequence subsets (Functional Families or FunFams). The latest CATH+ release, version 4.3, significantly increases coverage of structural and sequence data, with an addition of 65,351 fully-classified domains structures (+15%), providing 500 238 structural domains, and 151 million predicted sequence domains (+59%) assigned to 5481 superfamilies. The FunFam generation pipeline has been re-engineered to cope with the increased influx of data. Three times more sequences are captured in FunFams, with a concomitant increase in functional purity, information content and structural coverage. FunFam expansion increases the structural annotations provided for experimental GO terms (+59%). We also present CATH-FunVar web-pages displaying variations in protein sequences and their proximity to known or predicted functional sites. We present two case studies (1) putative cancer drivers and (2) SARS-CoV-2 proteins. Finally, we have improved links to and from CATH including SCOP, InterPro, Aquaria and 2DProt.
Links
EF16_013/0001777, research and development projectName: ELIXIR-CZ: Budování kapacit
LM2018131, research and development projectName: Česká národní infrastruktura pro biologická data (Acronym: ELIXIR-CZ)
Investor: Ministry of Education, Youth and Sports of the CR, Czech National Infrastructure for Biological Data
PrintDisplayed: 1/5/2024 07:43