J 2021

CATH: increased structural coverage of functional space

SILLITOE, Ian; Nicola BORDIN; Natalie DAWSON; Vaishali P. WAMAN; Paul ASHFORD et. al.

Základní údaje

Originální název

CATH: increased structural coverage of functional space

Autoři

SILLITOE, Ian; Nicola BORDIN; Natalie DAWSON; Vaishali P. WAMAN; Paul ASHFORD; Harry M. SCHOLES; Camilla S.M. PANG; Laurel WOODRIDGE; Clemens RAUER; Neeladri SEN; Mahnaz ABBASIAN; Sean LE CORNU; Su Datt LAM; Karel BERKA; Ivana HUTAŘOVÁ VAŘEKOVÁ ORCID; Radka SVOBODOVÁ; Jon LEES a Christine A. ORENGO

Vydání

Nucleic Acids Research, Oxford, Oxford University Press, 2021, 0305-1048

Další údaje

Jazyk

angličtina

Typ výsledku

Článek v odborném periodiku

Obor

10608 Biochemistry and molecular biology

Stát vydavatele

Velká Británie a Severní Irsko

Utajení

není předmětem státního či obchodního tajemství

Odkazy

Impakt faktor

Impact factor: 19.160

Kód RIV

RIV/00216224:14740/21:00120994

Organizační jednotka

Středoevropský technologický institut

UT WoS

000608437800035

EID Scopus

2-s2.0-85099427458

Klíčová slova anglicky

protein structures; CATH

Štítky

Příznaky

Mezinárodní význam, Recenzováno
Změněno: 31. 10. 2024 08:36, Ing. Monika Szurmanová, Ph.D.

Anotace

V originále

CATH (https://www.cathdb.info) identifies domains in protein structures from wwPDB and classifies these into evolutionary superfamilies, thereby providing structural and functional annotations. There are two levels: CATH-B, a daily snapshot of the latest domain structures and superfamily assignments, and CATH+, with additional derived data, such as predicted sequence domains, and functionally coherent sequence subsets (Functional Families or FunFams). The latest CATH+ release, version 4.3, significantly increases coverage of structural and sequence data, with an addition of 65,351 fully-classified domains structures (+15%), providing 500 238 structural domains, and 151 million predicted sequence domains (+59%) assigned to 5481 superfamilies. The FunFam generation pipeline has been re-engineered to cope with the increased influx of data. Three times more sequences are captured in FunFams, with a concomitant increase in functional purity, information content and structural coverage. FunFam expansion increases the structural annotations provided for experimental GO terms (+59%). We also present CATH-FunVar web-pages displaying variations in protein sequences and their proximity to known or predicted functional sites. We present two case studies (1) putative cancer drivers and (2) SARS-CoV-2 proteins. Finally, we have improved links to and from CATH including SCOP, InterPro, Aquaria and 2DProt.

Návaznosti

EF16_013/0001777, projekt VaV
Název: ELIXIR-CZ: Budování kapacit
90131, velká výzkumná infrastruktura
Název: ELIXIR-CZ II