k 2019

Creation and visualization of secondary structure consensus for protein families

MIDLIK, Adam, Ivana HUTAŘOVÁ VAŘEKOVÁ, Jan HUTAŘ, Veronika NAVRÁTILOVÁ, Jaroslav KOČA et. al.

Základní údaje

Originální název

Creation and visualization of secondary structure consensus for protein families

Autoři

MIDLIK, Adam, Ivana HUTAŘOVÁ VAŘEKOVÁ, Jan HUTAŘ, Veronika NAVRÁTILOVÁ, Jaroslav KOČA, Karel BERKA a Radka SVOBODOVÁ VAŘEKOVÁ

Vydání

XX. setkání biochemiků a molekulárních biologů, 2019

Další údaje

Jazyk

angličtina

Typ výsledku

Prezentace na konferencích

Obor

10608 Biochemistry and molecular biology

Stát vydavatele

Česká republika

Utajení

není předmětem státního či obchodního tajemství

Odkazy

Organizační jednotka

Přírodovědecká fakulta
Změněno: 22. 1. 2020 13:53, Mgr. et Mgr. Adam Midlik, Ph.D.

Anotace

V originále

Protein structural data, deposited in the Protein Data Bank, are a valuable source of information and their amount is continuously growing (currently more than 150 000 structures). Furthermore, most protein structures can be classified into protein families based on their similarity [1]. Systematic study of these families is gaining importance and can yield interesting research results. Every protein family has a set of characteristic secondary structure elements (SSEs, namely helices and β-strands). Their arrangement is well defined and consistent throughout the whole family. However, there will always be some differences between the members of the family and a single structure is not enough to represent the whole family of structures, just as a single amino acid sequence is not enough to represent the whole family of sequences. For sequences, this problem is solved by multiple sequence alignment, which produces the consensus sequence and can be visualized by a sequence logo [2] – this extracts the essential features of the family and shows the similarities and differences within the family. For secondary structure, such an approach is currently missing. In this work, we introduce computational methods for extracting and visualizing the secondary structure consensus for a given protein family. Apart from giving an overview of the family, this consensus can also be used as an annotation template for our previously developed program SecStrAnnotator [3]. This allows annotation of SSEs in any family and unlocks the possibility of automated annotation of the key regions (e.g. active sites and channels) based on their position relative to the SSEs. [1] Dawson NL, Lewis TE, Das S, Lees JG, Lee D, Ashford P, Orengo CA, Sillitoe I (2017) CATH: an expanded resource to predict protein function through structure and sequence. Nucleic Acids Res 45(D1):D289-D295. https://doi.org/10.1093/nar/gkw1098