BAKKER, Michael J., Arnošt MLÁDEK, Hugo SEMRÁD, Vojtěch ZAPLETAL and Jana PAVLÍKOVÁ PŘECECHTĚLOVÁ. Improving IDP theoretical chemical shift accuracy and efficiency through a combined MD/ADMA/DFT and machine learning approach. Physical Chemistry Chemical Physics. Royal Society of Chemistry, 2022, vol. 24, No 45, p. 27678-27692. ISSN 1463-9076. Available from: https://dx.doi.org/10.1039/D2CP01638A.
Other formats:   BibTeX LaTeX RIS
Basic information
Original name Improving IDP theoretical chemical shift accuracy and efficiency through a combined MD/ADMA/DFT and machine learning approach
Authors BAKKER, Michael J., Arnošt MLÁDEK, Hugo SEMRÁD (203 Czech Republic, belonging to the institution), Vojtěch ZAPLETAL and Jana PAVLÍKOVÁ PŘECECHTĚLOVÁ (guarantor).
Edition Physical Chemistry Chemical Physics, Royal Society of Chemistry, 2022, 1463-9076.
Other information
Original language English
Type of outcome Article in a journal
Field of Study 10403 Physical chemistry
Country of publisher United Kingdom of Great Britain and Northern Ireland
Confidentiality degree is not subject to a state or trade secret
WWW URL
Impact factor Impact factor: 3.300
RIV identification code RIV/00216224:14310/22:00127220
Organization unit Faculty of Science
Doi http://dx.doi.org/10.1039/D2CP01638A
UT WoS 000885111900001
Keywords (in Czech) NMR spectroscopy; chemical shift; disordered protein; cluster analysis; optimization in normal modes; molecular dynamics; protein fragmentation; adjustable density matrix assembler; density functional theory; multi-scale modelling
Keywords in English NMR spectroscopy; chemical shift; disordered protein; cluster analysis; optimization in normal modes; molecular dynamics; protein fragmentation; adjustable density matrix assembler; density functional theory; multi-scale modelling
Tags rivok
Tags International impact, Reviewed
Changed by Changed by: Mgr. Hugo Semrád, Ph.D., učo 375827. Changed: 1/12/2022 15:07.
Abstract
This work extends the multi-scale computational scheme for the quantum mechanics (QM) calculations of Nuclear Magnetic Resonance (NMR) chemical shifts (CSs) in proteins that lack a well-defined 3D structure. The scheme couples the sampling of an intrinsically disordered protein (IDP) by classical molecular dynamics (MD) with protein fragmentation using the adjustable density matrix assembler (ADMA) and density functional theory (DFT) calculations. In contrast to our early investigation on IDPs (Pavlíková Přecechtělová et al., J. Chem. Theory Comput., 2019, 15, 5642–5658) and the state-of-the art NMR calculations for structured proteins, a partial re-optimization was implemented on the raw MD geometries in vibrational normal mode coordinates to enhance the accuracy of the MD/ADMA/DFT computational scheme. In addition, machine-learning based cluster analysis was performed on the scheme to explore its potential in producing protein structure ensembles (CLUSTER ensembles) that yield accurate CSs at a reduced computational cost. The performance of the cluster-based calculations is validated against results obtained with conventional structural ensembles consisting of MD snapshots extracted from the MD trajectory at regular time intervals (REGULAR ensembles). CS calculations performed with the refined MD/ADMA/DFT framework employed the 6-311++G(d,p) basis set that outperformed IGLO-III calculations with the same density functional approximation (B3LYP) and both explicit and implicit solvation. The partial geometry optimization did not universally improve the agreement of computed CSs with the experiment but substantially decreased errors associated with the ensemble averaging. A CLUSTER ensemble with 50 structures yielded ensemble averages close to those obtained with a REGULAR ensemble consisting of 500 MD frames. The cluster based calculations thus required only a fraction of the computational time.
Links
LM2018140, research and development projectName: e-Infrastruktura CZ (Acronym: e-INFRA CZ)
Investor: Ministry of Education, Youth and Sports of the CR
PrintDisplayed: 21/8/2024 02:16