Showcasing research from Dr Martin Marek's Structural Biology group, Loschmidt Laboratories, Masaryk University, Czech Republic. D e c o d i n g t h e i n t r i c a t e n e t w o r k o f m o l e c u l a r i n t e r a c t i o n s o f a h y p e r s t a b l e e n g i n e e r e d b i o c a t a l y s t Computational design of protein catalysts with enhanced stabilities is a challenging task. We report X-ray structures of a hyperstable engineered haloalkane dehalogenase (Tm = 73.5 °C), which highlight key thermostabilization effects. Unexpectedly, mutations toward bulky aromatic amino acids at the protein surface triggered long-distance (-27 A) backbone changes due to cooperative effects. These cooperative interactions produced an unprecedented double-lock system that dramatically reduced the volumes of enzyme access tunnels. Despite these spatial restrictions, experimental tracing of the access tunnels using krypton derivative crystals demonstrates that the transport of ligands is still effective. A s f e a t u r e d i n : S e e Jiri D a m b o r s k y , M a r t i n M a r e k e r a / . , Chem. Sci., 2020,11,11162. ^ £ 2 ^ ™ S I I w rsc.li/chemical-scienceOF CHEMISTRY Registered charity number: 20789C Chemical Science ROYAL SOCIETY OF CHEMISTRY EDGE ARTICLE View Article Online V i e w Journal I View Issue 11) Check for updates Cite this: Chem. ScL, 2020,11,11162 All publication charges for this article have been paid for by the Royal Society of Chemistry Received 17th June 2020 Accepted 10th September 2020 DOI: 10.1039/d0sc03367g rsc.li/chemical-science Decoding the intricate network of molecular interactions of a hyperstable engineered biocatalystf Klara Markova4a b Klaudia Chmelova4a D Sergio M. Marques,a D Philippe Carpentier, .,.ab ab . cd David Bednar,a b Jiri Damborsky * a b and Martin Marek * a b C o m p u t a t i o n a l design of protein catalysts w i t h e n h a n c e d stabilities for use in research a n d e n z y m e t e c h n o l o g i e s is a c h a l l e n g i n g task. Using f o r c e - f i e l d calculations a n d p h y l o g e n e t i c analysis, w e previously d e s i g n e d t h e haloalkane d e h a l o g e n a s e DhaA115 w h i c h c o n t a i n s 11 m u t a t i o n s that c o n f e r u p o n it o u t s t a n d i n g t h e r m o s t a b i l i t y ( T m = 73.5 °C; A T m > 23 °C). A n u n d e r s t a n d i n g o f t h e structural basis o f this hyperstabilization is required in o r d e r t o d e v e l o p c o m p u t e r a l g o r i t h m s a n d predictive tools. Here, w e r e p o r t X-ray structures of DhaA115 at 1.55 A a n d 1.6 A resolutions a n d their m o l e c u l a r d y n a m i c s trajectories, w h i c h unravel t h e intricate n e t w o r k o f interactions that r e i n f o r c e t h e a f t a - s a n d w i c h architecture. Unexpectedly, m u t a t i o n s t o w a r d bulky a r o m a t i c a m i n o acids at t h e p r o t e i n surface triggered l o n g - d i s t a n c e (~27 A) b a c k b o n e c h a n g e s d u e t o c o o p e r a t i v e effects. T h e s e c o o p e r a t i v e interactions p r o d u c e d an u n p r e c e d e n t e d d o u b l e - l o c k system that: (i) i n d u c e d b a c k b o n e c h a n g e s , (ii) closed t h e m o l e c u l a r gates t o t h e active site, (iii) r e d u c e d t h e v o l u m e s o f t h e m a i n a n d slot access tunnels, a n d (iv) o c c l u d e d t h e active site. Despite these spatial restrictions, e x p e r i m e n t a l tracing o f t h e access t u n n e l s using k r y p t o n derivative crystals d e m o n s t r a t e s that t r a n s p o r t of ligands is still effective. O u r findings highlight key t h e r m o s t a b i l i z a t i o n effects a n d provide a structural basis for designing n e w t h e r m o s t a b l e protein catalysts. Introduction Enzymes have evolved for b i l l i o n s of years, a n d w i l l c o n t i n u e to do so as l o n g as life o n earth exists.1 T h e y catalyze a l m o s t a l l c h e m i c a l reactions that occur i n l i v i n g organisms, a n d m a n y of t h e m have been successfully incorporated into diverse industrial, e n v i r o n m e n t a l a n d b i o m e d i c a l technologies.2 Often, w i l d type enzymes do n o t fully m e e t the d e m a n d s of these h a r s h technological processes, a n d p u n c t u a l m u t a t i o n s are engineered into t h e m to improve their physico-chemical properties for technological applications. T h e key parameter for a l l enzymes to be employed i n i n d u s t r i a l catalysis is "Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Kamenice 5, 625 00 Brno, Czech Republic. E-mail: jiri@chemi. muni, cz; martin. marek@recetox. muni, cz international Clinical Research Center, St. Anne's University Hospital Brno, Pekařská 53, 656 91 Brno, Czech Republic 'Universitě Grenoble Alpes, CNRS, CEA, Interdisciplinary Research Institute of Grenoble (IRIG), Laboratoire Chimie et Biologie des Métaux (LCBM), 17 Avenue des Martyrs, 38054 Grenoble, France d European Synchrotron Radiation Facility (ESRF), 71 Avenue des Martyrs, 38043 Grenoble, France f Electronic supplementary information (ESI) available. See DOI: 10.1039/d0sc03367g % These authors contributed equally. thermostability, w h i c h allows t h e m to w i t h s t a n d elevated temperatures d u r i n g biocatalytic processes.3 E n h a n c i n g protein thermostability involves changes that shift the f o l d i n g - u n f o l d i n g balance toward the folded f o r m . Stabilizing substitutions c a n either stabilize the folded conform a t i o n or destabilize the u n f o l d e d one. T h e m o s t direct way to stabilize proteins is to create or strengthen attractive interactions between a m i n o acids i n the folded c o n f o r m a t i o n . A l t h o u g h proteins w i l l c o n t i n u e to u n f o l d anyway, these stronger interactions w i l l either slow d o w n u n f o l d i n g or speed up r e f o l d i n g processes." A structured f o r m c a n be stabilized t h r o u g h non-covalent interactions i n c l u d i n g h y d r o p h o b i c interactions, hydrogen b o n d s , salt bridges a n d v a n der W a a l s forces.5 Increasing the n u m b e r of s t a b i l i z i n g electrostatic interactions between residues of opposite charge reinforces proteins' t h e r m a l stability.6 H y d r o p h o b i c interactions have been s h o w n to contribute proportionally m o r e effectively to protein stability t h a n hydrogen b o n d s . 7 T h e h y d r o p h o b i c effect is i n d e e d the d o m i n a n t d r i v i n g force i n protein folding, a n d d e s i g n i n g a well-packed h y d r o p h o b i c core is therefore usually a n efficient strategy for e n g i n e e r i n g stable proteins.4 Haloalkane dehalogenases (HLDs; E C 3.8.1.5) are a/p-hydrolases that catalyze the hydrolytic cleavage of the carbon-halogen b o n d i n diverse halogenated aliphatic hydrocarbons via S N 2 nucleophilic substitution. The reaction requires the a d d i t i o n of 11162 I Chem. Sei., 2 0 2 0 , 11, 11162-11178 This journal is © The Royal Society of Chemistry 2 0 2 0 V i e w Article Online Edge Article a water molecule a n d releases a halide i o n together w i t h a proton, finally p r o d u c i n g the corresponding alcohol.8 Structurally, H L D s consist o f a canonical a/p-hydrolase fold, w h i c h is composed of a central eight-stranded P-sheet d o m a i n surrounded by several ahelices {i.e. a P a sandwich architecture). A n additional versatile helical cap d o m a i n is observed to be specific to each H L D enzyme.9 T h e active site contains a catalytic pentad, w h i c h consists of a nucleophile, a base, a catalytic acid, a n d two halidestabilizing residues.9 I n a l l H L D s , the active site is positioned i n a hydrophobic pocket buried between the a/p-fold core a n d the cap d o m a i n , a n d this catalytic center is connected w i t h the b u l k solvent via a m a i n t u n n e l a n d a slot t u n n e l . 1 0 B o t h o f these tunnels are crucial determinants of the specific catalytic activity and the substrate selectivity o f each H L D enzyme.1 1 '1 2 Recently, we developed FireProt,1 3 '1 4 a fully automated a n d robust computational pipeline c o m b i n i n g energy- a n d evolutionbased approaches to design highly stable multi-point mutant proteins. W e employed FireProt to enhance the thermostability of DhaA, a n H L D enzyme f r o m Rhodococcus rhodochrous [Tm = 50.5 °C; To p t = 45 °C). After several iteration cycles, we obtained an 11-point D h a A mutant, hereafter referred to as DhaA115, with outstanding thermostability (Tm = 73.5 °C) a n d thermophilicity, as demonstrated by a substantial shift i n the o p t i m a l catalytic temperature (To p t = 65 °C).1 3 C o m p u t a t i o n a l m o d e l i n g showed that 3 of the 11 stabilizing residues line the m a i n access tunnel, 3 other residues are buried w i t h i n the protein core a n d the last 5 residues are exposed to solvent o n the protein surface.1 3 W e inferred that 8 of these mutations (C128F, T148L, A172I, C176F, D198W, V219W, C262L a n d D266F), w h i c h were identified by the energy-based approach, potentially enhance the stability of the enzyme by i m p r o v i n g the p a c k i n g of atoms w i t h i n the protein interior and/or by strengthening hydrophobic interactions.1 3 However, the stabilizing effects of the 3 r e m a i n i n g mutations (E20S, F80R a n d A155P), proposed by the evolution-based approach, cannot be reproduced by force-field calculations.1 5 Experimental data are l a c k i n g to explain the structural basis for the engineered hyperstability of DhaA115. To fill this gap, we crystallized a n d solved high-resolution structures of the hyperstable enzyme D h a A 1 1 5 . Analyses of these crystal structures h i g h l i g h t specific a m i n o acid constellations that p r i m a r i l y reinforce the a P a - s a n d w i c h architecture and the helical cap d o m a i n via m u l t i p l e newly-established interactions of the non-polar, h y d r o p h o b i c a n d aromatic 7 t - u stacking types. Surprisingly, we f o u n d that placement of bulky aromatic a m i n o acids o n the protein surface triggered some unexpected long-distance changes i n the protein backbone. Essentially, these changes cause the gates a n d the internal volumes of b o t h the m a i n a n d the slot access tunnels to be restricted, a n d consequently the enzyme active site appears somewhat occluded. Interestingly, despite the active site occlusion, experimental m a p p i n g of the enzyme tunnels by krypton derivatization o f the DhaA115 crystals, supported by protein dynamics simulations, showed that l i g a n d molecules can still be transported t h r o u g h the enzyme tunnels. Collectively, o u r findings demonstrate that the hyperstabilization engineered i n D h a A led to massive r e d u c t i o n i n the v o l u m e of its access tunnels, a n d that the enzymes are still capable of This journal is © The Royal Society of Chemistry 2 0 2 0 Chemical Science operating since they are permeable to substrates, products a n d water molecules. T h i s permeability is t h e n increased at elevated temperature, as previously demonstrated by the shifted o p t i m a l catalytic temperature (Topt = 65 °C) of the DhaA115 enzyme.1 3 Results Crystal structure of the hyperstable enzyme DhaA115 To o b t a i n precise structural i n f o r m a t i o n about h o w the D h a A enzyme is thermostabilized, we focused o u r efforts o n crystallization of the m o s t stabilized enzyme variant, DhaA115. W e obtained crystals that belonged to the space group P12tl a n d diffracted at 1.6 A resolution (Table 1). T h e final m o d e l contains Table 1 Crystallographic data c o l l e c t i o n a n d r e f i n e m e n t statistics Krypton-soaked Data collection" Native DhaA115 DhaA115 Wavelength (A) 0.861 0.861 Space group P12il P21 21 21 Cell dimensions a, b, c (A) 70.19, 68.12, 83.92 67.98, 82.04, 144.18 c, ß, J (°) 90, 104.82, 90 90, 90, 90 Resolution (A) 48.07-1.6 (1.657-1.6) 49.2-1.55 (1.605-1.55) Total reflections 673 251 (64 239) 1 562 749 (155 793) Unique reflections 99 075 (9734) 116 063 (11 342) ^merge 6.85 (61.9) 6.3 (83.5) Hal 15.55 (2.55) 24.89 (3.45) Completeness (%) 98.16 (97.14) 98.76 (98.12) Multiplicity 6.8 (6.6) 13.5 (13.7) CC(l/2) 99.9 (85.6) 100 (94.7) Wilson B-factor 17.23 18.26 Refinement Resolution (A) 48.07-1.6 49.2-1.55 (1.657-1.6) (1.605-1.55) No. reflections 99 075 (9733) 116 063 (11 335) ^work/^free 0.158/0.178 0.160/0.181 Number of atoms Protein 4837 4827 Ligand 62 125 Water 567 622 B-factors Protein 20.72 19.20 Ligand 31.74 31.56 Water 32.49 33.33 RMS deviations Bond lengths (A) 0.006 0.006 Bond angles (°) 0.84 0.90 Ramachandran 96.02 96.37 favored (%) Ramachandran 3.98 3.63 allowed (%) Ramachandran 0 0 outliers (%) PDB ID code 6SP5 6SP8 " Values i n parentheses are for the highest-resolution shell. Chem. Sei., 2 0 2 0 , 11, 11162-11178 | 11163 Chemical Science V i e w Article Online Edge Article two enzyme molecules per asymmetric u n i t a n d has good values for deviation f r o m the ideal (root mean-square deviation o n the C a atoms of 0.3 A; F i g . S l f ) , w i t h .R-factor a n d R-free values of 0.16 a n d 0.18 respectively (Table 1). A l m o s t a l l of the residues were built i n density, except for a few residues at the disordered a m i n o - a n d carboxy-terminal ends. DhaA115 adopts a c a n o n i c a l H L D fold similar to that o f the wild-type D h a A ( R M S D o n the C a atoms of 0.6 A ; F i g . S2f), f o r m i n g a single a|3a s a n d w i c h architecture (a/p-hydrolase core] w i t h a characteristic h e l i c a l cap d o m a i n (Fig. 1). T h e a/p core is c o m p o s e d of a central eight-stranded P-sheet, w i t h a P2 strand i n a n anti-parallel orientation. T h i s a/P core is s a n d w i c h e d by helices ( a l - a 3 ) o n one side a n d ( a 8 - a l l ) o n the other. The helical (a4-a7) cap d o m a i n , w h i c h is positioned between the P6 strand a n d the a8 helix, shields the a/p-hydrolase core to w h i c h it is a n c h o r e d via L 9 a n d L 1 4 loops. T h e enzyme active site is located i n a p r e d o m i n a n t l y h y d r o p h o b i c cavity f o r m e d at the interface between the a/p-hydrolase core a n d the cap d o m a i n . The overall topology o f the secondary structure elements is very similar to that of the wild-type D h a A . However, specific backbone re-arrangements are observed, w h i c h encompass the L 9 , L10 a n d L 1 4 loops a n d the a4 a n d a9 helices (Fig. 1 a n d S2f). A B Stabilizing rK mutations v> N - h y d r o l a s e c o r e j i _ c a p d o m a i n 4F i_ J L h y d r o l a s e core - C DhaA115 L5 u2 - 6 E DhaA115 1 M S E I G T G F P F D P HY V E V LGMRMHY V D V G P R D G T PV L F L H G N P T S S Y LWRN I I P H V A P S H R C I A P D L I G M G K S D K P D L D Y R F D 82 DhaA a2 L6 ß5 L7 a 3 L8 L9 a4 L10 a 5 ' DhaA115 83 D H V R Y L D A F I E A L G L E E V V L V I HDWG S A LG F HWA KR N P ERV KG I A FjM E F I RP I P T W D E W P E F A R E DhaA D H V R Y L D A F I E A L G L E E V V L V I H D W G S A L G F H W A K R N P E R V K G I A I M E F I R P I P T W D E W P E L11 a 5 L12 a6 L13 a7 L14 L F Q A F R T P i _^L15 _ß7 I D V G R E L I I D 164 E L I I D 164 L16 DhaA115 165 Q N A F I EG| DhaA J L P K j V VW,V V R P L T E V E M D H Y R E P F L K P VWR E P LWR F P N E L P I A G E P A N I WA L V E A Y M NWL HQ S P V P K L L FWG T PG V L 246 5 E P A N I H A L V L18 «10 L19 DhaA115 247 I P P A DhaA E A A R L A E S L P N B K T V B I G P G L H Y L Q E D N P D L I G S E I A R W L P A L 293 E A A R L A E S L P N | K T V | I G P G L H Y L Q E D N P D L I G S E I A R W L P A L 293 L148 P155 Side view Top view Fig. 1 Overall s t r u c t u r e of DhaA115. (A) S c h e m a t i c r e p r e s e n t a t i o n of t h e p r o t e i n s e q u e n c e s h o w i n g t h e d o m a i n t o p o l o g y of DhaA115 a n d t h e positions of t h e stabilizing m u t a t i o n s . (B) S t r u c t u r e - b a s e d s e q u e n c e a l i g n m e n t of DhaA115 a n d DhaA. T h e stabilizing m u t a t i o n s are s h o w n in violet frames. S e c o n d a r y s t r u c t u r e e l e m e n t s f o u n d in DhaA115 are s h o w n a b o v e t h e a l i g n m e n t . Catalytically essential residues are p o i n t e d o u t w i t h red dots. (C) C a r t o o n r e p r e s e n t a t i o n of DhaA115 s t r u c t u r e w i t h t h e c e n t r a l e i g h t - s t r a n d e d (3-sheet (yellow), t h e a/(3-hydrolase helices (blue) and t h e helical c a p d o m a i n ( b r o w n ) . T h e stabilizing m u t a t i o n s are s h o w n as p u r p l e spheres. T h e i s o t h i o c y a n a t e (SCN) m o l e c u l e b o u n d in t h e e n z y m e active site is s h o w n as spheres. (D) C l o s e - u p v i e w of t h e e n z y m e ' s active site w i t h key catalytically essential residues (grey sticks) a n d s i m u l a t e d a n n e a l i n g o m i t e l e c t r o n density m a p c o n t o u r e d at 2a a r o u n d t h e i s o t h i o c y a n a t e (SCN) m o l e c u l e . M o l e c u l a r c o n t a c t s b e t w e e n p r o t e i n residues a n d SCN are s h o w n as y e l l o w dashed lines. 11164 | Chem. ScL, 2 0 2 0 , 11, 11162-11178 This journal is © The Royal Society of Chemistry 2 0 2 0 V i e w Article Online Edge Article Additionally, we u n a m b i g u o u s l y identified i n the electron density m a p the presence of bis-tris propane (B3P), glycerol (GOL) a n d isothiocyanate (SCN) molecules, w h i c h were b o u n d to the DhaA115 enzyme. Consistent w i t h this, bis-tris propane a n d isothiocyanate were required i n the crystallization solution, while the glycerol was used for cryo-protection. T h e bis-tris propane a n d glycerol molecules are b o u n d o n the protein surface, the former b e i n g also involved i n crystal-packing contacts. There are three S C N - b i n d i n g sites per enzyme molecule; two of t h e m are also located o n the enzyme surface while the last one is deeply b u r i e d i n the active site cavity (Fig. S l f ) . A s s h o w n i n F i g . I D , the latter S C N a n i o n interacts w i t h three catalytic residues: the n u c l e o p h i l i c aspartate D 1 0 6 (2.6 A) a n d the two halide-stabilizing residues, W 1 0 7 (3.5 A) a n d N 4 1 (3.6 A). It is also i n close contact w i t h the non-catalytic proline P206 (3.3 A). T h i s S C N - b i n d i n g site thus overlaps w i t h the halideb i n d i n g site, where the halide a n i o n product is usually captured d u r i n g the dehalogenation reaction. Solution structure of the hyperstable DhaA115 W h i l s t the wild-type D h a A is a m o n o m e r i c enzyme, we previously noted that the stabilized DhaA115 variant forms a m i n o r i t y of dimers a n d high-molecular-weight oligomeric states.1 5 W e therefore speculated as to whether the d i m e r observed i n the asymmetric u n i t of the crystal (Fig. S l f ) m i g h t also exist i n solution. T o test this hypothesis, we employed small-angle X-ray light scattering (SAXS) analysis to probe the DhaA115 structure i n solution. T h e SAXS profile of the DhaA115 solution closely fits the scattering profile calculated u s i n g a single DhaA115 m o n o m e r of the crystal structure ( x 2 = 1.25), but consistently does n o t correspond at a l l to the scattering curve calculated u s i n g the d i m e r of the crystal asymmetric u n i t (x2 = 50.3; F i g . 2). Furthermore, the radius of gyration (Rg ) determined for the merged data has a value o f 18.34 A. T h e representative pair distance d i s t r i b u t i o n function, P(r), evaluated by the indirect Fourier transform w i t h the G N O M Chemical Science package,1 6 is s h o w n i n F i g . 2. T h e profile has a bell-like shape w i t h a m a i n peak at 23.4 A , a n d trails off to a m a x i m u m d i m e n s i o n (Dm a x) o f ~ 5 7 A . Finally, the ab initio m o d e l reconstructed f r o m the experimental SAXS data perfectly a c c o m m o dates a m o n o m e r of the DhaA115 crystal structure (Fig. 2). O u r SAXS results demonstrate that the purified DhaA115 is indeed a m o n o m e r i c enzyme. C o m p l e m e n t a r y PISA calculat i o n s 1 7 showed that the b u r i e d solvent-accessible area i n the crystal contact DhaA115 d i m e r is ~ 2 4 1 A 2 , w h i c h represents only 2 . 1 % o f the total solvent-accessible surface area of the m o n o m e r (~11 298 A 2 ) . T a k e n together, the SAXS experiments a n d the PISA calculations provide evidence that the crystal contact d i m e r observed i n the asymmetric u n i t (Fig. S l f ) is n o t biologically relevant a n d does n o t exist i n solution. O u r data suggest that the DhaA115 dimers observed by Beerens a n d cow o r k e r s 1 5 m u s t employ a d i m e r i z a t i o n interface different f r o m that observed i n o u r crystal p a c k i n g (Fig. S l f ) . Localization of the stabilizing mutations Computer-aided design predicted eleven a m i n o acid substitutions, whose simultaneous i n t r o d u c t i o n into the D h a A enzyme resulted i n a highly thermostable enzyme variant, D h a A 1 1 5 , w i t h Tm = 73.5 °C a n d shifted o p t i m a l catalytic temperature [Topt = 65 °C).1 3 A s s h o w n i n Fig. 1A, the designed mutations are evenly distributed a l o n g the protein sequence, w i t h 6 of t h e m localized i n the a/p-hydrolase core (E20S + F80R + C128F + V 2 1 9 W + C 2 6 2 L + D266F) a n d the r e m a i n i n g 5 m u t a t i o n s i n the cap d o m a i n (T148L + A155P + A172I + C 1 7 6 F + D198W). Careful inspection o f the DhaA115 structure revealed that n i n e o u t of the eleven mutations are located i n the secondary structure elements (Fig. IB), the other two (F80R + A155P) i n the secondary structure/loop transitions. Structural implications of evolution-based mutations There are three m u t a t i o n s that were designed by a protein evolution-based a p p r o a c h , 1 3 n a m e l y E20S, F80R a n d A155P. A l l Scattering vector q ( A ) r (A) view Fig. 2 Solution s t r u c t u r e o f DhaA115 d e t e r m i n e d by SAXS. (A) E x p e r i m e n t a l SAXS scattering c u r v e for DhaA115 (black dots) is s h o w n against t h e calculated scattering curves for t h e DhaA115 m o n o m e r (blue line) a n d DhaA115 crystal d i m e r (green line). (B) Distance distribution f u n c t i o n of DhaA115 c o m p u t e d f r o m t h e X - r a y scattering pattern using t h e G N O M p r o g r a m . (C) Ab initio m o l e c u l a r e n v e l o p e g e n e r a t e d f r o m SAXS data analysis. T h e m o l e c u l a r SAXS e n v e l o p e of t h e DhaA115 m o n o m e r is s h o w n in a s e m i - t r a n s p a r e n t grey c o l o r s u p e r p o s e d o n t h e DhaA115 m o n o m e r of t h e crystal s t r u c t u r e r e p r e s e n t e d as a blue c a r t o o n . This journal is © The Royal Society of Chemistry 2 0 2 0 Chem. Sei., 2 0 2 0 , 11, 11162-11178 | 11165 Chemical Science V i e w Article Online Edge Article these residues are located o n the enzyme surface (Fig. 1C), where they were f o u n d to either participate i n the surface charge network i m p o r t a n t for protein-solvent interactions (Fig. 3A) or rigidify the solvent-exposed flexible loop (Fig. 3B). Specifically, the replacement of a surface phenylalanine w i t h a n arginine (F80R) disrupted the cation-7t interaction between F80 a n d R204 (4.0 A) present i n the wild-type D h a A a n d established new ionic interactions w i t h D78, D82 a n d D 8 3 . Moreover, there is a newly established water-mediated hydrogen-bonded network involving R80, D82, D83 a n d R86 (Fig. 3A). Similarly, the serine residue (E20S) participates i n the formation of a n extensive solvent-mediated interaction network, i n w h i c h the residues L18, S20, D73 a n d Y87 are involved. Strikingly, the water-mediated interactions between the L18, S20 a n d Y87 residues apparently rigidify the L I loop connecting the p i a n d P2 strands, a n d help to protect the central P-sheet (Fig. 3A). B o t h stabilizing S20 a n d R80 residues, w h i c h are located ~ 1 6 . 5 A apart f r o m one another, participate extensively i n local protein-water interactions, w h i c h contribute to the global solvent hydrogen-bonded network (Fig. 3A). The last of the m u t a t i o n s deduced by the evolution-based a p p r o a c h is the substitution of a n alanine by p r o l i n e (A155P] i n the L 1 0 loop that connects the a4 a n d a 5 ' helices w i t h i n the cap d o m a i n . T h i s s u b s t i t u t i o n forces the L10 loop to adopt a c o n f o r m a t i o n different f r o m that observed i n D h a A . Specifically, the i n t r o d u c e d p r o l i n e residue (P155) is present i n transc o n f o r m a t i o n , w h i c h brings its carbonyl oxygen into a p o s i t i o n where it c a n interact w i t h the m a i n - c h a i n nitrogen of V157 (2.6 A) (Fig. 3B). I n a d d i t i o n , the new c o n f o r m a t i o n of the L10 loop enables the molecule to establish two new m a i n - c h a i n to m a i n c h a i n hydrogen b o n d s , n a m e l y between the carbonyl oxygen of T154 a n d the nitrogen of G158 (2.9 A), a n d between the carbonyl a t o m of A151 a n d the nitrogen of T154 (3.1 A). T h e L 1 0 loop interacts extensively w i t h a n u n d e r n e a t h a7 helix t h r o u g h m u l t i p l e water-mediated hydrogen b o n d s i n D h a A 1 1 5 , b u t not i n D h a A (Fig. 3B), w h i c h again m a y have a positive effect o n protein-solvent interactions. Structural implications of energy-based mutations The r e m a i n i n g 8 m u t a t i o n s i n D h a A 1 1 5 (C128F, T148L, A172I, C176F, D 1 9 8 W , V219W, C 2 6 2 L a n d D266F) were inferred by force-field c a l c u l a t i o n s . 1 3 Interestingly, all these a m i n o acids were m u t a t i o n s toward residues of the h y d r o p h o b i c or aromatic type a n d always w i t h a sterically b u l k i e r side c h a i n . Prior to our E v o l u t i o n - i n f e r r e d m u t a t i o n s E n e r g y c a l c u l a t i o n - d e r i v e d m u t a t i o n s S20 + R80 L148 + I172 + F176 W219 + F266 B P155 F128 + L262 D198W Fig. 3 Stabilizing a m i n o acid interactions o b s e r v e d in DhaA115. C l o s e - u p views of stabilizing residues a n d their interacting n e i g h b o u r s : S20 and R80 (A), P155 (B), L148,1172 a n d F176 (C), F128 and L262 (D), W219 and F266 (E) and W 1 9 8 (F). T h e stabilizing residues are s h o w n as p u r p l e sticks and s e m i - t r a n s p a r e n t spheres, t h e s u r r o u n d i n g p r o t e i n residues are s h o w n as grey sticks, and w a t e r m o l e c u l e s as red spheres. H y d r o g e n b o n d s are s h o w n as y e l l o w dashed lines. T h e p r o t e i n p h y l o g e n y - i n f e r r e d m u t a t i o n s are in panels A and B, w h i l e t h e e n e r g y c a l c u l a t i o n - p r e d i c t e d m u t a t i o n s are in panels C - F . 11166 | Chem. ScL, 2 0 2 0 , 11, 11162-11178 This journal is © The Royal Society of Chemistry 2 0 2 0 Edge Article V i e w Article Online Chemical Science current work, these mutations were a s s u m e d to reinforce h y d r o p h o b i c interactions a n d improve the protein p a c k i n g . 1 3 O u r DhaA115 structure strongly confirms these a s s u m p t i o n s , but we also see n e w structural effects that were n o t previously predicted by the c o m p u t a t i o n a l design. Crucially, we observe that the majority of the energy-based mutations (7 o u t of the 8, D 1 9 8 W b e i n g the exception) cooperate w i t h each other a n d jointly contribute to the stabilization of the protein fold. Firstly, a triplet of mutations (T148L, A172I a n d C176F) localized i n the cap d o m a i n interact w i t h each other, b u t also strongly reinforce the h y d r o p h o b i c a n d aromatic Tt-Tt interactions w i t h the n e i g h b o r i n g residues. These three stabilizing m u t a t i o n s interlock the a.4,