Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
Omics
View Article Online
RESEARCH ARTICLE View Journal
Published on 13 May 2019. Downloaded by MTA Research Centre for Natural Science on 5/27/2019 2:40:11 PM.
Many cell signaling pathways are orchestrated by the weak, transient, and reversible protein–protein
interactions that are mediated by the binding of a short peptide segment in one protein (parent protein)
to a globular domain in another (partner protein), known as peptide-mediated interactions (PMIs).
Previous studies normally had an implicit hypothesis that a PMI is functionally equivalent or analogous to
the protein–peptide interaction (PTI) involved in the PMI system, while ignoring parent context contribu-
tion to the peptide binding. Here, we perform a systematic investigation on the reasonability and applic-
ability of the hypothesis at structural, energetic and dynamic levels. It is revealed that the context
impacts PMIs primarily through conformational constraint of the peptide segments, which can (i) reduce
the peptide flexibility and disorder in an unbound state, (ii) help the peptide conformational selection to
fit the active pocket of partner proteins, and (iii) enhance the peptide packing tightness against the part-
ners. Long, unstructured and/or middle-located peptide segments seem to be more vulnerable to their
context than short, structured and/or terminal ones. The context is found to moderately or considerably
improve both the binding affinity and specificity of PMIs as compared to their PTI counterparts; with the
context support a peptide segment can contribute to B30–60% total binding energy of the whole PMI
system, whereas the contribution is reduced to B5–50% when the context constraint is released. In
addition, we also observe that peptide selectivity is largely impaired or even reversed upon stripping of
their parent context (global selectivity decreases from 34.2 to 1.7-fold), by examining the crystal
Received 12th March 2019, structures of full-length Src family kinases in an autoinhibitory state. Instead of the direct interaction and
Accepted 10th May 2019 desolvation that are primarily concerned in traditional studies, peptide flexibility and the entropy penalty
DOI: 10.1039/c9mo00041k should also play a crucial role in the context effect on PMIs. Overall, we suggest that the context factor
should not be ignored in most cases, particularly those with peptide segments that are long, highly
rsc.li/molomics disordered, and/or located at the middle region of their parent proteins.
kinds of contexts contribute significantly or insignificantly to monomer or a multiprotein complex. Most residues in the
PMIs. First, we systematically compared the crystal conforma- peptide segment belong to the interfacial residues of M1–M2
tions of peptide ligands in complex with their protein receptors interaction, although a few non-interfacial residues are allowed
and the same peptide segments in their parent proteins. to discretely distribute over peptide sequence to keep the
Second, twenty-third representative PMI systems were curated peptide continuity. We considered two residues separately
and subjected to atomistic molecular dynamics simulations to coming from M1 and M2 to be ‘‘in contact’’ (and thus as
Published on 13 May 2019. Downloaded by MTA Research Centre for Natural Science on 5/27/2019 2:40:11 PM.
characterize the structural dynamics of the peptide segments interfacial residues) if there was a hydrogen bond, a water-
with or without the support from their parent context. Thirdly, mediated hydrogen bond, a van der Waals interaction, or at
the PMI binding energy was also derived, analyzed and decom- least one pair of contacting nonhydrogen atoms (r4.5 Å)
posed in detail to examine the context contribution to different between them. This strategy was modified from our previous
energetic components involved in the PMIs. This work would work on protein–nucleic acid interactions.25 The hydrogen
establish a complete profile for the structural basis, energetic bonds, water-mediated hydrogen bonds and van der Waals
property and dynamic behavior of the context effect on PMI interactions were detected using HBPLUS26 and PROBE27
intermolecular recognition and interaction, and may help to programs, respectively.
correct the inappropriate applications of PTI succedaneums in Here, we only chose the PMI systems that have high-
future PMI studies. resolution crystal structures (r3 Å) and also the structures
should be intact in and around their peptide-mediated inter-
action site. In addition, the PMI should be a typical single
Materials and methods peptide-mediated interaction whose interaction site only
Crystal structure survey presents a sole successive peptide segment and other inter-
facial residues, if present, out of the segment in the parent
The current deposition (Sept. 2018) of the RCSB Protein Data protein are very few (r1/5 of the peptide residues). According
Bank (PDB) database24 was systematically surveyed using the to the above rule, a total of 23 representative PMI systems
following criteria implemented with an advanced search tool were curated from a number of literature reports.20,28,29 The
provided by the database: collected PMIs are diverse in terms of their biological functions
(i) Macromolecule types: only protein entities, no nucleic as well as the size, location, structural type and amino acid
acids. composition of their core peptide segments (Table 1). Some
(ii) Experimental method: X-ray diffraction. examples of these PMI structural architectures are shown in
(iii) X-ray resolution: r3 Å. ESI,† Fig. S1. The crystal structures of the PMI complexes were
(iv) Number of chains in biological assembly: Z2. used to derive their respective PTI counterparts. As shown in
(v) Length of at least one chain: 5–30. Fig. 2, the PMI system of subtilisin–chymotrypsin inhibitor 2
(vi) Sequence identity: r30%. complex (PDB: 2SNI_I:E) is mediated by the intermolecular
The criteria were applied for filtering the entire PDB database interaction between subtilisin (partner protein) and a loop
to obtain the high-resolution, non-homologous crystal structure peptide segment of chymotrypsin inhibitor 2 (parent protein).
data of the PTI complexes. Consequently, in total 6497 records The peptide bonds separately at the N- and C-termini of the
meeting the criteria were retrieved from the database, which peptide segment are manually broken and then capped by
represents a distinct panel of PTI complex crystal structures acetyl (–Ac) and amide (–NH2) moieties to eliminate the formal
(listed in ESI,† Table S1). We manually excluded some invalid charges at the free N- and C-termini of the generated peptide
samples from the panel, such as those that were largely incom- ligand, respectively, while rest of the parent protein (protein
plete and highly modified. Next, the peptide sequences were context) is discarded.
extracted from these collected complexes, which were then one-
by-one searched back against the database to identify the crystal Molecular dynamics simulation
structures that contain the same peptides as a segment of their The 23 representative PMIs and their PTI counterparts (Table 1)
parent proteins. The crystal structures are solved in high resolu- were subjected to atomistic molecular dynamics (MD) simula-
tion (r3 Å) and their peptide segment regions should be intact tions to examine the context effect on the structural dynamics
(e.g. no backbone breaking and/or side-chain missing). of peptide segments involved in these PMIs. All simulations were
carried out using the AMBER ff03 force field30 implemented with
Definition, curation, and setup of PMI systems and their PTI the sander module of the Amber package.31 According to our
counterparts previous studies,32–34 dynamics may be sensitive to the combi-
A PMI system is defined as a binary or multi-PPI complex that nation of parameter settings, solvent conditions, forcefield
consists of two members M1 and M2. The M1 is termed as the versions and so on for different systems. Here, we have per-
parent protein that contains a functional peptide segment as the formed pre-simulations to optimize the combination, although
core sequence element to mediate M1–M2 interaction; the M2 the optimization procedure is empirical and intuitive, and no
is designated as the partner protein that recognizes and inter- general/objective rule is applicable for guiding it. Briefly, a
acts with the peptide segment of M1. The M1 is commonly a truncated octahedral box of TIP3P water molecules35 was added
monomeric protein, whereas the M2 can be either a protein with a 12 Å buffer around the simulated system. Counterions of
Mol. Omics
Research Article
Fig. 2 Schematic representation of modeling PTI counterparts from PMI systems. The subtilisin–chymotrypsin inhibitor 2 complex (PDB: 2SNI_I:E) is a
PMI system mediated by the intermolecular interaction between subtilisin (partner protein) and a loop peptide segment of chymotrypsin inhibitor 2
(parent protein). The peptide bonds separately at the N- and C-termini of the peptide segment are manually broken and then capped by acetyl (–Ac) and
amide (–NH2) moieties to eliminate the formal charges at the free N- and C-termini of the split peptide ligand, respectively.
Na+ or Cl were placed based on the coulombic potential to keep interaction), the trajectory data of only the segment section
the whole systems electroneutral. Initially, the system was energy were extracted from the MD simulations of the whole PMI
minimized in two steps. First, only the water molecules and ions system to perform the energetics analysis, where the N- and
were minimized in 5000 steps while keeping the protein struc- C-termini of the segment did not need to be capped by –Ac or
ture restricted by weak harmonic constraints of force constant –NH2 groups, since the extraction did not really break the
2 kcal mol1 Å2. Second, a 3000 steps minimization with peptide segment from its parent protein, which just used the
steepest descent (500 steps) and conjugate gradient (2500 steps) dynamics trajectory of the segment section to analyze its sole
methods on the whole system was performed to remove bad binding behavior in the whole PMI system.
atomic contacts and bond distortions involved in the crude The conformational snapshots were also used to derive
structure. After minimizations the system was heated in the binding free energy DGttl for the system:42
NVT ensemble from 0 to 300 K over 50 ps followed by constant D E
ðiÞ ðiÞ
temperature equilibration at 300 K for 300 ps. Subsequently, MD DGttl ¼ DEint þ DGslv TDS ðiÞ
equilibrium simulations were run for 50 ns in the NPT ensemble D E
ðiÞ ðiÞ ðiÞ ðiÞ
with periodic boundary conditions. An integration step of 2 fs ¼ DEelc þ DEvdW þ DGplr þ DGnplr TDS ðiÞ
was set for the simulations and the particle mesh Ewald (PME) (1)
method36 was employed to analyze the long-range electrostatic
energy of a unit cell in a macroscopic lattice of repeating images. where h i represents average over snapshots from a single
A cut-off distance of 10 Å was considered to calculate the short- simulation trajectory and (i) corresponds to the ith snapshot of
range electrostatics and van der Waals interactions. The SHAKE the complex system. The DEint is the intermolecular interaction
strategy was used to restrain all covalent bonds involving energy between the complex members, which can be divided
hydrogen atoms.37 Temperature regulation was carried out using into electrostatic (DEelc) and van der Waals (DEvdW) potentials
a Langevin thermostat with a collision frequency of 2 ps1.38 and was calculated with a molecular mechanics (MM) approach.
DGslv is the solvent effect associated with the complex formation,
Snapshot collection, conformational clustering and energetics which is contributed from polar (DGplr) and nonpolar (DGnplr)
analysis desolvations. The polar aspect was computed by numerical
The conformational space of the investigated PMI/PTI systems solution of the nonlinear Poisson–Boltzmann (PB) equation,
(i.e. PMI protein–protein interaction/PTI protein–peptide inter- while the nonpolar facet was described using a surface model
action) during MD simulations was characterized using a (SA) as DGnplr = b + gSASA.43 The grid size for the PB calculations
clustering data analysis technique. A total of 1000 conforma- was set to 0.5 Å, and the interior and exterior dielectric constants
tional snapshots (frames) of each investigated system were were 2 and 78, respectively. TDS is the conformational free
saved evenly over the equilibrium trajectory of MD production energy due to the entropy penalty upon complex binding,
simulations.39 The average-linkage algorithm40 was employed which was characterized by normal mode analysis (NMA).44
to produce a number of clusters using the pairwise root-mean- Frequencies of the vibrational modes were computed at 300 K
square-deviations between frames as a metric comparing all the for conformational snapshots and using a harmonic approxi-
atoms from residues of the peptide segment (for PMI system) or mation of the energies.34 Considering the high computational
peptide ligand (for PTI system). PDB files were dumped for the demand, only 100 snapshots for each system were utilized to
average and representative structures from the clusters.41 In estimate the entropy penalty.45 Here, the MM/PBSA and NMA
addition, for examining the sole binding behavior of a peptide calculations were implemented using the mmpbsa and nmode
segment in an intact PMI system (i.e. PMI protein–peptide modules of the Amber package,31 respectively.
Table 2 Binding energetic components of the PPII peptide (with or without context) to SH3 domain of Src family kinases Src, Abl and Hck
240
Abl 1OPL NKPTVYGVSP249 Yes 0.19 127.0 89.8 24.2 13.0
No 0.83 115.4 69.5 39.5 6.4
223
Hck 2C0T SKPQKPWEKD232 Yes 0.25 109.4 71.0 27.6 10.8
No 0.91 96.3 49.3 42.8 4.2
a
The class II binding motif PxxPx+ is in bold. The three PPII peptides cannot meet the motif, although they bind to SH3 domains in the class II
orientation according to our previous analysis.11
Fig. 3 Histogram characterization of context effect on 109 peptide crystal conformations when binding to their partner proteins. (A) Frequency
distribution plot of peptide samples in different RMSD intervals (in 1 Å bin). (B) Average RMSD SE values of peptide samples in different length intervals
(5 AAs bin). The distribution can be fitted using a linear function with Pearson’s correlation of Rp = 0.86. (C) Average RMSD SE values of peptide samples
with different structural types.
and positively correlated, with Pearson’s correlation of Rp = 0.86. bonds and hydrophobic contacts across the hairpin arms, that
The average RMSD SE value of the 27 shortest peptides is only is to say, the two peptides can keep in the well-structured
3.1 0.8 Å, which is considerably smaller than that (7.2 3.2 Å) hairpin structures and are affected slightly by the splitting
of the 7 longest peptides, suggesting that the polypeptide seg- and binding.
ments can be impacted more significantly by their parent The long thrombin L peptide is highly flexible and locally
context as compared to oligopeptide segments—this is not helicated within its protein context of prothrombin (Fig. 4B).
unexpected if considering that, relative to short peptides, long However, the peptide has a substantial conformational differ-
peptides generally have a large conformational degree of free- ence relative to its split counterpart in complex with its partner
dom and thus are more vulnerable to context influence. How- protein prothrombin H, with a superposed RMSD value of
ever, the long peptides are structurally diverse and should be 5.32 Å, although the peptide backbone trace seems to be
inconsistent in response to different protein contexts, thus roughly consistent between the two conformations. The short
exhibiting a large RMSD variation (SE). On the other hand, the H4 peptide and epitope peptide 2 are located in the intrinsically
peptide structural type (assigned using the DSSP dictionary47) disordered regions of histone H4 (Fig. 4C) and glycoprotein
has also a substantial effect on conformational difference. As can RSV F (Fig. 4E-b), respectively. As might be expected, their
be seen in Fig. 3C, the 32 structured a-helical peptides are quite conformations are also changed considerably with superposed
rigid and can only be impacted modestly by protein context with RMSD values of 5.19 and 4.17 Å, respectively. This is expected
RMSD SE of 1.8 0.9 Å, while the 17 b-stranded peptides are due to the highly flexible nature of the two peptides and context
partially flexible and can be influenced moderately by the context support should play an important role in their respective PMIs.
with RMSD SE of 4.4 2.4 Å. The folding-upon-binding effect The midsized peptide segment of tumor-suppressor p53 is
seems to be also responsible for the conformational change of partially structured within its protein context, which contains
these b-stranded peptides upon splitting and binding. In con- a a-helix core capped by disordered N- and C-termini (Fig. 4D).
trast, the other 60 peptides, most of them coiled or loops, are Splitting and binding of the p53 peptide would not have a
intrinsically disordered and have large flexibility, and are very significant effect on the core helical conformation, but can
vulnerable to the context with RMSD SE of 6.0 2.9 Å. For largely influence the two termini, thus inducing a moderate
these flexible systems fuzziness was also observed, which, conformational change for the global structure of the peptide,
according to a previous study, may modulate the entropy and with a superposed RMSD value of 3.18 Å.
enthalpy by adapting their bound-state structural ensemble
to achieve optimal binding, thus allowing PMIs to minimize Structural dynamics analysis and comparison of representative
energetically unfavorable folding.9 PMIs and their PTI counterparts
Several peptide conformational difference cases are shown A total of 23 representative PMI systems are compiled in
in Fig. 4. The details can be found in the figure legends. As can Table 1, and their PTI counterparts are set as shown in Fig. 2.
be seen, a short b-hairpin peptide segment of complement These PMI/PTI systems were one-by-one subjected to 50 ns MD
protein C8a (Fig. 4A) and a long helical hairpin peptide simulations and their root mean-square deviation (RMSD)
segment of RSV F glycoprotein (Fig. 4E-a) are rigid and highly fluctuations (only regarding the regions of PMI peptide
structured. The two peptide conformations are changed quite segments or PTI peptide ligands) during the simulations were
modestly when they are split from their parent context and then analyzed and compared. Here, the RMSD fluctuations of
bind to their partner proteins C8g and motavizumab, with peptide segments/peptide ligands in six PMI/PTI systems are
superposed RMSD values of 0.87 and 1.45 Å, respectively. shown in Fig. 5. At a first glance, an evident difference can be
Structural analysis revealed that the rigid structure of the two observed between the RMSD fluctuations of these systems. The
hairpin peptides is tightly maintained by a number of hydrogen dynamic conformation of highly flexible loop peptides such as
Fig. 4 Conformational difference of the same peptides in complex with their partner proteins and within the context of their parent proteins. (A) The
short peptide segment 158LRYDSTAERLY168 of complement protein C8a is highly structured and folded into a two-stranded b-hairpin conformation
within its protein context (PDB: 3OJY). When the peptide is split from its parent protein C8a, it can also maintain in the hairpin conformation to interact
with its partner protein C8g (PDB: 2QOS), with superposed RMSD value of 0.87 Å. (B) The long peptide segment 276EADCGLRPLFEKKSLEDKTERELLE-
SYID304 of prothrombin is highly flexible and locally helicated within its protein context (PDB: 5EDM). When the peptide is split from its parent protein
prothrombin, it exhibits a substantial conformational change upon binding to its partner protein prothrombin H (PDB: 5A2M), with a superposed RMSD
value of 5.32 Å. (C) The short peptide segment 7GKGLGKGGA15 of histone H4 is intrinsically disordered within its protein context (PDB: 4PSX). When the
peptide is split from its parent protein H4, it exhibits a large conformational change upon binding to its partner protein Brd2 (PDB: 2E3K), with a
superposed RMSD value of 5.10 Å. (D) The midsize peptide segment 15SQETFSDLWKLLPEN20 of tumor-suppressor p53 is partially structured in a helical
conformation within its protein context (PDB: 2LY4). When the peptide is split from its parent protein p53, it exhibits a moderate conformational change
upon binding to its partner protein MDMX (PDB: 2MWY), with a superposed RMSD value of 4.18 Å. (E-a) The long peptide segment 254NSELLSLINDM-
PITNDQKKLMSNN277 of respiratory syncytial virus fusion RSV F glycoprotein is highly structured and folded into a two-stranded helical hairpin
conformation within its protein context (PDB: 3RRR). When the peptide is split from its parent protein RSV F, it can also maintain in the hairpin
conformation to interact with its partner protein motavizumab (PDB: 3IXT), with a superposed RMSD value of 1.45 Å. (E-b) The short peptide segment
427
KNRGIIKTFSN437 of RSV F glycoprotein is intrinsically disordered within its protein context (PDB: 3RRR). When the peptide is split from its parent
protein RSV F, it exhibits a moderate conformational change upon binding to its partner protein 101F (PDB: 3O41), with superposed RMSD value of 4.17 Å.
1AK4_D:A (A) and 2MTA_C:A (D) can be roughly constrained parent context can only moderately or modestly shift their
around their native state and is changed moderately during MD RMSD profiles. In particular, the peptide segment in 4JLR_S:HL
simulations. In contrast, the conformation exhibits a large (E) is rather rigid, which shows a small structural fluctuation
variation and alters dramatically when splitting these peptide and a very similar RMSD profile with and without the context
segments from their parent context in PMIs and re-performing support. In fact, this peptide is an epitope region of respiratory
simulations of the resulting PTI counterparts. In addition, syncytial virus immunogen that can well fold into a tightly
the partially structured strand/loop peptide 1D6R_I:A (F) has bound helical hairpin structure. Crystallographic study also
also a considerable difference in its conformational dynamics confirmed that the peptide can spontaneously form the hairpin
between PMI and PTI. During MD simulations the peptide and then binds to antibody motavizumab in an independent
segment can well maintain in the native strand/loop conforma- manner.48
tion in intact PMI, whereas the conformation becomes intrin- The above structural dynamics analysis suggested that
sically disordered and highly flexible in the PTI counterpart. flexible peptide segments such as loops are vulnerable to their
The other three PMI systems, namely, 1AY7_B:A (B), 7CEI_B:A parent context, whereas rigid peptide segments such a helix can
(C) and 4JLR_S:HL (E), are fully or partially structured as an work independently to interact with their partners. This is
a-helix in their functional peptide regions and thus possess a expected because a rigid peptide configuration is primarily
larger rigidity that is relatively insensitive to the context effect. maintained by intramolecular nonbonded interactions within
As can be seen, splitting of these peptide segments from their the peptide molecule that can be kept when splitting from its
Fig. 5 Comparison of the RMSD fluctuations between PMI peptide segment and PTI peptide ligand during 50 ns MD simulations of the PMI and PTI
systems: (A) 1AK4_D:A, (B) 1AY7_B:A, (C) 7CEI_B:A, (D) 2MTA_C:A, (E) 4JLR_S:HL, and (F) 1D6R_I:A. All atoms of only peptide segment/peptide ligand are
used to derive the profile, while the partner protein and parent protein context are not considered.
parent context. In this respect, peptide flexibility can be moderately touch on the surface of their partner proteins.
regarded as an indirect indicator to reflect the context effect The two peptide segments can well keep around the native
on PMIs. As can be seen in Fig. 6A, numerous conformational crystal conformation in intact PMI systems during the whole
snapshots of the peptide segment in the PMI system and simulation course, but the conformation and even the binding
peptide ligand in the PTI counterpart were extracted from mode of the peptide ligands in PTI counterparts have a distinct
their respective 50 ns MD trajectories and then their RMSD story from that of corresponding PMI systems. The loop peptide
variations relative to the native crystal structures were calcu- ligand of 1AK4_D:A PTI displays a strong thermal motion and
lated. Here, the obtained RMSD variations for the 23 PMI/PTI its conformation varies largely during the simulations, while
systems are illustrated in Fig. 6B. Evidently, the variation the b-strand peptide ligand of 1KTZ_B:A PTI is totally unfolded
profiles are considerably different between PMI peptide into disorder and gradually unbound from its partner along the
segments and PTI peptide ligands; the former (red bar) is simulations, suggesting that the context factor should play an
generally smaller than the latter (grey bar), indicating that the essential role in the conformation and binding of functional
same peptides have a low flexibility and small disorder in peptide segments in the two PMI systems.
protein context as compared to that out of the context. The Next, the conformational snapshots of the PMI peptide
difference is very significant for a few PMI/PTI systems such as segments and PTI peptide ligands extracted from MD equili-
1AK4_D:A, 2HLE_B:A, 1D6R_I:A and 1KTZ_B:Z; structural brium trajectories were clustered into single representatives. In
examination found that these peptides are highly flexible this way, the peptide region of a PMI/PTI system can be
segments and/or bound weakly to their partners, and context represented by three conformations: (i) the native crystal struc-
is therefore essential for conformational constraint of the ture, (ii) the MD equilibrium conformation cluster of the
intermolecular recognition and interaction involved in these peptide segment in PMI, and (iii) the MD equilibrium con-
systems. formation cluster of the peptide ligand in PTI. Here, the
Here, two typical PMI systems 1AK4_D:A and 1KTZ_B:A that differences (RMSD values) between the three conformations
are influenced significantly by context are discussed visually. for the 23 PMI/PTI systems are shown in Fig. 7A. As might be
Conformational snapshots of the two PMIs and their PTI expected, after dynamics equilibrium all the 23 peptide con-
counterparts were separately extracted at 0, 10, 20, 30, 40 and formations were changed largely in PTIs as compared to those
50 ns of MD trajectories and compared in ESI,† Fig. S2. The in PMIs relative to their respective crystal structures, imparting
functional peptide segments of 1AK4_D:A and 1KTZ_B:A are, that protein context can, more or less, address a conforma-
respectively, a loop and a short b-strand that can only tional constraint on peptide binding to their partner proteins,
Fig. 6 (A) Schematic representation of extracting conformational snapshots of PMI peptide segments and PTI peptide ligands from MD trajectories as
well as computing the RMSD variation over these snapshots. (B) Comparison of the RMSD variations of PMI peptide segments and PTI peptide ligands for
the 23 PMI/PTI systems.
although the constraint effect varies considerably over different The functional peptide segment of 7CEI_B:A is a long helix
systems. More importantly, by comparing MD equilibrium con- extending out of the active site; splitting and truncation of the
formations of the same peptides in PMIs and PTIs a significant helical peptide appear not to influence its core binding region,
difference is revealed for most systems (RMSD 4 2 Å), while only but would cause a significant effect on its two termini, that is,
a few samples have a moderate or modest conformational without context support the two termini become totally
difference at their dynamics equilibrium state (RMSD o 2 Å). disordered in PTI as compared to the structured helix in PMI.
Here, three typical examples that separately represent small, Consequently, the PMI peptide segments are very similar in
moderate and large context contribution to peptide conforma- both the native crystal structure and MD equilibrium confor-
tion in PMI/PTI systems 4JLR_S:HL, 7CEI_B:A, and 2MTA_C:A mation (RMSD = 0.87 Å), but the two conformations (in PMI)
are shown in Fig. 7B, in which the three distinct conformations differ moderately or considerably to the MD equilibrium con-
for a peptide are superposed in the active site of its partner formation of the peptide ligand in the PTI counterpart (RMSD =
protein. The helical hairpin peptide of 4JLR_S:HL is tightly 1.70 and 2.18 Å). In addition, the highly flexible loop peptide of
fixed by the intramolecular hydrogen bonds and hydrophobic 2MTA_C:A is also very similar in both the crystal structure and
contacts across the two helical arms of the hairpin, which MD conformation (red and blue) (RMSD = 1.21 Å), but different
define an independent, rigid structure for the peptide and thus largely relative to its PTI peptide counterpart (green) (RMSD =
is invulnerable to context impact, with small differences 3.07 and 3.27 Å), indicating a solid context contribution to the
between its three conformations (RMSD = 0.72, 0.84 and 0.65 Å). peptide binding in PMI.
Fig. 7 (A) Comparison of the differences (RMSD values) between the crystal structure and MD equilibrium conformation of PMI peptide segments,
between the crystal structure and MD equilibrium conformation of PTI peptide ligands, and between the MD equilibrium conformations of PMI peptide
segments and PTI peptide ligands for the 23 PMI/PTI systems. (B) Superposition of PMI peptide segments (crystal structure, red), PMI peptide segments
(MD equilibrium conformation, blue) and PTI peptide ligands (MD equilibrium conformation, green) in the active site of their partner proteins:
(B-a) 4JLR_S:HL, (B-b) 7CEI_B:A, and (B-c) 2MTA_C:A.
values. This is a common phenomenon that the absolute about 80% of the total interaction energy for PMIs using an
binding energy of protein–peptide interactions is generally empirical alanine scanning approach,20 which is considerably
overestimated by MM/PBSA calculations, and we also discussed higher than the F-values (30–60%) determined in this study.
this issue previously.33,39 In fact, the MM term overestimates This could be attributed to the facts that: (i) we only selected a
the favorable interaction energy between protein and peptide small set of typical PMI samples in which the peptide segments
due to its lack of electrostatic screening at the loosely packed were defined as those few residues of parent proteins that can
Published on 13 May 2019. Downloaded by MTA Research Centre for Natural Science on 5/27/2019 2:40:11 PM.
protein–peptide interface, while the PBSA term underestimates directly contact partner proteins, (ii) rigorous dynamics simula-
the unfavorable solvent effect since the hydrophobic contribu- tions and post energetics analysis were employed to accurately
tion cannot be described accurately using the very empirical calculate the binding energy of these PMI samples, and (iii)
surface area (SA) model. However, the calculated energy values peptide flexibility and the entropy effect were added to the
are also useful for comparing the relative binding potencies of energetics analysis.
PMI and its PTI counterpart, since the bias can be largely offset Second, the binding energetic components of peptide seg-
between different versions of the same system. ments in PMI relative to the same peptide ligands in PTI were
First, we discussed the energetic contribution of peptide examined. In order to straightforwardly observe the context
segment section to the whole PMI protein–protein interaction. effect on energetic components, the average value and standard
The total binding energy of the peptide segment (and its parent error (a.v. s.e.) of differences of interaction energy (DDEint),
protein) to its partner protein in a PMI system was calculated solvent effect (DDGslv), entropy penalty (TDDS), and total
with or without consideration of entropy penalty (DGttl = DEint + binding energy (DDGttl) between the PMI peptide segment
DGslv or DGttl = DEint + DGslv TDS), and the fraction of PMI and PTI peptide ligand were computed over the 23 investigated
total binding energy contributed by the peptide segment can be PMI/PTI systems (see ESI,† Fig. S3). It is seen that the parent
derived as F = DGttl(segment–partner)/DGttl(parent–partner) context can influence the interaction energy and solvent effect
100%. The resulting F-values are plotted in Fig. 8, which range of peptide–partner binding considerably, with the a.v. s.e. of
between 30 and 60% over the 23 investigated PMI systems, 33.6 16.0 (DDEint) and 39.2 19.8 (DDGslv) kcal mol1,
suggesting that these functional peptide segments (within their while the context can also contribute moderately to the
parent context) can only contribute to about a half or less of the entropy penalty and total binding energy, with a.v. s.e. of
total binding affinity between the parent and partner proteins 14.5 7.3 (TDDS) and 8.9 3.7 (DDGttl) kcal mol1. As might
of PMI systems, and other regions out of the peptide segments be expected, without context support the peptides generally
(parent context) should also exert potential interactions with exhibit larger conformational disorder and weaker packing
the partners through, for example, long-range electrostatic tightness against their partner proteins, thus impairing the
forces and marginal van der Waals contacts. In addition, interaction energy of peptide–partner binding (viz. DDEint 4 0,
consideration of conformational flexibility and the entropy unfavorable) but also reducing the desolvation effect associated
penalty (TDS) can further decrease F-values for most samples, with the binding (viz. DDGslv o 0, favorable). Moreover, stripping
albeit the decrease is not solid (DF o 10%). Previously, Stein of the context constraint can also enhance peptide flexibility in
and Aloy estimated that interfacial peptides can contribute an unbound state, thus incurring increased entropy penalty
Fig. 8 Fraction (F-value) of the total binding energy of 23 PMI systems contributed by peptide segments. The total binding energy was calculated with or
without consideration of entropy penalty (DGttl = DEint + DGslv or DGttl = DEint + DGslv TDS).
Fig. 9 Ratio (R-value) of the total binding energies of 23 PMI systems to their PTI counterparts, i.e. DGttl(PMI)/DGttl(PTI). The complex structures of three
systems 1GHQ_B:A, 1AVX_B:A and 1KTZ_B:A with the highest R-values are shown on top.
Fig. 10 (A) Crystal structures of full-length Src (A-a), Abl (A-b) and Hck (A-c) kinases in an autoinhibitory state, in which the SH3 domain and PPII peptide
are highlighted. (B) Heatmap of the total binding free energies of three PPII peptides binding to three SH3 domains. The diagonal elements represent
cognate interactions, otherwise, noncognate: (B-a) the binding is within the intact kinase protein context (SH3–peptide segment), and (B-b) the binding is
out of the context (SH3–peptide ligand). (C) Mean selectivity diagram of three PPII peptides with (C-a) or without (C-b) context.
sampling during MD simulations. As can be seen in ESI,† Fig. S4, difference can be observed between the two heatmap profiles,
the peptide segment in a kinase context has a low disorder that is, the PPII peptides can interact with SH3 domains with a
(RMSD variation = 0.21 Å over the simulations), which can be generally high affinity in context as compared to that out of the
well structured in the PPII helical configuration that is suited context, no matter whether the interactions are cognate or
for interacting with the SH3 domain. In contrast, the split noncognate. This is consistent with the above analysis that
peptide ligand is highly flexible (RMSD variation = 0.89 Å context can effectively enhance the binding potency of peptide
over the simulations) and cannot hold in the PPII configuration segments to their partner proteins. However, the two profiles
and would incur a large entropy penalty upon binding to the impart a different story about peptide selectivity: in the former
SH3 domain. This is in line with our previous investigation of the three cognate SH3–PPII interactions (diagonal elements)
conventional protein–peptide interactions, which were found to generally have a high affinity relative to those noncognate
be co-dominated by direct readout of the intermolecular inter- interactions (non-diagonal elements), whereas in the latter no
action between the protein receptor and peptide ligand, and by obvious difference between the cognate and noncognate inter-
indirect readout of the entropy penalty upon the interaction.55 actions can be appreciated, implying that the context not only
The SH3 domains and PPII peptide segments are high- enhances peptide affinity to their cognate partners, but also
lighted in the crystal structures of full-length Src, Abl and improves peptide selectivity for cognate over noncognate.
Hck kinases in an autoinhibitory state (Fig. 10A). Based on In addition, the mean selectivity Ŝ of three PPII peptides for
the kinase crystal structures six noncognate domain–peptide their one cognate and two noncognate SH3 domains was
interactions between the three SH3 domains and three PPII derived from the systematic binding energy profiles, which
peptides were modeled and their binding energies were calcu- are separately plotted as two diagrams in Fig. 10(C-a) and
lated and visualized as two heatmaps in Fig. 10B, in which the (C-b). The mean selectivity of Src, Abl and Hck PPII peptides is
(B-a) and (B-b) profiles represent the interactions in the 18.2, 77.5 and 28.5-fold (global selectivity = 34.2-fold c 1-fold)
presence and absence of kinase context, respectively. A clear in the full-length kinase, respectively, suggesting that these
peptides have high specificity with their context support. In 19 S. Panni, L. Montecchi-Palazzi, L. Kiemer, A. Cabibbo,
contrast, the mean selectivity is largely impaired or even reversed S. Paoluzi, E. Santonico, C. Landgraf, R. Volkmer-Engert,
to 6.0, 0.74 and 1.2-fold (global selectivity = 1.7-fold close to A. Bachi, L. Castagnoli and G. Cesareni, Proteomics, 2011,
1-fold), respectively, when the peptides were split from the 11, 128–143.
context and bind independently to SH3 domains. 20 A. Stein and P. Aloy, PLoS One, 2008, 3, e2524.
21 Z. Bai, S. Hou, S. Zhang, Z. Li and P. Zhou, J. Chem. Inf.
Published on 13 May 2019. Downloaded by MTA Research Centre for Natural Science on 5/27/2019 2:40:11 PM.
P. Cieplak, J. Srinivasan, D. A. Case and T. E. Cheatham, 50 T. Löhr, A. Jussupow and C. Camilloni, J. Chem. Phys., 2017,
Acc. Chem. Res., 2000, 33, 889–897. 146, 165102.
44 D. Case, Curr. Opin. Struct. Biol., 1994, 4, 285–290. 51 G. T. Heller, F. A. Aprile, M. Bonomi, C. Camilloni, A. De
45 T. Hou, K. Chen, W. A. McLaughlin, B. Lu and W. Wang, Simone and M. Vendruscolo, J. Mol. Biol., 2017, 429,
PLoS Comput. Biol., 2006, 2, e1. 2772–2779.
46 G. G. Krivov, M. V. Shapovalov and R. L. Dunbrack, Proteins, 52 E. Karlsson, E. Andersson, J. Dogan, S. Gianni, P. Jemth and
Published on 13 May 2019. Downloaded by MTA Research Centre for Natural Science on 5/27/2019 2:40:11 PM.