James Franck Institute and Department of Chemistry, The University
of Chicago, Chicago, Illinois 60637 USA
Met-enkephalin is one of the smallest opiate peptides.
Yet, its dynamical structure and receptor docking mechanism are still not well understood. The conformational dynamics of this neuron peptide
in liquid water are studied here by using all-atom molecular dynamics
(MD) and implicit water Langevin dynamics (LD) simulations with AMBER
potential functions and the three-site transferable intermolecular
potential (TIP3P) model for water. To achieve the same simulation
length in physical time, the full MD simulations require 200 times as
much CPU time as the implicit water LD simulations. The solvent
hydrophobicity and dielectric behavior are treated in the implicit
solvent LD simulations by using a macroscopic solvation potential, a
single dielectric constant, and atomic friction coefficients computed
using the accessible surface area method with the TIP3P model water
viscosity as determined here from MD simulations for pure TIP3P water.
Both the local and the global dynamics obtained from the implicit
solvent LD simulations agree very well with those from the explicit
solvent MD simulations. The simulations provide insights into the
conformational restrictions that are associated with the bioactivity of
the opiate peptide dermorphin for the
-receptor.
 |
INTRODUCTION |
Biological "processes" are per force
dynamical in nature. For example, the dynamical motions of certain
sections of RNA molecules and enzymes determine their key biological
functions. However, rather little theory is available at the atomistic
level concerning the long time dynamical behavior of flexible natural
and synthetic macromolecules. In principle, the long time dynamics of
biomolecules are governed by the complete equations of classical
mechanics that involve realistic atomistic potentials and that include
the explicit presence of solvent molecules. Although it is impossible to solve these equations with realistic atomistic potential functions for a macroscopic system with 1023 degrees of freedom, the
molecular dynamics (MD) method provides a computationally feasible way
to investigate the N-body system on a miniature scale. However, even
with increasing computing power, the evaluation of the long time (>10
ns) behavior of large scale biological systems becomes prohibitive. As
a measure of the current state of the art, the use of the most powerful
Cray T3E supercomputer has pushed the simulation time for the
36-residue villin headpiece (HP-36) up to a remarkable one microsecond
trajectory (Duan and Kollman, 1998
). In contrast, most current MD
simulations for larger realistic systems are usually restricted to a
few nanoseconds or picoseconds. Moreover, these atomistic MD
simulations generate coordinates and velocities for all solvent and
solute atoms and, therefore, produce overwhelmingly more information
than is hopefully needed (Kostov and Freed, 1999
). Thus, it is
important to develop physically significantly more efficient methods
for determining the long time dynamical properties of biosystems. In
the present paper, we develop an implicit solvent method that greatly
reduces the required computational cost, and we test this method
against full explicit solvent MD dynamics for the small flexible
peptide Met-enkephalin. This development facilitates important
investigations of the link between the peptide structure, dynamics, and
biological function.
Greater efficiency in peptide dynamics computations is necessary to
resolve many fundamental questions associated with understanding how a
ligand that lacks a fixed shape recognizes several different molecular
receptors and exerts multiple significant biological functions. Several
structurally flexible proteins exhibit physiological significant roles.
One example is provided by the
-amyloid protein, which causes the
neurobiological disorder, Alzheimer's disease. The 42-residue peptide
displays no dominant single structure, but its structure varies with
temperature, solvent, and solution acidity (Barrow and Zagorski, 1991
;
Kirshenbaum and Daggett, 1995
). In the appropriate physical
environment, the
-amyloid aggregates into a
-sheet conformation
and forms the plaque that is one consequence of the disease process. A
recent investigation indicates that even the well-studied globular
protein myoglobin can aggregate and form plaque under the appropriate
conditions of temperature and pH (Fändrich et al., 2001
). A first
step of understanding the physical process behind these phenomena lies
in developing theories to describe the dynamics of a single flexible
peptide in aqueous solution.
Our implicit water dynamics method is tested for the dynamics of
Met-enkephalin (Tyr-Gly-Gly-Phe-Met), which is one of the smallest
neurotransmitter peptides. Since the first isolation and identification
of this molecule in 1975 from pig brains (Hughes et al., 1975
), it has
been found to be plentiful in nerve terminals. Met-enkephalin has been
investigated extensively with many different techniques, including
x-ray crystallography (Smith and Griffin, 1978
), nuclear magnetic
resonance (Roques et al., 1976
; Deber and Behnam, 1984
), and molecular
simulations (Hruby et al., 1988
; Wang and Kuczera, 1996
). Enkephalins
and endorphins all originate from the larger protein,
-lipotropin
(Stryer, 1988
), which produces Met-enkephalin,
-endorphin, and
-endorphin. The five-amino acid sequence of Met-enkephalin is shared
by all larger endorphins as their headpiece. The enkephalins and
endorphins play diverse roles in biological systems, such as pain
inhibition, aiding immune response, and producing tolerance and
dependence (Plotnikoff et al., 1986
). Inside the human body, there are
µ,
, and
opiate receptors, to which the enkephalins and
endorphins bind as ligands. Among these three types of receptors, the
-receptor is more sensitive to the enkephalins. The enkephalins and
endorphins are usually produced while the organism is under mental or
physical stress. The ligand-receptor binding modifies the neural
signal by changing the potassium ion conduction (Pasternak, 1988
).
In accord with the general trend that short peptide chains do not adopt
a single native structure, experimental studies show that
Met-enkephalin does not adopt a single conformation in aqueous solution
(Graham et al., 1992
). The first computation of an energetically minimized structure for Met-enkephalin in vacuo using a Monte Carlo
searching algorithm is based on the empirical conformational energy
program for peptides model potentials (Isogai et al., 1977
; Paine and
Scheraga, 1985
). Later computations using ECEPP/2 potential function
and a simple implicit water model do not predict a single dominant
structure either (Li and Scheraga, 1987
). A combined multidimensional
NMR and molecular modeling study (Graham et al., 1992
) likewise
indicates the absence of a distinguishable secondary structure for
Met-enkephalin in aqueous solution. When dissolved in 50 mM of sodium
dodecyl sulfate micelles, Met-enkephalin adopts a
-turn structure
that is stabilized by a phenyl-phenyl hydrophobic interaction. Despite
these findings, many aspects of both the structure and dynamics of
Met-enkephalin in aqueous solution remain unresolved.
The next section describes the MD and Langevin dynamics (LD) methods
that are applied to treat Met-enkaphalin dynamics with explicit and
implicit solvent models, respectively, using the AMBER95 (Cornell et
al., 1995
) force field. A variety of implicit solvent models is tested
by comparing the explicit and implicit solvent dynamics without the use
of adjustable parameters. MD simulations are performed to estimate the
viscosity of the three-site transferable intermolecular potential
(TIP3P) (Jorgensen et al., 1983
) water model that is used in the
explicit solvent MD simulations because this viscosity is a necessary
input parameter for the implicit solvent treatment of friction
coefficients. In Results and Discussions, the various equilibrium
structural properties generated by the explicit water MD simulations
are compared with those obtained from a set of different implicit water
LD simulations. The local and global time-correlation functions
obtained from these two types of simulations are compared. The only
variables involved in the comparison are the choices of the solvation
potential, the solvent viscosity, and dielectric constant, but these
are prescribed by various methods from experiment or simulations. A
subsequent paper will apply our mode-coupling theory of long time
peptide dynamics (Kostov and Freed, 1999
) to Met-enkephalin for
comparison with both the MD and LD simulations. (M.-y. Shen and K. F. Freed, in preparation).
 |
COMPUTATIONAL DETAILS |
Explicit water simulations
The explicit water MD simulation is based on a total system energy
that is the sum of different types of individual contributions,
|
(1)
|
and that includes both solvent and solute molecules. The explicit
water simulations are performed by using the standard AMBER95 force
field and the TIP3P model for water. The simulation is carried out
using a modified version of the TINKER 3.7 molecular design package
(Ponder et al., 1999
). The bonding (Ub) and
bond-bond bending (Ubend) interactions are
modeled by harmonic potentials, and the regular and improper torsional
energies (Utors,
Uimp-tors) are described with the standard periodic
functions. The charge-charge interaction is expressed in terms of
atomic partial charges qi,
|
(2)
|
where
is a fixed dielectric constant that represents the
screening of Coulomb interactions in liquid water, and
rij is the interparticle distance between atomic
charges i and j. In the explicit water MD
calculations,
is set to unity for all charge-charge interactions.
The van der Waals energy (UvdW) is evaluated by
using Lennard-Jones 6-12 potentials.
The MD simulations are generated by integrating the atom positions and
velocities by using the standard velocity Verlet algorithm (Allen and
Tildesley, 1987
),
|
(3)
|
|
(4)
|
where ri(t),
vi(t), and
ai(t) denote the particle positions,
velocities, and accelerations, respectively, at time t. The
integration time step is chosen as
t = 2 fs. The simulated system includes 186 solvating TIP3P water molecules in a
cubic, periodic boundary box with 18.6-Å sides. Four 5-ns and two
10-ns MD trajectories (total duration 40 ns) are used to compute the
time correlation functions and equilibrium averages. An 8-Å nonbonding
force cutoff is applied, and the temperature is controlled at 300 K
using a Berendsen-type thermal bath coupling (Berdensen et al., 1984).
Implicit water simulations
The total energy expression for the implicit solvent model differs
slightly from that of the explicit water model because the explicit
solvent-solute and solvent-solvent interactions in Eq. 1 are replaced
by dielectric screening (Uch(
)) and solvation terms (Usolv(
)),
|
(5)
|
Unlike the explicit water simulation,
Utot-imp for the implicit water treatment only
depends on the solute coordinates. The Ub,
Ubend, Uimp-tors,
Utors and the solute-solute portion of
UvdW are identical for both the explicit water
and the implicit water models. Two separate dielectric screening
constants are tested for the solute-solute interactions described by
Uch, namely,
= 78.5, the experimental
dielectric constant of water at 300 K, and
= 53.1, the lowest
theoretical value for simple point charge (SPC) water at 300 K (Smith
and van Gunsteren, 1994
). The choice of
= 53.1 provides an
estimated lower bound to the TIP3P dielectric constant because of the
similarity between the TIP3P and SPC models for water. The AMBER95
parameter set is also used for all solute-solute interactions.
A significant distinction between the explicit and implicit water
models is the presence of the macroscopic solvation potential Usolv(
). Two solvation potential functions
are utilized, the atomic solvation parameter (ASP) (Eisenberg and
McLachlan, 1986
; Wesson and Eisenberg, 1992
) and the Ooi-Scheraga
solvent-accessible surface area (SASA) (Ooi et al., 1987
) potentials,
although a third solvation potential, the generalized Born-surface area
(GB/SA) potential (Qiu et al., 1997
), is found to be wholly
unsatisfactory in performance and calculated results. In both solvation
potential treatments, a contact free energy is evaluated in terms of
the accessible surface areas (ASA)
i of all atoms
i in the peptide. Thus, the solvation free energy is a sum
of individual atom contributions,
|
(6)
|
where the empirical atom solvation energy parameters
gi have been determined by fitting experimental
aqueous solvation free energies of amino acids and selected organic
compounds to Eq. 6. Because our simulation uses the TIP3P model for
water, the empirical constants gi determined for
real water may not be completely adequate for the TIP3P model water,
producing one unavoidable inherent limitation to our tests of implicit
water simulation models. A more thorough test would require the fitting
of the gi parameters to simulations performed
for many amino acid-TIP3P water systems, but the amount of work
involved would be enormous and the value of this effort would only be
in providing a more stringent test of our theory. Obviously, of much
greater importance are improved solvation potentials that provide
superior fits to experiment. In this regard, another proposed atomic
solvation parameter optimization procedure (Baysal and Meirovitch,
1997
, 1998
) uses experimental native structures in solution as a motif and adjusts the solvation potential parameters to the global energy minimum. This method is a promising alternative for future
reparameterizations of experimental data. Except for the hydrogen
atoms, whose presence is ignored in computing the
i
(Eisenberg and McLachlan, 1986
), the solvent accessible surface area
i is calculated from the exposed surface area of
intersecting spheres centered on each atom, using a probe sphere of
radius 1.4 Å, which corresponds to the effective radius of a water
molecule. Thus, when a pair of peptide atoms are too close for a
solvent molecule to fit between them, no solvation free energy is
applied to their contact energies; only the van der Waals and
electrostatic interactions remain. The combination of these two
features contributes to the description of the hydrophobic effect. Of
the two sets of ASP parameters available in literature, we use the set
determined by Wesson and Eisenberg (1992)
. The SASA method of Ooi et
al. (1987)
uses solvation free energy determined by a similar strategy
but applied to a slightly larger classification of atom types, and,
hence, the SASA model produces a larger number of parameters in the set
{gi}.
In contrast to the explicit water model where the dynamics of both
solvent and solute atoms must be considered and where the solute
constitutes roughly 10% of the overall atoms, the implicit water model
only describes the motion of the solute atoms. Thus, the evaluation of
Utot-imp is much faster than that of
Utot-exp. Our tests are designed to determine
whether the much faster implicit solvent simulations accurately
reproduce the far more expensive explicit water MD simulation.
Oliva et al. (2000)
have also used the SASA solvation potential model
to compute forces within an implicit solvent generalized Langevin
equation (GLE) model. Although this feature coincides with one aspect
tested by us, their approach describes the frictional effects of the
solvent using memory functions whose computation slows down the GLE
computations with respect to those with an explicit SPC water
simulation by a factor of seven. When the memory function is taken as
fixed (from a prior computation), their implicit GLE treatment is only
a factor of three faster than the explicit water simulation. In
contrast, we use simple friction coefficients in our implicit solvent
approach, enabling us to achieve orders of magnitude speed ups of the
implicit solvent LD simulations compared to explicit solvent (TIP3P
water) MD simulations. Another difference between our work and that of
Oliva et al. (2000)
arises because they focus on simulations for an
equilibrium property, the native structure of a folded protein, whereas
we compare the explicit and implicit solvent models for both
equilibrium and dynamical properties of a highly flexible peptide. A
future work will describe the application of our methods to study the
stability of the native structure of the villin headpiece, where,
again, both equilibrium and dynamical properties can be computed and the speed enhancement over explicit water simulations exceeds a couple
of orders of magnitude (M.-y. Shen and K. F. Freed, submitted for publication).
Our main modifications to the TINKER package are associated with the
computation of friction coefficients and the solvation potential. The
friction coefficients are either set as constants throughout the LD
trajectory or can be updated periodically with the procedure suggested
by Pastor and Karplus (1988)
. The velocity Verlet algorithm (Allen and
Tildesley, 1987
) is also used in the LD calculations. The LD algorithm
is similar to the algorithm given by Eqs. 3 and 4 for MD simulations
but includes factors c0, c1, and
c2, arising from the frictional forces due to
the implicit water molecules, and the random term necessary by virtue of the fluctuation-dissipation theorem to assure the proper equilibrium limit at long times,
|
(7)
|
|
(8)
|
with
|
(9)
|
|
(10)
|
|
(11)
|
where mi is the mass of atom i.
The rgi and vgi are
Gaussian random variables with variances depending on the friction
coefficients in the usual fashion (Allen and Tildesley, 1987
). The
friction coefficients
i used in the Langevin dynamics simulation are determined from the solvent-accessible surface area
'i with zero probe radius. Thus, we calculate the
friction coefficient
i using stick boundary conditions,
|
(12)
|
where
is the solvent viscosity and
reff;i is the effective hydrodynamic radius of
atom i that is computed from
'i, using reff;i = (
'i/4
)1/2.
We contrast two types of LD simulations, called the dynamical and
static LD simulations. In the dynamical LD simulations, the accessible
surface area, and, hence, the atomic friction coefficients, are updated
every 100 dynamical steps, whereas, in the static LD simulations, the
friction coefficients remain constant throughout the simulation. The
solvation potential is updated every 100 dynamical steps (0.25 ps in
physical time), which speeds up the LD simulations by a factor of 4 over LD simulations with the solvation potentials recomputed at each
step. To test the adequacy of updating the solvation potential every
100 steps, we have generated a 30-ns test trajectory for a case in
which the solvation potential has been calculated at every step. These
two procedures lead to nearly identical distributions for the radius of
gyration and sufficiently similar end-to-end vector dynamics.
Appreciable enough changes appear in the atomic solvent-accessible
surface areas and, hence, in the solvation-free energy only when major
structural deformations occur in solute. The 0.25-ps interval between
updates of the solvation potential is quite short compared to the time
scales for the occurrence of large structural changes during the
Met-enkephalin dynamics. No tests have been performed to determine the
maximum acceptable update interval, but larger intervals (and faster
simulations) should be possible.
The constant friction coefficients for the static LD simulations are
computed by averaging ASA friction coefficients over a 6-ns sample
trajectory obtained after equilibration. The integration time step
t for the LD simulation is chosen as
t = 2.5 fs. Eight LD trajectories of 60 ns each (total duration of
trajectories is 480 ns) are used for the evaluation of time-correlation
functions (TCF) and equilibrium averages. No nonbonding force cutoff is applied in the LD simulations. The LD algorithm is used instead of a
pure van Gunsteren-Berendsen Brownian dynamics (BD) algorithm (van
Gunsteren and Berendsen, 1982
) because the BD algorithm applies only
for very high atomic friction coefficients, whereas some of the
hydrogen atomic ASA friction coefficients are small. Therefore, using
the BD algorithm in this case would severely shorten the feasible
integration time step to less than 0.1 fs in our simulations where
hydrogen atoms are explicitly treated.
Determination of TIP3P water viscosity
The viscosity of the surrounding medium is an essential parameter
for calculating the atomic friction coefficients (see Eq. 12). In pure
BD simulations, the viscosity only affects the overall time scale for
all of the dynamics. Therefore, the equilibrium time correlation
functions for different choices of the solvent viscosity may readily be
interconverted simply by rescaling the time. However, the dependence on
the solvent viscosity is more complicated for the LD simulations in
which inertial forces are retained in the equations of motion. Below,
we investigate the extent to which LD dynamics with different
viscosities may be interconverted by the time-rescaling procedure
available for the BD dynamics. To compare the LD and full MD
simulations, the shear viscosity for the TIP3P water is required.
Because this viscosity is unavailable for the conditions used in our MD
simulations, the viscosity is calculated from a MD simulation using a
system containing 216 TIP3P water molecules in a cubic box with a side of 18.6 Å and periodic boundary conditions. This system has the same
temperature and water density (1.003 g/cm3) as in the
Met-enkephalin MD simulations.
The standard simulation methods for bulk systems represent the shear
viscosity in terms of the off-diagonal components of the pressure
tensor,
|
(13)
|
where V is the total volume, and
mi, ri
,
pi
, and fi
are the
masses, the components (
= x, y, z) of the
particle positions, momenta, and forces acting on atom i,
respectively. We determine the shear viscosity of TIP3P water from the
Einstein-Helfand relation (Helfand, 1960
),
|
(14)
|
where the angular bracket
···
denotes the equilibrium
average
|
(15)
|
Evaluating the viscosity from Eq. 14 is superior to that using the
Green-Kubo expression (Zwanzig, 1965
)
|
(16)
|
because the numerical derivative of the square of the time
integral of the equilibrium-averaged pressure-tensor element in Eq. 14
involves a function that is much better behaved than the integrand in
the Green-Kubo expression. The presence of the infinite limit in Eq. 14 does not cause numerical difficulties because the derivative
d
G(t)/dt converges quickly as time grows. In
contrast, the Green-Kubo formulation requires the time integration of
a fluctuating function in Eq. 16 when the upper limit of integral is
large. Calculations of the time-correlation function in Eq. 16 have
large uncertainties as t grows, making the Green-Kubo-type integral difficult to converge numerically. Use of a short time simulation deteriorates the situation due to the large statistical errors in the time-correlation function. These errors are proportional to
, where the
is the correlation time for the time-correlation function in the integrand of Eq. 16 and T is the total duration of the simulations. Thus, when
T is too short, a large error appears in the pressure
time-autocorrelation function of Eq. 16 and renders the integral
difficult to evaluate accurately. Because all-atom MD simulations
require considerable computer time, the total simulation duration
T is rather small.
The energy of the TIP3P water system is first energy minimized and then
pre-equilibrated for 100 ps before running a 600-ps water MD simulation
with a 2-fs step size. The three off-diagonal elements of the pressure
tensor P
(t) are recorded and averaged at
every step. The function
G(t) is computed by numerical integration and the viscosity is obtained from the slope of the integrated function in Eq. 14.
Description of different simulations
To assess the accuracy of various approximations inherent in the
implicit water simulations, we have performed a large number of "mix
and match" LD simulations. For brevity, the following refers to each
set of simulations with the acronym ASPS to indicate simulations with
LD dynamics/static friction/ASP solvation potential/experimental dielectric constant for water; ASPD to indicate LD dynamics/dynamic friction/ASP solvation potential/experimental dielectric constant for
water; ASPE to indicate LD dynamics/static friction/ASP solvation potential/theoretical dielectric constant (53.1); SASA to indicate LD
dynamics/static friction/SASA solvation potential/experimental dielectric constant for water; ASPL to indicate LD dynamics/static friction/ASP solvation energy/experimental dielectric constant for
water/TIP3P water viscosity; NASP to indicate LD dynamics/static friction/no solvation potential/experimental dielectric constant for
water; and MD to indicate the all atom molecular dynamics simulations.
 |
RESULTS AND DISCUSSIONS |
Parameter determination and computation efficiency
We begin by describing the simulations for the TIP3P water
viscosity because this provides a central parameter for computing the
friction coefficients. Figure 1 presents
the equilibrium average of
G(t) from Eq. 13. The short
time (t < 0.3 ps) behavior of this function is highly
nonlinear, but the linearity of
G(t) emerges for
t > 0.3 ps and leads to a nearly constant first
derivative d
G(t)/dt over this time range. The
average TIP3P water viscosity
evaluated from this derivative (for
t = 4-10 ps) is 0.506 ± 0.043 cp, which is
~60.2% of the experimental value (
exp = 0.84 cp)
at 300 K and 1 atm. A similar discrepancy between the calculated TIP3P
water viscosity and experimental measurements is reported by Feller et
al. (1996)
, who have estimated
= 0.62 ± 0.08 cp, at
T = 293 K whereas
exp = 1.0 cp.
Therefore, the TIP3P water viscosity of Feller et al. is also only 62%
of the experimental value.

View larger version (0K):
[in this window]
[in a new window]
|
FIGURE 1
G(t) (in g2/(amu ps Å)2) evolution with time (in fs).
(A) Short time behavior (0-2 ps). (B) Long time
behavior (0-10 ps).
|
|
Table 1 summarizes the calculated average
atomic friction coefficients, along with their standard deviations,
that are evaluated for atoms in Met-enkephalin from a short 6-ns
trajectory by using the LD simulation with ASP solvation potential.
Figure 2 depicts the numbering used in
Table 1 and descriptions below for heavy atoms in Met-enkephalin. The
hydrogen atoms bonded to the nitrogen and oxygen atoms have vanishing
solvent-accessible areas and, therefore, zero friction coefficients
because they are embedded within the van der Waals radii of the central
atoms. This is one reason for using the LD rather than BD simulations,
because vanishing atomic friction coefficients are not permitted in BD
simulations. The friction coefficients of most heavy atoms fluctuate by
less than 3% during the 6-ns simulation trajectory, and only few of them exhibit larger variations (about 6.5%), especially for those atoms bonded to a pair of heavy groups, such as the C
atoms of tyrosine, phenylalanine, and methionine. The larger variation arises because an atom bonded to a pair of heavier groups experiences greater fluctuating forces that produce larger distortions in the local
geometry and thereby affect the accessible surface area.
View this table:
[in this window]
[in a new window]
|
TABLE 1
The average atomic friction coefficients (divided by mass)
(in ps 1) and standard deviations for selected heavy atoms
|
|
The implicit water LD simulations provide a huge saving in
computational time over explicit solvent MD simulations on a single CPU
machine. Most of the MD and LD calculations are performed with Intel
Pentium III 750 MHz or AMD Athlon (K7) 800 MHz Linux systems. On the
Intel machines, the typical elapsed times for a 60-ns LD simulation
with implicit solvent is ~20 h, whereas the full MD takes more than
660 h for just a 10-ns run. The AMD processors yield slightly
better performance for the LD simulations and are expected to be
identical to the Pentium III systems for the MD simulations. The total
duration T of the trajectories is a crucial parameter
determining the accuracy of computed time correlation functions
C(t), with the error in C(t) decreasing as
T
1/2. Thus, to achieve comparable qualities of
computed time correlation functions, a MD simulation of 60 ns would
require 200 times the CPU time needed for a 60-ns LD simulation.
Because the LD simulations are performed without a cut-off, whereas the
MD simulations introduce a time-saving cut-off on nonbonding
interactions, the ratio would grow substantially if a cut-off is not
applied in the MD simulations.
The most time-consuming step in the dynamics simulations involves the
evaluation of atomic forces. Combining the use of a cut-off on the
nonbonded interactions with the use of analytical derivatives and Ewald
summation methods implies that the computational time t for
the explicit solvent MD calculations typically scales with the number
of atoms N from t ~ N to t ~ N2. The implementation, for instance, of high
performance MD codes, such as GROMACS (Berendsen, 2001
), has reduced
the computational time to scale nearly linearly with N. In a
MD simulation of a fully solvated system, the simulation box normally
has many more solvent molecules than solute atoms, and the number of
solvent molecules increases as the cube of the box size. The minimum
box size for an explicit solvent simulation should be at least as large
as the average radius of gyration of the polymer molecule. Thus, when
the polymerization index of a given polymer is increased by a factor of
P, the minimum explicit water box size for the MD simulation
should at least be multiplied roughly by P1/2.
Hence, the total number of solvent and solute atoms grows by a factor
of roughly P3/2. Even assuming an efficient
linear scaling of t with N, the resulting scaling
t ~ P3/2 for the MD simulations
unfavorably contrasts with the scaling t ~ P for the
LD simulations. Therefore, the obvious advantages of the implicit
solvent model in treating the long time dynamics should be explored
further for larger systems.
Distributions for structural properties
In this section, we compare the distributions for several
structural properties that are calculated by averaging along the trajectories obtained from different simulation methods and solvation models. These equilibrium structural averages for the peptides are
dictated by the model potential functions. For the implicit water
simulations, we test two different dielectric constants and two
macroscopic solvation potentials to mimic the influence of the solvent
on the solute dynamics in the explicit water MD simulations. A
comparison of the equilibrium structural averages calculated from the
implicit water LD simulation and from the all-atom MD simulations
provide the first portion of an optimal test (with no extra adjustable
parameters) for the adequacy of using the implicit solvation potentials
to model the influence of the TIP3P water on Met-enkephalin dynamics.
One useful statistical measure for comparing different simulation
methods is provided by the probability distribution for the radius of
gyration (Rg) of Met-enkephalin,
|
(17)
|
where rg is the position of the
center of gravity of Met-enkephalin, and the sum runs over all atoms in
the solute. The square of the radius of gyration
(R
) is computed for each recorded time
point for all dynamical trajectories, and histograms of
R
are constructed by binning the values
into 100 discrete intervals that range between 0 and 60 Å2
for R
. Figure
3 A-D, present the
R
probability distributions computed from
the MD, ASPS, SASA, and NASP simulations, and Fig. 3 E
compares all four R
distributions. The
maximum R
found for Met-enkephalin is
~60 Å2, and certain aspects of the distributions can be
understood from a more detailed examination of the Met-enkephalin
trajectories. In particular, there are three classes of dominant
configurations: the "extended" state in which
R
lies in the range of 50-60
Å2, the "semi-packed" state with
R
~20-40 Å2, and the
"packed" state where R
15-20 Å2.

View larger version (0K):
[in this window]
[in a new window]
|
FIGURE 3
R (in Å2)
distribution from different simulation methods. (A) Full
atom MD. (B) ASP solvation potential. (C) SASA
solvation potential. (D) No solvation potential.
(E) A combined plot.
|
|
We now compare the range of variation of R
between the different simulations. Globally,
R
oscillates between 15 and 55 Å2 during the MD simulation, and the ASPS trajectories
display a similar range for R
. Inspection
of the time evolution of R
obtained from
the MD simulation indicates that Met-enkephalin stays in the packed and
semi-packed states most of the time and only occasionally jumps to the
extended state. The distribution for R
suggests the existence of a rapid interconversion between the packed
and semi-packed states and a slower transition process between them and
the extended state, a process that is verified by viewing the time
course of LD trajectories and R
. This
physical picture is supported by the presence of two peaks at 21 and 27 Å2 in Fig. 3 A and from the shoulder in the
extended region near 40-50 Å2. The latter feature implies
a higher probability for being in the transition state leading to the
more stretched configurations. The R
distribution extracted from the ASPS trajectories (see Fig.
3 B) differs from that illustrated in Fig. 3 A
for the MD simulations by having a slightly lower occurrence frequency
for the semi-packed state. The R
distribution from ASPS simulations is single-peaked as opposed to the
double-peaked structure found for the MD simulations. The SASA
simulations display a similar behavior but with broader structure and a
shifted maximum in the R
distribution of
Fig. 3 C. One peak is present in the SASA distribution with a maximum at 25 Å2, which lies between the MD maxima of 21 and 27 Å2. The SASA R
distribution is wider than that for the ASPS trajectory and has greater
population for the extended state, in better agreement with the MD
distribution. The SASA model uses a more detailed classification for
types of units, and this feature apparently is responsible for the
improved agreement of the LD simulation using the SASA solvation
parameter set with the MD simulations. The
R
probability distribution calculated from
a single 60-ns trajectory by using SASA model solvation potential also
exhibits a double-peaked structure (which converts to a single peak
after averaging over eight trajectories) that is similar to that of the
MD simulations. This indicates that the double-peaked structure from
the MD simulation may disappear for averages over longer trajectories.
The R
distribution from the NASP
simulations is presented in Fig. 3 D and displays a
narrower single-peaked structure with a restricted
R
fluctuation range between 15 and 40 Å2 that provides a poor representation of the MD
distribution. The comparison of all the R
distributions in Fig. 3 E demonstrates the necessity for
using a solvation potential so the LD simulations adequately
approximate the R
distribution from the MD
simulation. Because no extended state conformations appear in the NASP
simulations, the addition of the solvation potentials helps stabilize
the extended state configuration, which has a maximal charged-atom
surface area that is stabilized by the solvation potentials. Moreover,
the lack of appearance of extended conformations in the NASP
simulations severely degrades its description of the long time
dynamics, especially the global dynamics that are discussed in the next subsection.
Tourwé et al. (1996)
have studied the conformational restrictions
in the opioid peptide dermorphin that are necessary for the bioactivity
of its binding to the
receptor. The peptide must be in the
trans conformation for the
C
-C
bond of both the Tyr and Phe
residues for maximum binding strength. The gauche(
)
conformation of Tyr greatly weakens the receptor docking affinity,
decreasing the
-receptor binding affinity by a factor of 300. The
gauche(+) conformers for both Tyr and Phe are likewise not
favorable for
-receptor binding. Fixing either of the Tyr or Phe
residues in the trans conformation elevates the bio-activity for docking with the
-receptor. Earlier computational studies conclude that the favored Tyr C
-C
conformer is trans in aqueous solution, whereas that of Phe
is gauche(
). Other experimental evidence supports these
configurations for the opioid peptide in aqueous solution (Tourwé
et al., 1996
; Harms et al., 1998
). Hence, Tourwé's work suggests
that the solution structure of opioid peptides might differ from its
optimal receptor bound form.
Figures 4 and
5 display the probability distribution
for the torsional angles of the Tyr and Phe residues, respectively, as obtained from averaging over the trajectories with different simulation models. Every distribution in Figs. 4 and 5 exhibits a similar three-peaked structure, in which the peaks at
/3,
, and 5
/3 correspond to the gauche(
), trans, and
gauche(+) conformations, respectively. All the simulations
yield qualitatively similar descriptions for the Tyr residue
conformation: the dominant population is trans. The
gauche(
) conformer has only 23%, 26%, and 25%
occurrences for the ASPS, NASP, and MD simulations, respectively,
whereas the gauche(+) conformer only has a trace
probability, usually less than 10%. The SASA solvation potential
simulations predict the trans configuration as dominant with
99.8% of the total occurrence probabilities. An earlier simulation
study by Hruby et al. (1988)
suggests a dominant gauche(
)
rotamer for Tyr, which disagrees with later computations obtained from
a longer trajectory (1 ns) (Wang and Kuczera, 1996
) and with our even
longer simulations.

View larger version (9K):
[in this window]
[in a new window]
|
FIGURE 4
Tyr C -C torsional angle
(in radians) distribution from different simulation methods.
(A) Full atom MD. (B) ASP solvation potential.
(C) SASA solvation potential. (D) No solvation
potential.
|
|

View larger version (18K):
[in this window]
[in a new window]
|
FIGURE 5
Phe C -C torsional angle
(in radians) distribution from different simulation methods.
(A) Full atom MD. (B) ASP solvation potential.
(C) SASA solvation potential. (D) No solvation
potential.
|
|
As shown in Fig. 5, all the LD simulations produce very different
distributions for the Phe C
-C
conformation, but all of them favor the gauche
conformations. The gauche(+)/gauche(
) ratio
changes drastically from one model to another. Without a solvation
potential (the NASP simulations), the rotamer count strongly favors
gauche(+) (73%), with only smaller contribution of
trans (18%) conformers. Turning on the ASP solvation
potential raises the population of the gauche(
) conformer
from 9% in the NASP simulation to 32% in the ASP case, but the basic
structure of the distributions is similar for the NASP and ASPS
simulations. Unlike in the ASPS and NASP simulations,
gauche(
) is strongly favored in the SASA solvation
potential simulation, and its trans probability is the
highest (21%) among the LD simulations. In contrast, the explicit
water MD simulation presents a totally different picture with an almost
even distribution between gauche(+) and
gauche(
) and with a slightly higher trans
ratio. The trans occurrence frequency in the MD simulation
is 37%, which is higher than the gauche(
) (29%) and
gauche(+) (34%) frequencies. The trans
probability from MD simulation is slightly higher than those predicted
with various LD methods. The MD simulations prediction of a greater
preference for the trans conformer agrees quite well with
the bioactivity assay.
The comparisons in Fig. 5 indicate a subtle inadequacy of the currently
available solvation model potentials. A more detailed inspection of the
molecular movie for the Met-enkephalin trajectories provides some
insight into features of the potentials that produce this
overestimation of the gauche conformations by the LD
simulations. The close encounter between the two phenyl rings is
generally accompanied by transitions out of the trans
conformation. The ASPS simulations trajectories display the pair of
phenyl rings as tending to stay close together throughout most of the
simulation, and, consequently, the configuration for the Tyr and Phe
are predominantly trans and gauche(+),
respectively. Therefore, the underestimation of the population in the
trans state by the ASP solvation potential simulations may
be due to an inaccuracy in the description of the nonbonding
aromatic-aromatic interactions. Such an inaccuracy would also partly
explain why the ASP solvation potential yields a different distribution
for the radius of gyration than the distributions from the SASA and MD
simulations. The ASP model lumps all carbon atoms into only one
classification for which the effective solvation free energy per unit
of accessible surface area is slightly positive, i.e., slightly
hydrophobic. The SASA solvation model assigns aromatic carbon atoms as
very slightly hydrophilic with a small negative solvation free
energy-per-unit-accessible surface area. When two phenyl rings come in
close contact, part of the solvent-accessible surface for the aromatic
carbons is lost. The positive solvation free energy for all carbon
atoms in the ASP parameter set implies that this loss of surface area
creates an effective attractive force between the phenyl rings, so
frequent contact between the aromatic groups is expected and is,
indeed, observed in the animation of the ASP model trajectories. This
effective solvation-induced attractive force between the phenyl groups
also restricts Met-enkephalin in the ASP model from accessing the
extended state to the extent found for the MD simulations or the SASA model.
Segmental time-correlation functions
Time-correlation functions provide a graphic measure for the rate
of change of dynamic variables. The TCF used here to compare the
dynamics predicted by the different models are the
P1 dipole autocorrelation functions of
interatomic position vectors, which are defined as
|
(18)
|
where the interatom vectors lij are
|
(19)
|
The angular brackets
···
in Eq. 18 denote an equilibrium
average. The P1 correlation function is totally
determined in the MD simulation by the amino acid sequence, the AMBER
atomic interaction potentials, and the frictional forces exerted by
surrounding solvent. By choosing atoms i and j to be proximate or
distant, respectively, the correlation function
Cij(t) reflects the local and global flexibility of the peptide. Because this flexibility generally varies
along the peptide sequence, we analyze a number of these local
P1 correlation functions. A measure of the local
segmental rigidity is taken as the decay rate of the TCF for the local
motion. An analogous P2 TCF can be obtained from
several experimental quantities, such as the relaxation times
T1 and T2, the
S2 order parameter, and the nuclear Overhauser
effect in NMR relaxation studies and also from fluorescence anisotropy
measurements. Because the relaxation time for the
P2 TCF of a given
lij(t) is roughly half that for
P1, we provide computations for
P1 that are more easily evaluated from our
theory for the long time dynamics that will be applied elsewhere to the
present set of MD and LD simulations (M.-y. Shen and K. F. Freed,
in preparation).
The local and global time-correlation functions considered here are
evaluated by averaging the scalar product
ij(0) ·
ij(t) over a trajectory of finite
duration time. The statistical error due to the use of finite
trajectories is estimated by the Zwanzig-Ailawadi formula (Zwanzig and
Ailawadi, 1969
),
|
(20)
|
for T >
', where
' is the correlation time defined by,
|
(21)
|
and T is total duration of trajectories. Clearly, when
the correlation function C(t) approaches zero or when
'/T is large, the statistical uncertainty in
C(t) is the greatest. Thus, the largest error arises for
slow dynamical variables and in the long time tail of C(t),
especially for the much shorter (smaller T) explicit water
MD simulations. The plots of the calculated correlation functions
contain error bars for the C(t) in each of the three curves
depicted, with the values C±(t) = C(t) ±
[1
C(t)] providing estimates for the
uncertainty in the computed TCF by use of finite duration trajectories.
The global dynamics of a polymer or peptide is usually associated with
the slowest motions of the molecular system, such as tumbling, folding,
and extending. These motions usually involve cooperative displacements
of large portions of the solute, and the global motions may also affect
the long time dynamics of more local molecular movements. The slowly
varying variables we illustrate here are the backbone end-to-end vector
(the C
1-C
5 separation vector) and the
inter-phenyl ring vector (C
1-C
4). The inter-phenyl vector is considered here because of its relevance to the
bioactivity of opioid peptides, as described in the previous subsection. The time scale for the conformational dynamics of the
separation vector between the two phenyl groups provides a different
perspective for the factors affecting the
-receptor binding process.
Figure 6 displays the simulated
end-to-end TCF with the estimated statistical error bars placed at 500, 1000, and 2000 ps. The end-to-end vector correlation functions
C(t), whose relaxation time
' is ~480 ps, exhibit
similar behaviors for the ASPS, ASPD, and SASA simulations. The TCF
obtained from the ASPS simulation is nearly identical to that from the
ASPD computation with only very small deviations in the long time tail as shown in Fig. 6 A, but both depart significantly from
the MD correlation functions. The MD correlation function in Fig.
6 A appears closer to the NASP correlation function (see
Fig. 6 B), which is computed without a solvation potential.
This similarity between the NASP and MD simulations, however, is
accidental, because the ASPS, ASPD, and SASA simulations in Fig. 6 have
been performed using the experimental water viscosity of 0.84 cp at a
temperature of 300 K rather than the TIP3P water viscosity of 0.50 cp
that should be used in the LD simulations to mimic the frictional
forces of the TIP3P water in the solvated MD simulations. The atoms in the MD simulation experience lower drag forces and, hence, exhibit faster global dynamics than the atoms in the LD simulation performed with the experimental water viscosity. Consequently, to compare the MD
and LD simulations on an equal footing, two strategies have been
pursued. First, another set of LD simulations has been generated using
the TIP3P water viscosity (which leads to lower atomic frictions
coefficients), and, second, we simply rescale the time with the ratio
between the simulated TIP3P and experimental water viscosities. The
rescaling of time would render these two simulations identical if the
dynamical influence of the inertia term of the Langevin equation is
negligible for the long time dynamics (despite the fact that it
represents the dominant force on some hydrogen atoms).

View larger version (10K):
[in this window]
[in a new window]
|
FIGURE 6
The end-to-end TCF from different simulation methods.
(A) Comparison between explicit and implicit water model
with experimental water viscosity. Dark solid, Full atom MD;
solid, ASPS; dashed, SASA simulations with
experimental water viscosity. (B) Comparison between
different solvation potentials. Dark solid, the simulation
without solvation potential; solid, ASPS; dashed,
SASA simulations with experimental water viscosity. (C)
Comparison between dynamical, static friction models and different
dielectric constant. Dark solid, ASPS simulation with
= 53.1; solid, ASPS; dashed, ASPD
simulations with experimental water viscosity. (D)
Comparison between time-scaled dynamics, low-friction LD simulation and
MD simulation. Dark solid, MD simulation; solid,
ASPS simulation with time scaled by a factor 0.603; dashed,
ASPS simulations with TIP3P water viscosity.
|
|
Figure 6 D compares the time rescaled correlation function
from the ASPS (to correct for the use of the experimental viscosity) with the ASPL simulations that use the TIP3P viscosity obtained from MD
calculations of pure TIP3P water. The TCF from the time-scaled ASPS and
the TIP3P water viscosity ASPL simulations agree very well with each
other and with the MD simulation TCF for the shorter time range where
the error bars overlap substantially, and the agreement persists within
error bars over the full range. This imply(s), as expected, that the
inertial terms are fairly negligible for the long time behavior of the
slower global motions. Inertial forces may be more important for the
dynamics of some buried hydrogen atoms whose ASA atomic friction
coefficients are very small or zero. Time-scaled NASP correlation
functions for global motions appear to decay too fast but behave
relatively well for some local correlation functions. The origin of
this differing behavior of the NASP model may be a direct consequence
of the lack of appearance of extended configurations in the NASP
simulations. The slower global motions couple more strongly with the
overall rotation and are therefore affected by the presence or absence
of the extended state. Due to the absence of a solvation potential in
the NASP simulations, the more compact peptide tumbles at a faster rate because of a smaller diffusion
tensor.

View larger version (19K):
[in this window]
[in a new window]
|
FIGURE 7
The phenyl-phenyl vector (atom number 10-45) time
correlation functions from different simulation methods. (A)
Comparison between explicit and implicit water model with experimental
water viscosity. Dark solid, Full atom MD; solid,
ASPS; dashed, SASA simulations with experimental water
viscosity. (C) Comparison between different solvation
potentials. Dark solid, the simulation without solvation
potential; solid, ASPS; dashed, SASA simulations
with experimental water viscosity. (C) Comparison between
dynamical, static friction models and different dielectric constant.
Dark solid, ASPS simulation with = 53.1;
solid, ASPS; dashed, ASPD simulations with
experimental water viscosity. (D) Comparison between
time-scaled dynamics, low-friction LD simulation and MD simulation.
Dark solid, MD simulation; solid, ASPS simulation
with time scaled by a factor 0.603; dashed, ASPS simulations
with TIP3P water viscosity.
|
|

View larger version (18K):
[in this window]
[in a new window]
|
FIGURE 8
The Phe-4 C -H correlation functions
from different simulation methods. (A) Comparison between
explicit and implicit water model with experimental water viscosity.
Dark solid, Full atom MD; solid, ASPS;
dashed, SASA simulations with experimental water viscosity.
(B) Comparison between different solvation potentials.
Dark solid, the simulation without solvation potential;
solid, ASPS; dashed, SASA simulations with
experimental water viscosity. (C) Comparison between
dynamical, static friction models and different dielectric constant.
Dark solid, ASPS simulation with = 53.1;
solid, ASPS; dashed, ASPD simulations with
experimental water viscosity. (D) Comparison between
time-scaled dynamics, low-friction LD simulation and MD simulation.
Dark solid, MD simulation; solid, ASPS simulation
with time scaled by a factor 0.603; dashed, ASPS simulations
with TIP3P water viscosity.
|
|
The inter-phenyl vector TCF in Fig. 7 exhibits similar trends among the
different simulation methods as found for the end-to-end vector
correlation function. The correlation times of the inter-phenyl vector
are only slightly smaller than the times for the end-to-end vector,
indicating that the long time dynamics of the inter-phenyl separation
vector is mainly governed by global tumbling. This slow relaxation of
the inter-phenyl motion ensures a slower relative motion between phenyl
groups in Met-enkephalin, and, therefore, affects its receptor
affinity. Slower inter-phenyl dynamics conserve the relative
orientations of Tyr and Phe residues along the main chain, producing
the bioactivity of Met-enkephalin.
The local positional TCF involve more complicated intramolecular
motions and are usually much faster than the correlation functions for
global motions. The P1 correlation time
is
defined by
|
(22)
|
The
for local motions ranges from less than 20 ps, such as for
the terminal C-H bond, to 100-200 ps for the central backbone bonds.
We illustrate the general trends in Fig. 8 by considering the
C
-H bond of Phe-4, a central C-C bond to characterize the backbone dynamics, and one C-C bond on the aromatic side chains for describing the phenyl flipping dynamics. The
P1 correlation times
of five C-H
correlation functions are listed in Table 2. These five correlation functions
generated from the ASPS solvation simulation exhibit fairly similar
features: all have approximately a 150-ps correlation time with a
faster decay at short times. The Phe-4 C-H bond has the slowest decay
(in 0-400 ps range) among the five. The Met-5 C-H bond has the
slowest tail for t > 400 ps, but these correlation
functions are still very close together. The nearly identical dynamics
for all C-H bonds from ASPS simulations can be explained by the
following structural feature discussed in the previous subsection: the
positive free energy parameter between carbon atoms in the ASP model
leads to an effective attractive force between phenyl groups, so the
dynamics produced by this model describe Met-enkephalin as very prone
to be in the semi-packed state. Because the semi-packed state has less
backbone flexibility, the long time tail in all the C-H dynamics is
mainly governed by rigid tumbling, leading to only a single decay rate
for all the C-H bonds in the ASP model.
View this table:
[in this window]
[in a new window]
|
TABLE 2
The P1 correlation times (in ps)
for the C -H motion from Langevin dynamics simulations
using various implicit solvent models (water viscosity is = 0.84 cp)
|
|
The time correlation functions computed with the SASA solvation
parameters yield two distinguishable groups of C-H relaxation times,
faster dynamics for Gly-2 and Gly-3 and slower for Tyr-1, Phe-4, and
Met-5. The Phe-4 C-H bond correlation function decays slightly more
slowly, and the Tyr-1 C-H relaxation time is the fastest among the
three slow C-H bonds. A similar behavior emerges from the
MD-calculated TCF, although the statistical errors are moderately
large. All the simulation models predict the Gly-2 C-H relaxation as
the fastest, and this prediction arises because the light hydrogen side
chain and its position at the end of the backbone yield the expected
faster motion. The internal backbone deformation motion, such as that
of the C-C bond shown in Fig. 9,
has a relaxation time (
) scale of ~400 ps, much faster than the
time scale of global tumbling (900 ps). These results facilitate constructing a qualitative physical picture for the dynamical evolution
of the Met-enkephalin dynamics. The faster moving leading portion (Tyr)
swings around the heavier tail (Phe-Met) through a conformational
transition occurring in the flexible connection between the two
glycines. Met-enkephalin can undergo rapid transitions to the preferred
-receptor docking position because the flexible Gly-Gly portion
allows the fast structural readjustments that lead to higher binding
strength. This conclusion partly explains why the observed and
calculated solution conformation populations of opiate peptides differ
from the one with the highest bioactivity. The local chain flexibility
enables transitions to occur quickly between these two local
conformations.