| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
Key Laboratory of Structural Biology, and School of Life Sciences, University of Science and Technology of China, Chinese Academy of Sciences, Hefei, Anhui 230027, China
Correspondence: Address reprint requests to Haiyan Liu or to Yunyu Shi, Key Laboratory of Structural Biology, School of Life Sciences, University of Science and Technology of China, Hefei, Anhui 230027, China. E-mail: hyliu@ustc.edu.cn. E-mail: yyshi{at}ustc.edu.cn.
| ABSTRACT |
|---|
|
|
|---|
| INTRODUCTION |
|---|
|
|
|---|
A straightforward way to solve this problem is to increase the simulation time. With the improvements in computer power and algorithms, we have progressed to tens of nanoseconds now (Daggett, 2000
). This timescale is still too short to observe many important protein processes, such as slow conformational changes and protein folding/unfolding. The inefficiency in sampling is a result of the frustrated nature of the energy landscape. While the exploration of different conformational states and the mechanism of transitions between different conformational states are of more interest than examining local fluctuations during a simulation, the system will spend most of the time in locally stable statesinteresting escaping events being rarely observed in a conventional simulation (Eastman et al., 1999
).
To extend the capability of atomic simulations, new techniques need to be developed to improve the sampling efficiency considerably. Various methods have been proposed, such as the multicanonical ensemble algorithm (Hansmann and Okamoto, 1993
; Nakajima et al., 1997
), adaptive umbrella sampling (Bartels and Karplus, 1998
), conformational flooding (Grubmuller, 1995
), chemical flooding (Muller et al., 2002
), and essential dynamics simulation (Amadei et al., 1996
). Among them, the last three are based on collective coordinates.
The use of collective coordinates has become an important technique to extract functionally relevant motions from simulation results (Kitao and Go, 1999
; Berendsen and Hayward, 2000
). Computational techniques to determine collective motions are varying. Principal component analysis (PCA) or essential dynamics analysis (EDA) (Amadei et al., 1993
) can be carried out on a large number of configurations in MD or Monte Carlo (MC) trajectories, or a large set of experimental structures. The CONCOORD method (de Groot et al., 1997
) uses a very crude approximation of atomic interactions to generate a large number of configurations that satisfy a set of atomic distance constraints and then PCA or EDA can be applied. Normal mode analysis (NMA) uses a harmonic approximation of the full atomic force field and a single experimental structure suffices. The most time-consuming aspects of NMA are energy minimization and diagonalization of the Hessian matrix using an atomic force field. Coarse-grained models, such as the Gaussian network model (GNM) (Haliloglu et al., 1997
) and the anisotropic network model (ANM) (Atilgan et al., 2001
), have been designed to reproduce a few collective modes using simplified pairwise Hookean potential and have been successful in predicting atomic fluctuations. By the above methods, an "essential subspace" spanned by a small number of collective modes can be obtained. Collective coordinates are widely employed to investigate protein dynamics. Recent studies have shown that functionally relevant motions occur along the direction of a few collective coordinates, which dominantly contribute to atomic fluctuation. Kitao et al. (1998)
have used PCA and a jumping-among-minimum model to elucidate the energy landscapes of proteins. The analyses applied to a 1-ns MD trajectory of the human lysozyme in water indicate that, 1), the energy surfaces of individual conformational substates are nearly harmonic and mutually similar; and 2), protein motions consist of three types of collective modes, namely, multiply hierarchical modes (the number of which accounts for only 0.5% of all modes, yet dominate contributions to total mean-square atomic fluctuation), singly hierarchical modes, and harmonic modes. Intersubstate motions are observed only in a small-dimensional subspace spanned by the axes of multiply and singly hierarchical modes.
Collective coordinates can be used as a basis set for efficient sampling. In MC simulations, normal modes are used as variables and the sampling step size is scaled by normal mode amplitudes (Noguti and Go, 1985
). In the conformational flooding (Grubmuller, 1995
) and chemical flooding methods (Muller et al., 2002
), according to the collective modes obtained from short MD simulations, a coarse-grained description of conformational substate is derived from which bias potential can be constructed. The bias potential destabilizes the initial conformation and lowers free energy barriers of structural transitions, thus complex conformational transitions and chemical reactions may be observed and followed in a simulation. The Berendsen group developed an "essential dynamics" simulation method. In this method, protein motions are constrained to move along the essential collective modes, while the motions along the other degrees of freedom obey the usual equations of motions (Amadei et al., 1996
). de Groot et al. (1996a
,b
) have applied this technique in the study of a 13-residue peptide hormone, guanylin, and a 85-residue protein, HPr. The region of conformational space obtained from the essential dynamics sampling includes the area sampled by the normal MD and extends beyond. Abseher and Nilges (2000)
applied restraints in essential subspace on an ensemble of MD trajectories that proceed otherwise independently in parallel. The results show that weak restraints on the ensemble variance suffice for an increase in sampling efficiency along collective modes by two orders of magnitude.
One disadvantage of the essential dynamics method is that the collective modes need to be obtained from a predetermined set of protein conformations, usually from a relatively short period of simulation. Such a set usually represents only local fluctuations. As collective modes are treated as linear combinations of Cartesian coordinates, they are, in principle, conformation-dependent. That is, the essential subspace may vary when the protein conformation belongs to different local states. The method we introduced here uses the collective modes obtained by the coarse-grained model ANM (Atilgan et al., 2001
) to guide the atomic-level MD simulation. Using the ANM, the collective modes are estimated using a single protein conformation without carrying out a long simulation. These modes can be updated frequently during the simulation since computation of the collective modes using ANM is rapid, which allows us to give up the severe assumption that the essential subspace is invariant in the conformational space. Another key strategy in our method is that the motions along the collective modes are amplified by coupling them to a thermal bath at higher temperature than the remaining motions, which is achieved by taking advantage of the weak-coupling algorithm (Berendsen et al., 1984
). The idea comes from the molecular dynamics docking method (Di Nola et al., 1994
; Mangoni et al., 1999
). The original method consists of a separation of the center-of-mass motion of the substrate from its internal motions, and a separate coupling to different thermal baths for both types of the motions of the substrate and for the motions of the receptor. Using higher center-of-mass temperature and room temperature for the internal degrees of freedom of the substrate, an efficient search in the translational and rotational subspace is achieved without disturbance of the internal structure. The separate temperature bath approach has also been employed in a novel MD simulation, in which the protein and solvent have been simulated at different temperatures to demonstrate the key roles of solvent in controlling functionally important fluctuations (Vitkup et al., 2000
).
Algorithmatically, our amplified-collective-motions (ACM) approach comprises the following steps: 1), estimate the collective modes according to a single protein configuration in the MD simulation using ANM, and the collective modes updated frequently during the simulation; 2), project the velocities of the atoms into two subspaces: the subspace spanned by the limited number of slow collective modes (essential subspace) and the rest; and 3), couple the velocities in different subspaces with different thermal baths, the essential subspace coupled to higher temperature and the remaining coupled to normal temperature bath. We expect the essential subspace can be sampled adequately and large amplitude motions of the protein can be observed during the ACM simulation. In the meanwhile, local interactions and higher frequency motions are kept at normal temperature, allowing local structures to relax to accommodate the large amplitude motions in the collective coordinates.
Here, we briefly describe the construction of the following sections. We firstly describe the theory and methods of ACM, and then computational details for two test systems, an S-peptide analog and bacteriophage T4 lysozyme (T4L). Subsequently, we provide some results and discussion, which demonstrate the efficiency of the ACM method. Finally, we provide concluding remarks.
| THEORY AND METHODS |
|---|
|
|
|---|
atoms in the protein, and the interactions between residues in close proximity are replaced by harmonic springs (Miyazawa and Jernigan, 1985
from GNM show excellent agreement with experiments. It should be noted that GNM is a scalar model and all fluctuations are implicitly assumed to be isotropic. In fact, the fluctuations of the proteins are in general anisotropic, and it is important to assess the directions of collective motions, as these can be directly relevant to biological functions and mechanisms. So, an extension of GNM, called the anisotropic network model (ANM), has been developed (Atilgan et al., 2001
In our method, we adapted ANM. For a folded protein, the junctions are identified with the center-of-mass of each residue. According to ANM, harmonic potential can be written as
![]() | (1) |
are the force constant, distance, and equilibrium distance between center-of-mass of residues i and j, respectively. In this work, kij are given by
![]() | (2) |
In the case of Nr residues, the Cartesian coordinates are ri = (xi, yi, zi), and the second derivatives of the overall potential can be organized in the 3Nr x 3Nr Hessian matrix H. H is composed of Nr x Nr super-elements, each of size 3 x 3:
|
H|<|=|>||<|\left[|>|\begin|<|array|>||<|llll|>|h_|<|11|>|&h_|<|12|>|&|<|\cdots|>|&h_|<|1\mathrm|<|N|>|_|<|\mathrm|<|r|>||>||>|\\h_|<|21|>|&h_|<|22|>|&|<|\cdots|>|&h_|<|2\mathrm|<|N|>|_|<|\mathrm|<|r|>||>||>|\\ | (3) |
j) is
![]() | (4) |
![]() | (5) |
![]() | (6) |
![]() | (7) |
![]() | (8) |
is the diagonal matrix consisted of the eigenvalues (
i) of H, organized in ascending order. The ith column of U is the eigenvector associated with
i. There are 3Nr eigenvalues, of which the initial six associated with overall translations and rotations are zero. The remaining 3Nr-6 nonzero eigenvalues and corresponding eigenvectors reflect the respective frequencies and Cartesian components of the individual modes. The first few slow, long wavelength collective modes may represent functionally relevant motions of the protein.
The amplified collective motion method
Before going into the technical details of the ACM method, we first introduce the weak-coupling method briefly (Berendsen et al., 1984
). The weak-coupling method is usually used to couple MD simulations to external baths in, for example, constant temperature (T0) simulations. In a constant temperature simulation, the kinetic energy of the protein at time t (Ek(t)) can be given as
![]() | (9) |
![]() | (10) |
![]() | (11) |
t is the MD time step, and
T is the temperature relaxation time (a larger
T indicates a weaker coupling). The factor S is used to uniformly scale the velocities of the next time step, making the actual temperature relax toward T0.
Our technique is based on the weak-coupling method. From V(t), we can obtain the velocity of the center-of-mass of each residue,
![]() |
![]() | (12) |
the component of the Cartesian coordinates of residue
in el. V1(t) is the projection of the velocities into the essential subspace. The projection of V(t) in the rest subspace can be simply calculated as
![]() | (13) |
We apply the weak-coupling method to these two parts of the velocities, separately. For V1, the reference temperature is higher (Th) and the corresponding temperature-scaling factor is Sh. V2 is coupled to the normal temperature (Tl), the scaling factor being Sl, and
![]() | (14) |
| COMPUTATIONAL DETAILS |
|---|
|
|
|---|
-helical analog of Ribonuclease A S-peptide (Tirado-Rives and Jorgensen, 1991
-helix (residues 412), N-terminal (13), and C-terminal (1315) random coils (Fig. 1 a).
|
-helix is denatured completely (Fig. 1 b). Here, we selected two structures as the starting structures: the initial native structure and the final unfolded structure for further simulations (Fig. 1). Instead of an explicit water model, we used an implicit solvent modelthe generalized Born/surface area (GBSA) water model (Still et al., 1990
Stochastic dynamics simulations (van Gunsteren et al., 1981
) have been used for the implicit solvent simulations, and the GBSA model has been used in conjunction with the GROMOS96 43A1 force field (Zhu et al., 2002
). After energy minimization with the steepest descent method, the system was equilibrated by 50-ps GBSA simulations, during which the positions of nonhydrogen atoms were restrained. All atoms were given an initial velocity obtained from a Maxwellian distribution at 274 K. Since the system only contains the peptide (151 atoms), we believe the above procedures were enough to generate equilibrated structures for the successive sampling simulations. The simulations within each of the four groups were started using different sets of initial velocities.
During the normal GBSA simulations (NA and UN), the peptide was coupled to a temperature bath of reference temperature 274 K with a relaxation time of 0.1 ps. Bonds were kept rigid using the SHAKE method (relative geometric tolerance = 10-4) (Ryckaert et al., 1977
). The nonbonded interactions were calculated without cutoff. A time step of 2 fs was used. The ACM simulations (NA_ACM and UN_ACM) were otherwise the same as the control simulations, except that a few of the slowest collective modes and the rest have been coupled to separate temperature baths, respectively. The short (rcs) and long-range cutoff distances (rcl) in the ANM analysis were set as 0.7 nm and 1.4 nm, respectively. Either the first three or the first five modes were coupled to the higher temperature (Th = 358 K), with a relaxation time of 0.01 ps, and the other modes were coupled to the lower temperature (Tl = 274 K and
Tl = 0.1 ps). The collective modes were recalculated either every 10 or every 25 time steps, according to the new current configuration of the peptide. The computational cost of recalculating the ANM modes with such frequencies can be nearly ignored relative to the simulation steps. Average temperatures of the selected lowest collective modes and the rest during one UN_ACM simulation were listed in Table 1. Since only a few modes (five slowest modes) were coupled to the higher temperature (Th = 358 K), the actual temperatures of them have large fluctuations. The average value is 345 K, lower than the reference temperature. This may be caused by the weak coupling between motions along the slowest modes and the remaining motions, and the mixing of different modes as they evolve with the conformation. The other degrees of freedom have much smaller fluctuations in temperature and the average temperature is nearly the same as the reference temperature. Although a few modes were coupled to the higher temperature, the average temperatures of the entire system in the ACM simulations are only
1 K higher than that in the conventional simulations.
|
Bacteriophage T4 lysozyme
Bacteriophage T4 lysozyme (T4L) is composed of two domains connected by a long
-helix. There is a deep opening between the N-terminal and C-terminal domains, which is the active site cleft for oligosaccharide binding (Anderson et al., 1981
). Many crystal structures of T4L and its mutants have been found that indicate the hinge-bending-type domain motion opening or closing up the cleft (Faber and Matthews, 1990
; Zhang et al., 1995
).
The crystal structure of T4L (entry 2LZM of the PDB) determined at 1.7 Å resolution was used as the starting structure (Weaver and Matthews, 1987
). Rectangular periodic boundary conditions were used with box length of 5.263 nm x 5.687 nm x 7.827 nm (the minimum distance between the solute and the box boundary is 1.0 nm). SPC water molecules were added from an equilibrated cubic box containing 216 water molecules (Berendsen et al., 1981
). Some of the added water molecules were removed so that no water oxygen atom is closer than 0.23 nm to a nonhydrogen atom of the protein or another water oxygen atom. The system, protein and water, was initially energy-minimized for 500 cycles using the steepest descent method. Eight Cl- ions were added to compensate the net positive charge on the protein. These ions were introduced by replacing water molecules with the highest electrostatic potential. The energy was again minimized using steepest descent method. The final system contains 1703 protein atoms, eight Cl- ions and 7018 water molecules, leading to a total size of 22,765 atoms. 200-ps equilibration MD runs were performed, in which the temperature was gradually increased from 50 K (20 ps), 100 K (20 ps), 150 K (20 ps), 200 K (20 ps), 250 K (20 ps), and 300 K (100 ps), together with an increase of the temperature coupling constant (from 0.01 to 0.1 ps) and a decrease of the position restraints force constant (from 50,000 to 10,000 kJ mol-1 nm-2). Then two 3-ns productive runs were performed, respectively.
One conventional MD at 300 K (S300) was performed using an isothermal-isobaric simulation algorithm (Berendsen et al., 1984
). Protein and solvent was coupled separately to temperature baths of reference temperature with a coupling time of 0.1 ps. The pressure was also kept constant by weak coupling to a bath of reference pressure (P0 = 1 atm; coupling time
p = 0.5 ps). Bonds were kept rigid using the SHAKE method with a relative geometric tolerance of 10-4 (Ryckaert et al., 1977
). Long-range forces were treated using twin-range cutoff radii of Rcp = 0.8 nm for the charge group pair list, and Rcl = 1.4 nm for the longer-range nonbonded interactions. The pair list was updated every 20 fs. Reaction-field forces were included (Tironi et al., 1995
), originated from a dielectric permitivity of 54 for SPC water (Smith and van Gunsteren, 1994
). A time step of 2 fs was used. The other is ACM simulation (SACM), which was otherwise the same as the control simulation, except for modifications to temperature coupling. Short- and long-range cutoff distances of 0.7 nm and 1.0 nm, respectively, were used in ANM. We selected the three slowest collective modes (nl = 3). These three degrees of freedom were coupled to the higher temperature Th = 800 K, with a relaxation time of 0.01 ps, and the other modes were coupled to the temperature Tl = 300 K (
Tl = 0.1 ps). The average temperatures were 751 K and 299 K for the collective modes and the rest, respectively (Table 1). The total number of degrees of freedom of T4L is over 5000 and only three collective modes were coupled to 800 K. The temperature averaged over all degrees of freedom of the system in SACM is almost the same as in S300. The collective modes were recalculated every 100 time steps according to the new current configuration of the protein. The most time-consuming part of the ACM algorithm compared to usual MD is the diagonalization of the 3Nr x 3Nr Hessian matrix (Dhillon, 1997
). However, Nr is only the number of residues and the updating of ANM modes is not done for every step. For the S-peptide analog the additional cost of ACM over usual MD is ignorable. For T4L, ACM increases the computational cost by only
10% over usual MD with the above setup.
The same software has been used as in the simulations of the S-peptide analog. EDA (Amadei et al., 1993
) was performed with WHAT IF (Vriend, 1990
).
| RESULTS AND DISCUSSION |
|---|
|
|
|---|
Root-mean-square deviations (RMSD) from the starting structure for the backbone atoms from residues 412 were calculated (Fig. 2). The helix is stable during the NA simulations (Fig. 2 a), which show agreement with our previous results (Zhang et al., 2001
). Compared with Fig. 2 a, during the NA_ACM simulations, the native
-helix is also kept very well, only with significantly larger structural fluctuations (Fig. 2 b). In the UN simulations, RMSD are stable and show little change after a period of time, which indicate the peptide has been "trapped" in a single non-native substate (Fig. 2 c). We have carried out 10 UN_ACM simulations. In eight of them, the unfolding peptide refolds into the native helix (Fig. 2 d). The refolding time (the first time when the RMSD from the native structure is <0.1 nm) ranged from 20 to 90 ns. After refolding, the behaviors of the peptide in the UN_ACM simulations are similar to its behaviors in the NA_ACM simulations (Fig. 2 b). In only two UN_ACM simulations, refolding has not been observed within 100 ns.
|
|
Among the 10 UN_ACM simulations, peptide refolding was observed in eight simulations. It is interesting to study the folding/unfolding pathways based on these multiple trajectories. The forming and breaking of five native i + 4
i hydrogen bonds, i.e., Phe8:NH-Ala4:CO (HB1), Leu9:NH-Ala5:CO (HB2), Arg10:NH-Ala6:CO (HB3), Glu11:NH-Lys7:CO (HB4), and His12:NH-Phe8:CO (HB5), were monitored (Fig. 4), which shows correspondence with the folding/unfolding of the peptide (Fig. 2). The five hydrogen bonds are stable in the NA simulations (Fig. 4 a) and less stable in NA_ACM simulations (Fig. 4 b). No native hydrogen bonds formed during the UN simulations. In the refolding process of UN_ACM simulations, the hydrogen bonds near the N-terminal of the peptide formed first, and then the hydrogen bonds near the C-terminal (Fig. 4 c). Instead, the breaking of the hydrogen bonds took place first near the C-terminal in the unfolding process (Zhang et al., 2001
). The results indicate that the
-helical hydrogen bonds near the N-terminal may be intrinsically more stable than those near the C-terminal, forming earlier in the folding process and being disrupted later in the unfolding process.
|
Root-mean-square fluctuations (RMSF) of the C
atoms from residues 1162 were calculated for the two simulations (Fig. 5 a). SACM shows significantly larger fluctuations than S300. As expected, the increased motions in the ACM simulation mainly come from the interdomain motion in T4L. Root-mean-square deviations (RMSD) from the starting structure of the C
atoms from residues 1162 (Fig. 5 b), N-terminal domain (residues 1365, Fig. 5 c), and C-terminal domain (residues 75162, Fig. 5 d) indicate that the larger amplitude motions in SACM do not come from the intradomain motions of the N- and C-terminal domains. The secondary structures as well as the structures of each domain have been well-maintained in the ACM simulation.
|
atoms of residues 1162 are used to construct the covariance matrix. The results indicate that the first two eigenvectors contribute
90% to the total positional variations among the experimental structures.
As indicated by de Groot et al. (1998)
, the first mode obtained by EDA on the x-ray cluster corresponds to a closure motion (defined by an effective hinge axis perpendicular to the line connecting the centers of mass of the two domains) and the second mode consists of a twist of the two domains, with the effective hinge axis being more parallel to the line connecting the two centers of mass. We projected the x-ray structures and the trajectories of the two simulations onto the plane defined by the above two modes to compare the extent of variations in the x-ray structures and the two simulations (Fig. 6). Along the first mode, there seem to be two distinct clusters of the x-ray structures. Structures to the left region are open structures, and structures to the right are closed. The MD simulation (S300), which started with a closed structure (indicated by an arrow in Fig. 6), spend most of the time at the intermediate region between the two clouds of the x-ray structures, but does not reach positions corresponding to either the most open or the most closed configurations in the x-ray cluster. The ACM simulation (SACM), started with the same structure as S300, fluctuates much more significantly toward all directions in the plane than the usual MD and x-ray clusters. These results suggest that the expanded sampling in SACM is mainly the interdomain motion in T4L. Among the set of 38 x-ray structures, we selected the structure with the maximum and the minimum projections on the closure and twist modes, respectively. In Fig. 7 a, each pair of structures was superimposed. Between the extreme structures along the closure mode, the RMSD of C
atom positions is 0.40 nm (Fig. 7 a, left). Along the twist mode the corresponding RMSD is 0.14 nm (Fig. 7 a, right). The same were performed on the sets of conformations sampled in the usual (Fig. 7 b) and ACM simulations (Fig. 7 c), respectively. In SACM, both the closure and the twist motions have larger amplitudes than in S300 and in the x-ray structure set, in agreement with Figs. 5 and 6.
|
|
In our conventional MD simulation of T4L, a large part of the region of x-ray structures could not be sampled within 3 ns, but the ACM simulation seemed to cover the x-ray structures within the same simulation time (Fig. 6). Dynamical transitions among the x-ray structures are much more obvious in SACM. We expect further such simulations with ACM may grasp the essential motions at much longer timescales, which can have more extensive relations with functions.
Comparisons with other sampling methods
Collective coordinates have been used in the study of protein motions for quite some time. In the ACM method presented here, the collective modes are obtained using the coarse-grained model ANM (Atilgan et al., 2001
). The ANM approximation seems sufficient to reproduce the slow dynamics as well as the harmonic modes obtained by NMA, using all-atom empirical potentials. In the case of T4L, the first few slowest ANM modes are able to significantly cover the first two essential modes obtained on a set of crystal structures. ANM is computationally more efficient than NMA and EDA. More importantly, by using ANM we can update the collective modes on the fly. We also do not need a known set of conformers to determine the essential modes or to assume the essential modes to be invariant. This is not the case in the essential dynamics simulation method (Amadei et al., 1996
). For large systems, the updating of the ANM modes can be computationally expensive if it is done every MD step. This is, however, unnecessary. An "ideal" updating frequency is that it can catch up with the changes in the essential subspace with minimum additional computational cost. This should not be an issue in practice, because while larger systems require more efforts to obtain the ANM modes, the physical processes of interest generally take place at longer timescales. For T4 lysozyme in explicit water, updating the ANM modes every 100 steps (0.2 ps) only increases the computational cost by 10%. One does not need to risk violating the physical requirement by reducing the frequency further to gain little in additional computational efficiency. For both systems we studied, we expect the conformational changes, that may lead to significantly different slow conformational subspaces, to take place for significantly longer than the respective time intervals for ANM updating.
One key strategy in the ACM approach is to couple only the first few low-frequency collective modes to a higher temperature. Compared with other previously described sampling methods employing higher temperatures, such as the multicanonical ensemble algorithm (Hansmann and Okamoto, 1993
; Nakajima et al., 1997
), the temperatures of the entire system are elevated in the latter approaches. As a result, the higher energy regions are nondiscriminatively sampled. This significantly expands the accessible conformational space, introducing additional burden on sampling. The other methods, such as adaptive umbrella sampling (Bartels and Karplus, 1998
), conformational flooding (Grubmuller, 1995
), and chemical flooding (Muller et al., 2002
), achieve higher sampling efficiency by flattening the potential energy surface. In the ACM method, the potential energy function is the same as in the conventional simulation. The expansion of sampling is along the slow modes, i.e., low potential energy valleys on the potential energy surface.
There are still some disadvantages to our method. In the ACM simulations, a number of collective modes are coupled to a higher temperature at all times; that is, the system is always in a nonequilibrium state. There remains the question of how to extract the thermodynamics properties of the system from such simulations. Another disadvantage has to do with our selection of the slowest modes for higher temperature coupling. In some cases, the functional motions of the proteins we are interested in may not occur along these slowest collective modes. Future work is needed on how to pick up the collective modes of functional importance in such cases and selectively excite them in simulations.
| CONCLUSIONS |
|---|
|
|
|---|
In our first application to demonstrate the method, multiple GBSA simulations of the S-peptide analog were performed. For the ACM simulations starting from the native structure, the native structure is stable, but larger fluctuations than the conventional simulations are observed. Among the total 10 ACM simulations started from the denatured structure, peptide refolding was observed in eight simulations; the peptide was not trapped in local, non-native minima as is the case in conventional simulations. It seems that the ACM method can expand the conformational space of the peptide extensively while restricting the sampling within the low-energy regions. The peptide results are interesting also in the sense that, to our knowledge, ANM has only been used to investigate the collective motions of folded structures before our work.
In the T4 lysozyme case, compared with the conventional simulation which sampled a restricted area in the conformational space, the interdomain motions observed in the ACM simulation (SACM) are much more obviously and in good agreement with the observed variances of different x-ray structures. As in the peptide case, the amplified motions are restricted to the long wavelength collective modes, with the secondary structures and individual domains almost undisrupted, compared with the conventional simulation.
Applications of the ACM method may be abundant, such as large-magnitude functional motions of protein, folding/unfolding pathways of peptides and proteins, and structure predictions. Our results indicated that the method could be used in both explicit and implicit solvent simulations and may be generally applicable to many different MD and MC formalisms. Of course, the method needs further improvements to investigate these important biological problems efficiently.
| ACKNOWLEDGEMENTS |
|---|
|
|
|---|
This work has been supported by the Chinese Academy of Sciences (grant number G1999075605) and the Chinese National Natural Science Foundation (grant numbers 90103032 and 30025013).
Submitted on October 19, 2002; accepted for publication January 28, 2003.
| REFERENCES |
|---|
|
|
|---|
Amadei, A., A. B. M. Linssen, and H. J. C. Berendsen. 1993. Essential dynamics of proteins. Proteins. 17:412425.[Medline]
Amadei, A., A. B. M. Linssen, B. L. de Groot, D. M. F. van Aalten, and H. J. C. Berendsen. 1996. An efficient method for sampling the essential subspace of proteins. J. Biomol. Struct. Dyn. 13:615625.[Medline]
Anderson, W. F., M. G. Grutter, S. J. Remington, L. H. Weaver, and B. W. Matthews. 1981. Crystallographic determination of the mode of binding of oligosaccharides to T4 bacteriophage lysozyme: implications for the mechanism of catalysis. J. Mol. Biol. 147:523543.[Medline]
Atilgan, A. R., S. R. Durell, R. L. Jernigan, M. C. Demirel, O. Keskin, and I. Bahar. 2001. Anisotropy of fluctuation dynamics of proteins with an elastic network model. Biophys. J. 80:505515.
Bahar, I., A. R. Atilgan, and B. Erman. 1997. Direct evaluation of thermal fluctuations in proteins using a single parameter harmonic potential. Fold. Des. 2:173181.[Medline]
Bartels, C., and M. Karplus. 1998. Probability distributions for complex systems: adaptive umbrella sampling of the potential energy. J. Phys. Chem. B. 102:865880.
Berendsen, H. J. C., J. P. M. Postma, W. F. van Gunsteren, and J. Hermans. 1981. Interaction models for water in relation to protein hydration. In Intermolecular Forces. B. Pullman, editor. Reidel, Dordrecht, the Netherlands. pp.331342.
Berendsen, H. J. C., J. P. M. Postma, W. F. van Gunsteren, A. Di Nola, and J. P. Haak. 1984. Molecular dynamics with coupling to an external bath. J. Chem. Phys. 81:36843690.
Berendsen, H. J. C., and S. Hayward. 2000. Collective protein dynamics in relation to function. Curr. Opin. Struct. Biol. 10:165169.[Medline]
Daggett, V. 2000. Long timescale simulations. Curr. Opin. Struct. Biol. 10:160164.[Medline]
de Groot, B. L., A. Amadei, D. M. F. van Aalten, and H. J. C. Berendsen. 1996a. Towards an exhaustive sampling of the configurational spaces of the two forms of the peptide hormone guanylin. J. Biomol. Struct. Dyn. 13:741751.[Medline]
de Groot, B. L., A. Amadei, N. A. J. van Nuland, and H. J. C. Berendsen. 1996b. An extended sampling of the configurational space of hpr form E. coli. Proteins. 26:314333.[Medline]
de Groot, B. L., D. M. F. van Aalten, R. M. Scheek, A. Amadei, G. Vriend, and H. J. C. Berendsen. 1997. Prediction of protein conformational freedom from distance constraints. Proteins. 29:240251.[Medline]
de Groot, B. L., S. Hayward, D. M. F. van Aalten, A. Amadei, and H. J. C. Berendsen. 1998. Domain motions in bacteriophage T4 lysozyme: a comparison between molecular dynamics and crystallographic data. Proteins. 31:116127.[Medline]
Dhillon, I. S. 1997. A new O (n2) algorithm for the symmetric tridiagonal eigenvalue/eigenvector problem. In Computer Science Division Technical Report No. UCB//CSD-97971, University of California at Berkeley, Berkeley, CA.
Di Nola, A., D. Roccatano, and H. J. C. Berendsen. 1994. Molecular dynamics simulation of the docking of substrates to proteins. Proteins. 19:174182.[Medline]
Doniach, S., and P. Eastman. 1999. Protein dynamics simulations from nanoseconds to microseconds. Curr. Opin. Struct. Biol. 9:157163.[Medline]
Eastman, P., M. Pellegrini, and S. Doniach. 1999. Protein flexibility in solution and in crystals. J. Chem. Phys. 110:1014110152.
Erman, B., A. Kloczkowski, and J. E. Mark. 1989. Chain dimensions and fluctuations in random elastomeric networks. 2. Dependence of chain dimensions and fluctuations on macroscopic strain. Macromolecules. 22:14321441.
Faber, H. R., and B. W. Matthews. 1990. A mutant T4 lysozyme displays five different crystal conformations. Nature. 348:263266.[Medline]
Grubmuller, H. 1995. Predicting slow structural transitions in macromolecular systems: conformational flooding. Phys. Rev. E. 52:28932906.
Haliloglu, T., I. Bahar, and B. Erman. 1997. Gaussian dynamics of folded proteins. Phys. Rev. Lett. 79:30903093.
Hansmann, U. H. E., and Y. Okamoto. 1993. Prediction of peptide conformation by multicanonical algorithm: new approach to the multiple-minima problem. J. Comp. Chem. 14:13331338.
Kabsch, W., and C. Sander. 1983. Dictionary of protein secondary structure: pattern recognition of hydrogen-bonded and geometrical features. Biopolymers. 22:25762637.
Kitao, A., and N. Go. 1991. Conformational dynamics of polypeptides and proteins in the dihedral angle space and in the Cartesian coordinate space: normal mode analysis of deca-ananine. J. Comp. Chem. 12:359368.
Kitao, A., S. Hayward, and N. Go. 1994. Comparison of normal mode analyses on a small globular protein in dihedral angle space and Cartesian coordinate space. Biophys. Chem. 52:107114.[Medline]
Kitao, A., S. Hayward, and N. Go. 1998. Energy landscape of a native protein: Jumping-Among-Minimum model. Proteins. 33:496517.[Medline]
Kitao, A., and N. Go. 1999. Investigating protein dynamics in collective coordinate space. Curr. Opin. Struct. Biol. 9:164169.[Medline]
Kloczkowski, A., J. E. Mark, and B. Erman. 1989. Chain dimensions and fluctuations in random elastomeric networks. 1. Phantom Gaussian networks in the underformed state. Macromolecules. 22:14231432.
Kraulis, P. J. 1991. MOLSCRIPT: a program to produce both detailed and schematic plots of protein structures. J. Appl. Crystallog. 24:946950.
Mangoni, M., D. Roccatano, and A. Di Nola. 1999. Docking of flexible ligands to flexible receptors in solution by molecular dynamics simulation. Proteins. 35:153162.[Medline]
Ming, D., Y. Kong, S. J. Wakil, J. Brink, and J. Ma. 2002a. Domain movements in human fatty acid synthase by quantized elastic deformational model. Proc. Natl. Acad. Sci. USA. 99:78957899.
Ming, D., Y. Kong, M. A. Lambert, Z. Huang, and J. Ma. 2002b. How to describe protein motion without amino acid sequence and atomic coordinates. Proc. Natl. Acad. Sci. USA. 99:86208625.
Miyazawa, S., and R. L. Jernigan. 1985. Estimation of effective inter-residue contact energies from protein crystal structures: quasi-chemical approximation. Macromolecules. 18:534552.
Muller, E. M., A. de Meijere, and H. Grubmuller. 2002. Predicting unimolecular chemical reactions: chemical flooding. J. Chem. Phys. 116:897905.
Nakajima, N., H. Nakamura, and A. Kidera. 1997. Multicanonical ensemble generated by molecular dynamics simulations for enhanced conformational sampling of peptides. J. Phys. Chem. B. 101:817824.
Noguti, T., and N. Go. 1985. Efficient Monte Carlo method for simulation of fluctuating conformations of native proteins. Biopolymers. 24:527546.[Medline]
Ryckaert, J. P., G. Ciccotti, and H. J. C. Berendsen. 1977. Numerical integration of the Cartesian equations of motion of a system with constraints: molecular dynamics of n-alkenes. J. Comp. Phys. 23:327341.
Smith, P. E., and W. F. van Gunsteren. 1994. Consistent dielectric properties of the simple point charge and extended simple point charge water models at 277 and 300K. J. Chem. Phys. 100:31693174.
Still, W. C., A. Tempczyk, R. C. Hawley, and T. Hendrickson. 1990. Semianalytical treatment of solvation for molecular mechanics and dynamics. J. Am. Chem. Soc. 112:61276129.
Teeter, M. M., and D. A. Case. 1990. Harmonic and quasiharmonic descriptions of crambin. J. Phys. Chem. 94:80918097.
Tirado-Rives, J., and W. L. Jorgensen. 1991. Molecular dynamics simulations of an
-helical analogue of ribonuclease-A S-peptide in water. Biochemistry. 30:38643871.[Medline]
Tirion, M. M. 1996. Large amplitude elastic motions in proteins from a single-parameter, atomic analysis. Phys. Rev. Lett. 77:19051908.[Medline]
Tironi, I. G., R. Sperb, P. E. Smith, and W. F. van Gunsteren. 1995. A generalized reaction field method for molecular dynamics simulations. J. Chem. Phys. 102:54515459.
van der Spoel, D., R. van Drunen, and H. J. C. Berendsen. 1999. Groningen Machine for Chemical Simulations. Department of Biophysical Chemistry, Bioson Research Institute, Nijenborgh, Groningen, the Netherlands.
van Gunsteren, W. F., H. J. C. Berendsen, and J. A. C. Rullman. 1981. Stochastic dynamics for molecules with constraints Brownian dynamics of n-alkanes. Mol. Phys. 44:6995.
van Gunsteren, W. F., S. R. Billeter, A. A. Eising, P. H. Hunenberger, P. Kruger, A. E. Mark, W. R. P. Scott, and I. G. Tironi. 1996. Biomolecular Simulation: The GROMOS96 Manual and User Guide. Vdf Hochschulverlag, Zurich, Switzerland.
Vitkup, D., D. Ringe, G. A. Petsko, and M. Karplus. 2000. Solvent mobility and the protein glass transition. Nature Struct. Biol. 7:3438.[Medline]
Vriend, G. 1990. WHAT IF: a molecular modeling and drug design program. J. Mol. Graph. 8:5256.[Medline]
Weaver, L. H., and B. W. Matthews. 1987. Structure of bacteriophage T4 lysozyme refined at 1.7A resolution. J. Mol. Biol. 193:189199.[Medline]
Zhang, X. J., J. A. Wozniak, and B. W. Matthews. 1995. Protein flexibility and adaptability seen in 25 crystal forms of T4 lysozyme. J. Mol. Biol. 250:527552.[Medline]
Zhang, Z., Y. Zhu, and Y. Shi. 2001. Molecular dynamics simulations of urea and thermal-induced denaturation of S-peptide analogue. Biophys. Chem. 89:145162.[Medline]
Zhu, J., Y. Shi, and H. Liu. 2002. Parametrization of a generalized Born/solvent-accessible surface area model and applications to the simulation of protein dynamics. J. Phys. Chem. B. 106:48444853.
This article has been cited by other articles:
![]() |
B. Isin, K. Schulten, E. Tajkhorshid, and I. Bahar Mechanism of Signal Propagation upon Retinal Isomerization: Insights from Molecular Dynamics Simulations of Rhodopsin Restrained by Normal Modes Biophys. J., July 15, 2008; 95(2): 789 - 803. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. G. Su, C. H. Li, R. Hao, W. Z. Chen, and C. Xin Wang Protein Unfolding Behavior Studied by Elastic Network Model Biophys. J., June 15, 2008; 94(12): 4586 - 4596. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. J. Mashl and E. Jakobsson End-Point Targeted Molecular Dynamics: Large-Scale Conformational Changes in Potassium Channels Biophys. J., June 1, 2008; 94(11): 4307 - 4319. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. Zheng and B. R. Brooks Modeling Protein Conformational Changes by Iterative Fitting of Distance Constraints Using Reoriented Normal Modes Biophys. J., June 15, 2006; 90(12): 4327 - 4336. [Abstract] [Full Text] [PDF] |
||||
![]() |