help button home button Biophys. J.
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS

This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Moret, M. A.
Right arrow Articles by Pascutti, P. G.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Moret, M. A.
Right arrow Articles by Pascutti, P. G.

Biophys J, March 2002, p. 1123-1132, Vol. 82, No. 3

New Stochastic Strategy to Analyze Helix Folding

M. A. Moret,*dagger P. M. Bisch,dagger K. C. Mundim,Dagger and P. G. Pascuttidagger

 *Departamento de Física, Universidade Estadual de Feira de Santana, Feira de Santana,  dagger Instituto de Biofísica Carlos Chagas Filho, Universidade Federal do Rio de Janeiro, Rio de Janeiro, and  Dagger Instituto de Química, Universidade de Brasília, Brasília, Brazil


    ABSTRACT
TOP
ABSTRACT
INTRODUCTION
METHODS
RESULTS AND DISCUSSION
CONCLUSIONS
REFERENCES

We propose an alternative stochastic strategy to search secondary structures based on the generalized simulated annealing (GSA) algorithm, by using conformational preferences based on the Ramachandran map. We optimize the search for polypeptide conformational space and apply to peptides considered to be good alpha -helix promoters above a critical number of residues. Our strategy to obtain conformational energies consist in coupling a classical force field (THOR package) with the GSA procedure, biasing the Phi  × Psi  backbone angles to the allowed regions in the Ramachandran map. For polyalanines we obtained stable alpha -helix structures when the number of residues were equal or exceeded 13 amino acids residues. We also observed that the energy gap between the global minimum and the first local minimum tends to increase with the polypeptide size. These conformations were generated by performing 2880 stochastic molecular optimizations with a continuum medium approach. When compared with molecular dynamics or Monte Carlo methods, GSA can be considered the fastest.


    INTRODUCTION
TOP
ABSTRACT
INTRODUCTION
METHODS
RESULTS AND DISCUSSION
CONCLUSIONS
REFERENCES

It is well known that the biological activity depends on the spatial conformation acquired by the macromolecules in the physiological medium. The action of hormones and drugs is also dependent on the molecular three-dimensional structure of the target molecules. In recent years, the atomic description of biological molecules in computational simulations have promoted significant advances in the comprehension of the biological process as well as proposed new insights in the design of molecules to satisfy specific properties.

Biological macromolecules have a large number of degrees of freedom leading to several local minima in the molecular energy hyper-surface. Concerning protein functionality, it is presumed that these molecules express their biological activity when they are close to the global minimum of energy (Yon, 1997). A generalized concern is how to predict the lowest conformational energy through simulation procedures. In that sense, we have developed a computational code to perform stochastic molecular optimization (Moret et al., 1998b) hoping to find the lowest conformational energy.

The available literature on peptide conformational analysis is enormous, since Corey and Pauling (1953) determined the ideal values for all backbone bond lengths and bond angles. The allowed values for the pairs of dihedral angles about the Calpha atoms, which are limited by steric constraints, were determined by Ramachandran et al. (1963) and are summarized in the so-called Ramachandran map.

Important advances are found in the theoretical field of chain topologies prediction (Rooman et al., 1991) and also regarding the folding patterns determined by experimental studies (Richards, 1991; Dobson, 1992; Elöve et al., 1992; Haynie and Freire, 1993; Kiefhaber et al., 1992; Radford et al., 1992; Khorazanizadeh et al., 1993; Evans and Radford, 1994; Clarke et al., 1999) as well as by theoretical ones (Dill, 1990, 1993; Shakhnovich, 1994; Sali et al., 1994; Wolynes et al., 1995; Bryngelson et al., 1995; Pande et al., 1998; Hiltpold et al., 2000).

Helices are the most prevalent secondary structural motif observed in proteins with known structure. Several recent theoretical studies of the helix-coil transitions have shown the relation between the unfolded state, in random coil conformations, and the helical conformational states, by using molecular dynamics (Daggett et al., 1991; Tobias and Brooks, 1991; Wang et al., 1995; Brooks, 1996, Doruker and Bahar, 1997; Bertsch et al., 1998, Hiltpold et al., 2000), Monte Carlo (Sung, 1994, 1995; Wu and Wang, 1998), or other theoretical approaches (Jun and Weaver, 2000; Park and Goddard, 2000). Both standard molecular dynamics and Monte Carlo simulation methods require extended computational time to obtain helical secondary structures.

To search peptide conformational space, we proposed a very fast stochastic procedure consisting of a simulated annealing methodology coupled to a biased search of the energy hyper-surface according to the allowed regions in the Ramachandran map. We applied this procedure to investigate the helical propensities of polyalanines as a function of the polypeptide size, in a continuum medium approach with a dielectric constant epsilon r = 2. To illustrate the method applicability, a more complex system is presented briefly, where we search for the secondary structures in a small protein.

Simulated annealing methods have been applied successfully for the description of a variety of global optimization problems. The power of the simulated annealing methods is due to their suitability for large-scale optimization problems, especially for those in which a desired global minimum is hidden among many local minima. The first nontrivial solution in this sense was proposed to solve combinatorial problems (Kirkpatrick et al., 1983). Their algorithm strictly follows the quasi-equilibrium Boltzmann-Gibbs statistics using a Gaussian visiting distribution. Based on generalized thermostatistics (Tsallis, 1988, 1995; Curado and Tsallis, 1991), Tsallis and Stariolo (1996) proposed the generalized simulated annealing (GSA), or Tsallis machine, approach. Tsallis machine was applied in several problems, such as molecular optimization using classical methods (Moret et al., 1998b) or semi-empirical methods (Mundim and Tsallis, 1996), geophysical problems (Mundim et al., 1998), traveling salesman problem (Penna, 1995a), numerical data fitting (Penna, 1995b), and genetic algorithm (Moret et al., 1998a). GSA has been proven to be the most effective simulated annealing method when considered for optimizing combinatorial problems (Penna, 1995a) and seems to be the most effective one in simulated annealing when considered for optimizing problems in real space (Tsallis and Stariolo, 1996). Our strategy to obtain molecular conformational energies is to couple a classical force field with the GSA procedure and use a Ramachandran map as a tendency for the probable dihedral angles.

We considered as allowed regions of the Ramachandran map, those corresponding to the Phi  and Psi  angles values proposed in the PRELUDE software package (Rooman et al., 1991). These values were computed from comparative statistics of the backbone secondary structure for several amino acid sequences. Table 1 shows the seven possible conformations proposed by this method to predict main-chain topology. These specific angles describe the average conformation of a wide range of proteins with known backbone topology and were also employed by Dandekar and Argos (1994) to perform a polypeptide conformational search using a genetic algorithm. In our strategy we used these seven values (Table 1) to define the center of each allowed region for the dihedral coordinates. Dandekar and Argos (1994) used an empirical potential based on the hydrophobic contribution of each amino acid. Our procedure uses a classical force field that describes all atoms in the system whereas conformations are obtained by a stochastic method, i.e., GSA. The stochastic procedure is used to scan the peptide main chain oriented by Ramachandran map and also to analyze side-chain conformations.


                              
View this table:
[in this window]
[in a new window]
 
TABLE 1   Seven possible conformations for several amino acid sequences

In developing the computational code, for the GSA biased by the Ramachandran map procedure (GSARM), four aspects were considered to create an efficient calculation system. First, the energy function E(phi) is defined in an N-dimension continuous space, where phi epsilon  RN; second, transpose easily the conformational barrier to avoid trapping the system in local minima; third, the search is biased considering seven conformational regions that are defined in the Ramachandran map as proposed by Rooman et al. (1991); and, fourth, the method gives a rapid analysis of the energy hyper-surface, increasing information about Ramachandran maps in addition.

Here we present the GSARM procedure used for recovering the global minimum. We then discuss the results obtained for some molecular structures of simple polypeptides (polyalanines), and the conclusions of this research are presented.


    METHODS
TOP
ABSTRACT
INTRODUCTION
METHODS
RESULTS AND DISCUSSION
CONCLUSIONS
REFERENCES

The THOR program (Arêas et al., 1995; Moret et al., 1998b; Pascutti et al., 1999a,b) was developed to be a comprehensive and a flexible tool to investigate macromolecular structures of biological interest such as proteins and membranes. The computational code is based on a classical force field and considers the GROMOS parameters (van Gusteren and Berendsen, 1987) as well as the corrections from the GROMOS96 version; however, other force fields can be easily implemented. Both molecular dynamics and optimization methods are available in this program. The choice of either of these methods depends on the user needs. For example, if dynamic properties are needed, the software can perform molecular dynamics simulations. To analyze systematically the conformational energy hyper-surface or to map the global and local minima we use stochastic methods.

In the THOR program, the conformational energy of the molecule is made up of a sum of bonded and nonbonded terms (Arêas et al., 1995; Pascutti et al., 1999a,b). In this approach, only hydrogen atoms covalently bonded to oxygen or to nitrogen are considered explicitly, whereas CH1, CH2, and CH3 groups are assumed to be an atomic unit. In our stochastic procedure we use a simplified version of the conformational energy, which maintains fixed bond lengths and bond angles within their ideal values. We focus our search on the dihedral angle space (Moret et al., 1998b). Therefore, we analyze the changes of the following energy function:
E=E<SUB>ϕ</SUB>+E<SUB>VdW</SUB>+E<SUB>el</SUB>, (1)
where Ephi is the dihedral angle potential, EVdW is the van der Waals potential, and Eel is the Coulomb potential term (see definitions and used parameters in Moret et al., 1998b).

The search of local and global minima and the mapping of the energy hyper-surface involves the comparison of the energies of two consecutive random conformations. The molecular geometry at steps t + 1 and t, are given by phit+1, and phit, respectively, where phit is an N-dimensional vector that contains all dihedral coordinates (N) to be optimized. Then, for two consecutive steps, they are related by
ϕ<SUB>t+1</SUB>=ϕ<SUB>t</SUB>+&Dgr;ϕ<SUB>t</SUB>, (2)
where Delta phit is a random perturbation of the dihedral angles.

This random perturbation Delta phit allows visiting the potential hyper-surface. Each point in Table 1 defines a central point of seven allowed square regions. To define the extension of these regions we have initially taken large Phi  and Psi  angular variations centered on each point. By testing the methodology on polyalanines, we have verified that most of the optimized structures present variations of the Phi  and Psi  values that did not exceed 3.6° according to the expected central points of Table 1. Therefore, we have considered the conformational search in the GSARM procedure restricted to these seven square areas of side equal to 7.2. To compare the number of cycles necessary for convergence, we have also performed conformational search on the entire Phi  × Psi  space (full GSA procedure).

To generate the random vector phit+1 we use the visiting distribution function g(Delta phit,i) for each component Delta phit,i of the perturbation vector Delta phit, defined as follows:
g<SUB>q<SUB>v</SUB></SUB>(&Dgr;ϕ<SUB>t,i</SUB>)=<FENCE><FR><NU>q<SUB>v</SUB>−1</NU><DE>&pgr;</DE></FR></FENCE><SUP>1/2</SUP> <FR><NU>&Ggr;<FENCE><FR><NU>1−<FR><NU>1</NU><DE>2</DE></FR> (q<SUB>v</SUB>−1)</NU><DE>q<SUB>v</SUB>−1</DE></FR></FENCE></NU><DE>&Ggr;<FENCE><FR><NU>1</NU><DE>q<SUB>v</SUB>−1</DE></FR>−<FR><NU>1</NU><DE>2</DE></FR></FENCE></DE></FR> (3)

×<FR><NU>[T<SUB>q<SUB>v</SUB></SUB>(t)]<SUP>1/(3−q<SUB>v</SUB>)</SUP></NU><DE><FENCE>1+(q<SUB>v</SUB>−1) <FR><NU>(&Dgr;ϕ<SUB>t,i</SUB>)<SUP>2</SUP></NU><DE>[T<SUB>q<SUB>v</SUB></SUB>(t)]<SUP>2/(3−q<SUB>v</SUB>)</SUP></DE></FR></FENCE><SUP>1/(q<SUB>v</SUB>−1)−1/2</SUP></DE></FR>,
where qv is the visiting index, Tq&ugr; is the visiting temperature, and Gamma  is the gamma function. To obtain the Delta phit vector we took a random value for the random vector a = {ai} (0 < ai < 1) and performed a numerical integration of the visiting distribution probability:
a<SUB>i</SUB>=<LIM><OP>∫</OP></LIM>g(&Dgr;ϕ<SUB>t,i</SUB>)dϕ (4)
The integral of gqv(Delta phit) has an analytical solution only for qv = 1 or qv = 2. In the GSA package, the integral is calculated by using a series expansion and by taking the inverse function of a polynomial series, whose expansion has a cutoff at the 17th order, as proposed in Moret et al. (1998b).

The following step is to compare the energy of the new conformation related to the old one; if this energy is lower than the previous one, the new conformation is accepted. To test the acceptance of a new conformation with higher energy we use the generalized Tsallis statistic PqA, given by Tsallis and Stariolo (1996):
P<SUB>q<SUB>A</SUB></SUB>=[1−(1−q<SUB>A</SUB>)[E(ϕ<SUB>t</SUB>)−E(ϕ<SUB>t+1</SUB>)]/T<SUB>q<SUB>A</SUB></SUB>(t)]<SUP>1/(1−q<SUB>A</SUB>)</SUP>, (5)
where qA is acceptance index and TqA is the acceptance temperature. At the limit, where qA right-arrow 1 we recover the Boltzmann-Gibbs statistics used in the conventional Metropolis criterion (Metropolis et al., 1953).

In summary, the algorithm for searching the peptide secondary structure, using GSA or GSARM procedure, is as follows. 1) Fix the parameters (qA, qv). Start, at t = 1, with arbitrary internal coordinates (arbitrary dihedral angles) and a high enough value T0 for the initial visiting and acceptance temperatures: Tqv(1) and TqA(1). 2) Randomly choose a region of the Ramachandran map for each residue using the visiting distribution probability gqv(Delta phit) (Eq. 5). 3) Randomly generate the vector phit+1, given by the visiting distribution probability gqv(Delta phit) (Eq. 5). 4) Calculate the conformational energy E(phit+1) by using the THOR program and the acceptance criterion as follows: if E(phit+1) < E(phit), replace phit by phit+1; if E(phit+1>=  E(phit), run a random number r epsilon  [0, 1]; if r > PqA (acceptance probability) retain phit; otherwise, replace phit by phit+1. 5) Cool the system by decreasing the temperatures Tqv(t) and TqA(t), assuming, for the sake of simplicity, that Tqv(t) = TqA(t):
T<SUB>q<SUB>A</SUB></SUB>(t)=T<SUB>q<SUB>v</SUB></SUB>(t)=T<SUB>q<SUB>v</SUB></SUB>(1) <FR><NU>2<SUP>q<SUB>v</SUB>−1</SUP>−1</NU><DE>(1+t)<SUP>q<SUB>v</SUB>−1</SUP>−1</DE></FR>, (6)
where t is the discrete computational step. 6) Go back to step 2 until the energy Emin(phi) is reached, i.e., when a lower energy value is no longer obtained in several steps. A more general test for the convergence is whether the same final conformation is obtained starting from different initial conditions.

For a sufficiently large number of steps, this procedure assures that the system can escape from any local minimum and explore the entire allowed energy hyper-surface. In the following section the applications for the GSA and GSARM approaches is shown.

To study the helix formation and to compare the two methodologies (GSA and GSARM) we performed a search of minima in the energy hyper-surface of the polyalanines with 5-20 alanine residues. To obtain the dihedral minima structures we have used both GSA and GSARM approaches taking as initial conformations the completely extended ((Phi , Psi ) = (180o, 180o)) conformation for all polyalanines. In the calculations we have fixed all bond lengths and bond angles within their ideal values, as proposed by Corey and Pauling (1953). We have performed a set of simulations with different initial parameters, i.e., T0 = [1, 2, 5, 10, 50, 100] and qA and qv values in the interval [1.1, 2.5] with 0.1 as a step. We observed that global minimum is reached in more than 50% of the simulations using T0 = 100, qA = 1.1, and qv values larger or equal to 2. For each peptide system we performed 180 simulations (90 with GSA and 90 with GSARM). Therefore we performed 2880 simulations (1440 to GSA and 1440 to GSARM). Using this procedure we obtained sets of configurations in local and global minima.


    RESULTS AND DISCUSSION
TOP
ABSTRACT
INTRODUCTION
METHODS
RESULTS AND DISCUSSION
CONCLUSIONS
REFERENCES

The alpha -helix is the classic element of protein structure. Pauling et al. (1951) were the first ones to describe the alpha -helix. They predicted it as a stable and favorable structure in proteins. All the hydrogen bonds of the alpha -helix backbone are aligned along the helical axis with the same orientation. Because a peptide bond has a dipole moment arising from the different polarity of the NH and CO groups, these dipole moments are also aligned along the helical axis. The overall effect is a significant macro-dipole that has the positive pole at the amino end and the negative pole at the carboxyl end of the alpha -helix. The overall energy that stabilizes the alpha -helix came from the attractive contributions due to hydrogen bonds and/or by charge-helix dipole interactions (Shoemaker et al., 1985) as well as from the van der Waals contributions. The alpha -helix formation is a cooperative process where the electrostatic energy has an important role in stabilizing this type of structure, as was shown by Park and Goddard (2000) through ab initio quantum mechanics calculations. These authors found that extending the length of an alpha -helix by adding additional residues increasingly favors the alpha -helix formation. A critical number of amino acids, however, are necessary to stabilize this alpha -helix structure, and an upper limit may also be imposed by the entropy effect. Isolated alpha -helix structures, in fact, would have to be longer than 13 residues to be stabilized by the attractive interactions (Shoemaker et al., 1987; Rogers, 1989; Voet and Voet, 1994).

Different amino acids have been found to present weak though definite preference in favor or against being in alpha -helix structure, and the intrinsic helical propensity of some amino acids has been demonstrated to be position dependent (Petukhov et al., 1999). In this sense, Ala, Glu, Leu, and Met are considered to be good alpha -helix promoters whereas Pro, Gly, Tyr, and Ser are considered to be poor ones (Branden and Tooze, 1991). Such preferences were the main considerations in all early attempts to predict secondary structures from amino acid sequences, but they were not strong enough to obtain accurate predictions.

Short polypeptides and individual protein fragments generally do not form helices in water. The first example resulting in a significant alpha -helix formation in water, near 0°C, was obtained with the C- and S-peptide fragments of ribonuclease A (Kim and Baldwin, 1984). Stable alpha -helix formation was observed in 16-residue alanine-based peptides and was attributed to the high helix-forming potential of alanines (Marqusee et al., 1989). Other experimental measurements on alanine-based peptides have shown that helix nucleation takes place on the millisecond time scale (Clarke et al., 1999).

Molecular dynamics simulations, considering aqueous or implicit solvent, have also tested the stability of polyalanines. Doruker and Bahar (1997) studied the alpha -helix stability in homopeptides of 13 amino acid residues, and they proposed a rank for the amino acids involved where Ala < Val < Ser < Gly. They observed that polyalanine unfolds within few hundreds of picoseconds at 350 K. Starting from all-coil conformations, Hiltpold et al. (2000) observed the helical stabilization of alanine-based polypeptides of ~30 residues, in an implicit solvent model, within the first 30 ns, at 360 K, with an average folding time of 10 ns. By means of torsion-coordinates molecular dynamics, a method that eliminates bond and angle degrees of freedom, Bertsch et al. (1998) observed the alpha -helix formation in a 20-residue alanine peptide in trajectories of 0.5 ns, with a half-life of 210 ps for the helix formation. A shorter alpha -helix folding time of 100 ps was found by Wu and Wang (1998) for a 16-residue polyalanine, with self-guided molecular dynamics, based on the motion guided by an introduced external force.

Stochastic methods have also been applied in polyalanine helix studies (Sung, 1994; Hoffmann and Knapp, 1996). Sung (1994), with the Metropolis-Monte Carlo method and the solvent-referenced interaction, observed the alpha -helix formation in a 16-residue alanine peptide, with different initial conditions, after 15 × 106 steps. In this work we applied the stochastic method GSA, and demonstrated that polyalanines are stable in the alpha -helix structure when the peptide has 13 or more amino acid residues, in a low dielectric constant medium. Furthermore, we show that this stable conformation can be reached in a few thousand steps.

In Tables 2 and 3 are shown the 10 lowest-energy conformations obtained for most of the studied peptides, using both GSARM and GSA procedures. It can be observed that, unless a critical number of residues is attained (~13 residues), the lowest-energy state obtained corresponds to a nearly random conformation, and only few residues are in the alpha -helix region whereas the majority of them is out of this region. Furthermore, for peptides with less than 13 residues, the energy gap between the lowest-energy state and any other lower-energy states is of the order of the available thermal energy at room temperature. On the contrary, for peptides with more than 13 residues, the lowest-energy state corresponds to a state where most of the residues are in the alpha -helix conformation, and the energy gap between the lowest state and the following lower one tends to be very large.


                              
View this table:
[in this window]
[in a new window]
 
TABLE 2   Conformations obtained with GSA and GSARM


                              
View this table:
[in this window]
[in a new window]
 
TABLE 3   Conformations obtained with GSA and GSARM

We summarized our results for polyalanines in Fig. 1, showing the minimum energy conformations of some polyalanines with 5 residues up to 20 residues using these methodologies. In this figure, we note that no conformational preference is observed for peptides with less than 13 residues (Fig. 1, A-F). Peptides with 13 or more residues tended to be stabilized in an alpha -helix structure (Fig. 1, G-J). We observe in Table 2 that the peptide with 13 alanine residues presented an alpha -helix structure that corresponds to the lowest-energy minimum. The peptides with 6, 7, 8, 9, and 11 alanine residues presented an alpha -helix structure corresponding only to its secondary local minimum. On the other hand, in Table 3, all peptides with more than 13 residues presented an alpha -helix conformation corresponding to the lowest-energy minimum.



View larger version (43K):
[in this window]
[in a new window]
 
FIGURE 1   The structures correspond to the lowest minimum of the polyalanines with 5 (A), 6 (B), 7 (C), 8 (D), 9 (E), 11 (F), 13 (G), 15 (H), 18 (I), and 20 (J) residues. It is clearly shown that polyalanines with 13 or more residues have alpha -helix conformations.

In Fig. 2 is shown the Ramachandran map for all minima energy configurations obtained with both GSARM and GSA methods for the pentalanine and for the icoalanine. It was possible to discriminate the most populated regions on the Ramachandran map. We observed that pentalanine (Fig. 2 A) populates more the beta -region than the alpha -region. On the contrary, we observed that icoalanine (Fig. 2 B) populates more the alpha -region than the beta -region. In Fig. 2, C-H, more details of the icoalanine simulations are shown. We noted that N-terminus residues have not been stabilized even around an alpha -region, in accordance with the literature (Munoz and Serrano, 1995). On the other hand, the presented residues showed an alpha -helix structure preference.



View larger version (24K):
[in this window]
[in a new window]
 
FIGURE 2   (A and B) The Phi  and Psi  coordinates of each residue of the minimum energy structures found by using both GSA and GSARM procedures, for pentalanine (A) and icoalanine (B). (C-H) Ramachandran plot of some residues of the icoalanine are shown in detail: N-terminus (C), ALA3 (D), ALA8 (E), ALA12 (F), ALA17 (G), and C-terminus (H).

In Fig. 3 A is shown the energy gap between the global (lowest) minimum and the first local minimum for all polyalanines. In this figure we observe that for peptides with a number of residues less than 12 residues the energy gap tends to increase with the enhancement of the peptide size. On the other hand, peptides with 13-16 residues do not present a large energy gap. Peptides with 17 or more residues presented a large energy gap, and therefore, these peptides are very stable in an alpha -helix conformation. In Fig. 3 B is shown the energy gap between the lowest-energy state and the minimum with the largest number of the residues in the alpha -region. In this figure we also observe that peptides with 13 or more residues tend to have an alpha -helix conformation.



View larger version (12K):
[in this window]
[in a new window]
 
FIGURE 3   (A) Energy gap between the first local minimum and the global minimum for all polyalanines; (B) Energy gap between the structures with a largest number of residues in the alpha -region and the lowest minimum.

Although we have used a simple version for the atomic representation of the peptide in a continuum electrostatic medium of dielectric constant epsilon r = 2, considering the united atom model for aliphatic carbons and the GROMOS force field, the general features, i.e., polyalanine alpha -helix preference above a critical residue number of ~13 amino acids, the energy gap from the alpha -helix to other conformations, and the gap enhancement showing the crescent stabilization as the number of residues increases, are in good accordance with experiments (Shoemaker et al., 1987; Marqusee et al., 1989; Clarke et al., 1999) and previous theoretical predictions (Rogers, 1989; Sung, 1994; Hoffmann and Knapp, 1996; Doruker and Bahar, 1997; Bertsch et al., 1998; Wu and Wang, 1998; Park and Goddard, 2000; Hiltpold et al., 2000).

To compare both approaches, GSARM and GSA, we have analyzed the number of cycles necessary to have convergence as a function of the peptide size (Fig. 4). It is shown in Fig. 4 that the GSARM approach that took less than 1000 steps to converge is a fast method to determine peptide conformations. All energy minima obtained by both GSA and GSARM procedures for the studied polyalanines were recorded, and both procedures lead to the same results, although differences were observed in the number of interactions and consequently the computational time expended in each simulation.



View larger version (12K):
[in this window]
[in a new window]
 
FIGURE 4   The lowest number of cycles necessary to obtain a conformation of minimum energy as function of polypeptide size.

The time step in molecular dynamics simulations is of the order of 0.0005-0.0020 ps. When considering the latter, 50,000 steps are necessary for a 100-ps trajectory and millions of steps to simulate events at a nanosecond time scale. So far, it was reported in the literature that simulations of the alpha -helix folding for a polyalanine had taken 0.1-30 ns (Wu and Wang, 1998; Bertsch et al., 1998; Hiltpold et al., 2000). Usual stochastic methods, such as Metropolis-Monte Carlo, can take millions of steps to simulate the alpha -helix polyalanine folding (Sung, 1994). Applying the GSARM procedure, peptides as long as 20 monomers initially in an extended conformation can adopt an alpha -helical structure in less than 1000 steps. The fact that the global energy minimum is attained in so few steps is probably due to the fractal structure of peptide and protein energy hyper-surfaces, as we have recently suggested (Moret et al., 2001).

To test our procedure on protein fragments we studied the secondary structures of the insect defensin A (Protein Data Bank code 1ICA, Cornet et al., 1995). It is a basic 4-kDa protein that in vivo is excreted in the hemolymph of the flesh fly Phormia terramovae larvae in response to bacterial challenge or tissue injury (Lambert et al., 1989) and is principally active against Gram-positive bacteria. Insect defensin A presents one alpha -helix and two beta -strands stabilized by three disulfide bridges. The energy hyper-surface of the alpha -helix and the two beta -sheet structures were analyzed, by means of the GSA procedure, randomly searching all the main chain Phi  and Psi  angles and all of the side-chain dihedral angles of these structures starting from the native conformation. The remaining residues, including the main-chain and lateral groups, were kept at the native positions.

The results for the alpha -helix domain (HIS13-ARG23) showed that the lowest energy for the unfolded state at a random coil conformation is ~10 kcal/mol higher than the conformational energy of the alpha -helix global minimum. However, when we performed the search in the region of beta -sheet (ARG26-ARG39), we found that the lowest-energy conformation corresponds to a random coil, which is only 0.8 kcal/mol less than the corresponding beta -sheet conformational energy.


    CONCLUSIONS
TOP
ABSTRACT
INTRODUCTION
METHODS
RESULTS AND DISCUSSION
CONCLUSIONS
REFERENCES

The GSA method was used to explore the energy hyper-surface of peptides. This method, based on generalized thermostatistics, is a fast stochastic approach used to determine the allowed global and local energy minima. We have applied this strategy to predict secondary structures of polyalanines. An alpha -helix conformation was found as corresponding to the lowest conformational energy for peptides of 13 or more residues, as suggested in the literature. When the search of the peptide backbone conformations is performed with the GSARM approach, where certain values of dihedral angles Phi  and Psi  in the Ramachandran map are more frequently visited, the simulation generally expends less than 1000 steps to find the global minimum energy for a 5-20-residue polyalanine. Compared with other theoretical approaches based on molecular dynamics or Monte Carlo methodologies, GSARM can be considered as the fastest strategy.

The electrostatic interactions between adjacent dipoles in the peptide backbone are repulsive, so a critical number of residues is needed to be counterbalanced by the attractive interactions, such as H-bonds. Then, short peptides of less than seven residues are flexible, and they show no conformational preference. When the number of residues of the chain increases (8-13 residues) the structures tend to collapse into H-bonded turns, as shown in Fig. 1. Above a critical number of 13 amino acid residues the enhancement of the hydrogen bond number stabilizes the polyalanine in an alpha -helix structure. In fact, for long-chain peptides, most of the possible H-bonds of the backbone tend to be formed. Then, an energy gap arises from the lowest-energy conformation to the next low-energy conformation. This energy gap enlarges with the number of residues.

The helical conformations presented in Fig. 1 were also obtained when we performed a restricted search to one quadrant of the Ramachandran map. In this case, the backbone angles Phi  and Psi  were restrained to the third quadrant, i.e., -180o < Phi  < 0o and -180o < Psi  < 0o, as proposed by Wang et al. (1995) and Tobias and Brooks (1991) to analyze helical propensities.

Finally, we performed a search considering a large value for the dielectric constant (epsilon r = 80) to obtain the lowest-energy minimum for two systems: a 13-alanine peptide and a 14-alanine one. According our simulations these peptides tend to adopt values for their Phi  and Psi  dihedral angles corresponding to the alpha -helix region in the Ramachandran map (i.e., in the third quadrant), but the H-bond formation is poorly maintained. This example as well as the simulations of the defensin show the importance of having a good description of the solvent effect, mainly when we are interested in searching for the protein conformations in its appropriated environment. This environment changes from the low dielectric constant, typical of the protein interior, to the higher one, for residues in contact with the solvent.

According to the Ramachandran map, proteins have forbidden values for the Phi -Psi pairs of dihedral angles. Performing the search for the peptide backbone conformations, exclusively in the allowed regions, the number of steps to achieved GSA convergence decreases drastically. Both approaches, the GSA and the GSARM, have been shown to be applicable to helix folding studies; however, the method proposed here is sufficiently general to be extended to analyze any conformational state of polypeptides and proteins in general.

    ACKNOWLEDGMENTS

The partial financial support from Conselho Nacional de Desenvolvimento Científico e Tecnológico, Coordenação de Aperfeiçoamento de Pessoal de Nível Superior, Fundac'ão de Amparo a Pesquisa do Estado do Rio de Janeiro, and Fundação Universitária José Bonifácio is acknowledged.

    FOOTNOTES

.

Address reprint requests to Dr. P. M. Bisch, Instituto de Biofisica, Universidade Federal do Rio de Janeiro, CCS, Bloco G, Ilha do Fundao, 21919-900 Rio de Janeiro, Brazil. Tel.: 55-21-2562-6575; Fax: 55-21-2280-8193; E-mail: pmbisch{at}biof.ufrj.br.

Submitted May 2, 2000, and accepted for publication December 17, 2001.


    REFERENCES
TOP
ABSTRACT
INTRODUCTION
METHODS
RESULTS AND DISCUSSION
CONCLUSIONS
REFERENCES

Biophys J, March 2002, p. 1123-1132, Vol. 82, No. 3
© 2002 by the Biophysical Society   0006-3495/02/03/1123/10  $2.00



This article has been cited by other articles:


Home page
Biophys. JHome page
R. A. da Silva, L. Degreve, and A. Caliri
LMProt: An Efficient Algorithm for Monte Carlo Sampling of Protein Conformational Space
Biophys. J., September 1, 2004; 87(3): 1567 - 1577.
[Abstract] [Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Moret, M. A.
Right arrow Articles by Pascutti, P. G.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Moret, M. A.
Right arrow Articles by Pascutti, P. G.


HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
Copyright © 2002 by the Biophysical Society.