help button home button Biophys. J.
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS

This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Vila, J. A.
Right arrow Articles by Scheraga, H. A.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Vila, J. A.
Right arrow Articles by Scheraga, H. A.

Biophys J, December 1998, p. 2637-2646, Vol. 75, No. 6

Role of Hydrophobicity and Solvent-Mediated Charge-Charge Interactions in Stabilizing alpha -Helices

Jorge A. Vila,*# Daniel R. Ripoll,§ Myriam E. Villegas,* Yury N. Vorobjev, and Harold A. Scheraga#

 *Universidad Nacional de San Luis, Facultad de Ciencias Físico Matemáticas y Naturales, and Instituto de Matemática Aplicada San Luis, Consejo Nacional de Investigaciones Científicas y Técnicas, 5700 San Luis, Argentina;  #Baker Laboratory of Chemistry and Chemical Biology and  §Cornell Theory Center, Cornell University, Ithaca, New York 14853 USA; and  Department of Biochemistry, FLOB, The University of North Carolina, Chapel Hill, North Carolina 27599 USA

    ABSTRACT
Top
Abstract
Introduction
Methods
Results
Discussion
Summary
References

A theoretical study to identify the conformational preferences of lysine-based oligopeptides has been carried out. The solvation free energy and free energy of ionization of the oligopeptides have been calculated by using a fast multigrid boundary element method that considers the coupling between the conformation of the molecule and the ionization equilibria explicitly, at a given pH value. It has been found experimentally that isolated alanine and lysine residues have somewhat small intrinsic helix-forming tendencies; however, results from these simulations indicate that conformations containing right-handed alpha -helical turns are energetically favorable at low values of pH for lysine-based oligopeptides. Also, unusual patterns of interactions among lysine side chains with large hydrophobic contacts and close proximity (5-6 Å) between charged NH3+ groups are observed. Similar arrangements of charged groups have been seen for lysine and arginine residues in experimentally determined structures of proteins available from the Protein Data Bank. The lowest-free-energy conformation of the sequence Ac-(LYS)6-NMe from these simulations showed large pKalpha shifts for some of the NH3+ groups of the lysine residues. Such large effects are not observed in the lowest-energy conformations of oligopeptide sequences with two, three, or four lysine residues. Calculations on the sequence Ac-LYS-(ALA)4-LYS-NMe also reveal low-energy alpha -helical conformations with interactions of one of the LYS side chains with the helix backbone in an arrangement quite similar to the one described recently by Groebke et al., 1996 (Proc. Natl. Acad. Sci. U.S.A. 93:4025-4029). The results of this study provide a sound basis with which to discuss the nature of the interactions, such as hydrophobicity, charge-charge interaction, and solvent polarization effects, that stabilize right-handed alpha -helical conformations.

    INTRODUCTION
Top
Abstract
Introduction
Methods
Results
Discussion
Summary
References

One of the fundamental problems in the biophysical chemistry of proteins is the mechanism by which the amino acid sequence determines the three-dimensional structure of a protein. This is known as the protein folding problem (Gibson and Scheraga, 1988; Vásquez et al., 1994). Proteins are commonly built from secondary structural elements, mainly alpha -helices, beta -structures, and turn conformations. Research has been focused on the nature of the factors that stabilize these elements of secondary structure to gain insights into the mechanisms involved in protein folding (Richardson, 1976; Zimmerman and Scheraga, 1977a; Chou et al., 1990; Wilmot and Thornton, 1990; Kemp et al., 1991a).

The alpha -helix conformation is not only the most abundant element of secondary structure observed in proteins (Creighton, 1983) but is also the most studied, both theoretically (Poland and Scheraga, 1965, 1970; Finkelstein et al., 1991) and experimentally (Padmanabhan et al., 1990; Bradley et al., 1990; Wojcik et al., 1990; O'Neil and DeGrado, 1990; Lyu et al., 1990). The data obtained in these investigations are very useful for understanding the primary events that take place during the protein folding process. One goal of these studies is to be able to identify short fragments in protein sequences that can form alpha -helices in isolation. These fragments may act as possible chain-folding initiation sites in the whole protein (Anfinsen, 1972; Finkelstein and Ptitsyn, 1976; Matheson and Scheraga, 1978; Presta and Rose, 1988; Montelione and Scheraga, 1989).

In an attempt to understand the roles that the amino acid side chains and the solvent play in determining the structure and stability of alpha -helices, we developed the host-guest technique (Von Dreele et al., 1971a,b; Ananthanarayanan et al., 1971), which was used to evaluate the helix-coil stability constants for each of the 20 naturally occurring amino acids in water (Wojcik et al., 1990). In particular, the Zimm-Bragg parameters (Zimm and Bragg, 1959) sigma  and s for L-alanine, derived from host-guest studies (Platzer et al., 1972) agree substantially well with those derived from studies of tri-block co-polymers (Ingwall et al., 1968). The value of s = 1.08 (at 20°C) for the intrinsic helix-forming tendency of L-alanine implies that short oligopeptides of L-alanine should have a very low helix content. In fact, the observed helix content for a tri-block co-polymer with a central block of 10 residues of L-alanine was indistinguishable from zero (Ingwall et al., 1968). Recently, Kemp and co-workers (1991a,b, 1995, 1996) reported the results of a series of studies of conjugates Ac-Hel1-(ALAn-LYS-ALAm)-NH2 and analogs in aqueous solution. In agreement with the results from the host-guest study, they found s to be close to 1 for L-alanine at 25°C.

However, because many naturally occurring amino acids such as L-alanine are not soluble in water, charged residues have been inserted inside synthetic oligopeptides to solubilize them in studies designed to ascertain the relative helix stability of these amino acids in water (Marqusee et al., 1989; Padmanabhan et al., 1990; O'Neil and DeGrado, 1990; Lyu et al., 1990; Merutka et al., 1990). Among others, Marqusee et al. (1989) found that a synthetic 16-residue alanine-based oligopeptide with the sequence acetyl-AAAAKAAAAKAAAAKA-amide (3K(I)) adopts conformations with an unusually high alpha -helix content. The tendency of this oligopeptide to adopt helical conformations was attributed to a high helical potential of the L-alanine residue. This assumption of a high helical potential for L-alanine is in contrast with the results obtained from host-guest and block co-polymers experiments (Platzer et al., 1972; Ingwall et al., 1968). In an attempt to account for this apparent discrepancy, we carried out simulations (Vila et al., 1992) on the oligopeptides 3K(I) and acetyl-AAQAAAAQAAAAQAAY-amide (AQY) and obtained helix contents in agreement with experimental values. From these computations, we concluded that the lysine and glutamine residues that solubilize 3K(I) and AQY, respectively, confer stability on the alpha -helix conformation. By contrast, our theoretical analysis involving both statistical and molecular mechanics calculations on a 16-residue poly (L)-alanine with no interior lysines or glutamines showed very low helix content, in complete agreement with predictions of the helix fraction based on the s values reported by several authors (Platzer et al., 1972; Ingwall et al., 1968; Kemp et al., 1991a,b, 1995, 1996).

Despite these important results, it appears that a number of interesting questions had not been answered, either theoretically or experimentally. Among them, if both alanine (s = 1.08 at 25°C (Platzer et al., 1972)) and lysine (s = 0.94 at 25°C (Dygert et al., 1976)) are not strong helical formers, as determined by the host-guest technique, why is it that sequences containing charged lysine residues do not prefer extended conformations, as is intuitively assumed? And if the alpha -helical conformation is stabilized by charges as we proposed (Vila et al., 1992), is it possible to identify the nature of the stabilizing interactions?

The roles of electrostatic and hydrophobic interactions for the stability of alpha -helical structures are not yet fully understood. To accomplish this goal, we have carried out simulations of lysine- and alanine-based oligopeptides. Among the sequence patterns studied are Ac-KAK-NMe and Ac-KAAAAK-NMe present in 6K(I) and 3K(I) oligopeptides, respectively, and Ac-Kn-NMe, with n = 2, 3, 4 and 6. In all cases, a pH value of 2 was assumed. The current theoretical study considers the coupling between structure and ionization equilibria by examining the pH-dependent conformational preferences of these short oligopeptides (Ripoll et al., 1996).

    METHODS
Top
Abstract
Introduction
Methods
Results
Discussion
Summary
References

Evaluation of the conformational energy

The evaluation of the conformational energy follows the procedure recently published (Ripoll et al., 1996; Vorobjev and Scheraga, 1997); i.e., the total free energy, E(rp, pH), associated with the conformation rp of the molecule in aqueous solution at a given pH, can be defined by considering a three-step thermodynamic process (cavity creation, polarization of the solvent, and alteration of the state of proton binding) involved in transferring the neutral polypeptide from the gas phase to the aqueous solution, as:
E(<UP><B>r</B></UP><SUB><UP>p</UP></SUB>, <UP>pH</UP>)=E<SUB><UP>int</UP></SUB>(<UP><B>r</B></UP><SUB><UP>p</UP></SUB>)+F<SUB><UP>vib</UP></SUB>(<UP><B>r</B></UP><SUB><UP>p</UP></SUB>)+F<SUB><UP>cav</UP></SUB>(<UP><B>r</B></UP><SUB><UP>p</UP></SUB>)+F<SUB><UP>solv</UP></SUB>(<UP><B>r</B></UP><SUB><UP>p</UP></SUB>)+F<SUB><UP>inz</UP></SUB>(<UP><B>r</B></UP><SUB><UP>p</UP></SUB>, <UP>pH</UP>), (1)
where Eint(rp) is the internal conformational energy of the molecule in the absence of solvent, assumed to correspond to the ECEPP/3 energy (Momany et al., 1975; Némethy et al., 1983, 1992; Sippl et al., 1984) of the neutral molecule; Fvib(rp) is the conformational entropy contribution; Fcav(rp) is the free energy associated with the process of cavity creation when transferring the molecule from the gas phase into the aqueous solution; Fsolv(rp) is the free energy associated with the polarization of the aqueous solution; and Finz(rp, pH) is the free energy associated with the change in the state of ionization of the ionizable groups due to the transfer of the molecule from the gas phase to the solvent, at a fixed pH value.

The contribution to the total free energy from the conformational entropy of the molecule, Fvib(rp), has been approximated by the harmonic vibrational contribution (G<A><AC>o</AC><AC>&cjs1171;</AC></A> and Scheraga, 1969; Zimmerman and Scheraga, 1977b) of each conformation obtained by using the ECEPP/3 potential function. Fcav(rp) describes the free energy of creation of a cavity to accommodate a zero-charged peptide molecule; i.e., all partial atomic charges are set to zero. As shown previously (Sitkoff et al., 1994; Simonson and Brünger, 1994), Fcav(rp) can be considered as the free energy of transfer of a nonpolar molecule from the gas phase to water. This free energy is proportional to the solvent-accessible surface of the molecule.

The pK shift of the ith ionizable group is computed as:
&Dgr;pK<SUB><UP>i</UP></SUB>={[E(PS<SUP>+</SUP><SUB><UP>i</UP></SUB>)−E(PS<SUP><UP>o</UP></SUP><SUB><UP>i</UP></SUB>)]−[E(S<SUP>+</SUP><SUB><UP>i</UP></SUB>)−E(S<SUP><UP>o</UP></SUP><SUB><UP>i</UP></SUB>)]}/ (2)
[(<UP>ln</UP> 10) k<SUB><UP>B</UP></SUB>T],
where E(PSi+) and E(PSio) are the total energies of the ith residue in the ionized and neutral state, respectively, in the protein environment, and E(Si+) and E(Sio) are the total electrostatic energies of the ith single isolated residue in the ionized and neutral state, respectively, in the solvent. The free energy E(PSi+) is obtained under the assumption that all the ionizable residues, other than residue i, are in a state of zero charge. It should be noted that the effect of ionic strength (assumed here to be less than 0.1 M) was not included in the present study for the reasons described previously (Ripoll et al., 1996). The MBE method provides an accurate and stable calculation of both the potential of mean force between ionized groups of the protein and the pK shifts of the ionizable groups as a function of the protein environment. Additional details of the calculation of the electrostatic properties are provided by Ripoll et al. (1996).

Generation of the oligopeptide chains

A large part of the computational work was devoted to identifying low-energy structures that constitute a representative ensemble of the conformations in solution. The conformations corresponding to the different sequences under investigation were constructed using the ECEPP/3 algorithm (Momany et al., 1975; Némethy et al., 1983, 1992; Sippl et al., 1984). The program considers the complete set of backbone and side-chain dihedral angles as the independent variables while the bond lengths and bond angles of the oligopeptide chain are maintained fixed at their ECEPP/3 values.

The free energy terms, Fsolv(rp) and Finz(rp, pH), associated with electrostatic solvation in Eq. 1 are very costly to compute. A full search for the global minimum of the function represented by Eq. 1, requiring the energy-minimization of thousands of conformations, is beyond current computational capabilities. For this reason, a protocol that produces a reasonable sampling of the conformational space defined by E(rp, pH) without minimizing this particular function, was used.

The conformational search

For each sequence, 200 or more local-minimum conformations were obtained from a series of conformational search runs carried out by using a modified version (Ripoll et al., 1996) of the electrostatically driven Monte Carlo (EDMC) method (Ripoll and Scheraga, 1988, 1989; Ripoll et al., 1998; O'Donnell et al., 1996). During each of these runs, no less than 5000 conformations were generated. After energy minimization using the secant unconstrained minimization solver (SUMSL) algorithm (Gay, 1983) in combination with 1) ECEPP/3 or 2) ECEPP/3 plus a surface solvation model (SRFOPT) (Vila et al., 1991), their free energies were computed by using Eq. 1. The objective of these Monte Carlo runs was to sample the low-energy regions of the free energy E(rp, pH). It is worth noting that steps 1 and 2 are associated only with the generation procedure and are intended to enhance the diversity of the conformations generated. The solvation free energy of the conformations was always included in the free energy E(rp, pH). Additional details of the procedure can be found in an earlier publication (Ripoll et al., 1996).

Evaluation of the helix content

As only very short sequences are considered here, a common definition of helix content (Ripoll et al., 1996) is not quite appropriate. Consequently, we have computed two quantities that represent the preference of the residues for the right-handed alpha -helical conformation: 1) FA, the fraction of residues in the A region of the phi -psi map as defined by Zimmerman et al. (1977) (-110° <=  phi  < -40° and -90° <=  psi  < -10°) and 2) FJcoup, the fraction of residues with a Boltzmann-averaged vicinal coupling constant (3JHNalpha ) less than or equal to 6 Hz (Wüthrich, 1986).

Both quantities were computed with the equation:
F<SUB>X</SUB>=n<SUB><UP>X</UP></SUB>/N<SUB><UP>res</UP></SUB>, (3)
where the subscript X stands for A or Jcoup; nX is the number of residues in the A region of the phi -psi map in definition 1 or the number of residues with the Boltzmann-averaged coupling constant (3JHNalpha ) less than or equal to 6 Hz in definition 2. Nres is the total number of residues in the oligopeptide chain.

    RESULTS
Top
Abstract
Introduction
Methods
Results
Discussion
Summary
References

In the present study, EDMC runs were carried out for the several oligopeptide sequences described in Table 1. Four sequences correspond to homo-oligopeptides of the lysine residue. Two other sequences contain lysine and alanine residues in some of the patterns present in sequences studied by Marqusee et al. (1989), whereas the remaining sequences correspond to homo-oligopeptides of alanine. All the EDMC runs were started from randomly generated conformations; i.e., all the initial dihedral angles were produced by a random number generator. For each of the runs for the sequences containing lysine residues, more than 5000 conformations were generated, and the total free energy given by Eq. 1 was computed (see Table 1). For the runs with sequences of homo-oligopeptides of alanine, the numbers of generated conformations exceeded 11,000. 

                              
View this table:
[in this window]
[in a new window]
 
TABLE 1   Summary of the EDMC runs on various oligopeptide sequences

The calculations for lysine homo-oligopeptides composed of two, three, four, and six residues at pH 2 show that these sequences tend to prefer conformations with a high alpha -helix content as can be seen in Table 2. For two and three lysine residues, all the residues of the lowest-energy conformations (shown in Figs. 1 and 2, respectively) lie in region A of the phi -psi map (Zimmerman et al., 1977). However, conformations with all the residues in the A* region (110° >=  phi  > 40° and 90° >=  psi  > 10°), corresponding to the left-handed alpha -helix, are also energetically very favorable. For the two-lysine oligopeptide, the best-optimized conformation with both residues in the A* region is 1.3 kcal/mol higher than the global minimum (with both residues in the A region) whereas, for the three-lysine oligopeptide, the all-A* conformation is only 0.07 kcal/mol higher than the all-A conformation. The Boltzmann-averaged values for the vicinal coupling constants, 3JHNalpha , computed over all accepted conformations, and the computed helix content of each sequence, are listed in Table 2. The low values of FJcoup for the sequences KK and KKK (50% and 33%, respectively) arise from the contribution of coupling constants greater than 6 Hz (from the left-handed alpha -helical conformations) to the Boltzmann average < 3JHNalpha > .

                              
View this table:
[in this window]
[in a new window]
 
TABLE 2   Lowest-energy conformations and overall helix content


View larger version (21K):
[in this window]
[in a new window]
 
FIGURE 1   Lowest energy conformation (E = -95.10 kcal/mol) found for the sequence KK. The amino terminus of the chain is indicated with the letter N, and a dashed line is used to show the distance (in angstroms) between the NH3+ groups of the two lysines. Both LYS residues are in the A (Zimmerman et al., 1977) region of the phi -psi map.


View larger version (75K):
[in this window]
[in a new window]
 
FIGURE 2   Lowest energy conformation (E = -130.73 kcal/mol) for the sequence KKK obtained from a conformational search. A ribbon is used to trace the backbone of the oligopeptide, and the amino terminus of the chain is indicated with the letter N. Dashed lines are used to show the distances (in Å) between the NH3+ groups of the LYS residues. The solvent-accessible surface (Connolly, 1983a,b) of the molecule is displayed by dots. The conformation corresponds to a complete alpha -helical turn with a single hydrogen bond between the NH of the carboxyl-terminal group (-NMe) and the CO of the amino-terminal group (Ac-).

The homo-oligopeptide sequence with four lysine residues also exhibits a large preference for alpha -helical conformations (see Table 2). In this case, the lowest-energy conformation (-166.58 kcal/mol) is fully alpha -helical; however, a second conformation (A*AAA) quite close in energy (-166.46 kcal/mol) also exists. In the latter conformation, both hydrophobic interactions between side chains of residues in positions 1 and 4 and charge-charge solvent-mediated interactions of all NH3+ groups are optimized (see Fig. 3, which also shows the solvent-accessible surface (Connolly, 1983a,b). In particular, it is interesting to observe the arrangements of the side chains of residues 1 and 4. These side chains are packed together, enhancing the hydrophobic interactions described above, whereas the NH3+ groups at the tips of the side chains tend to separate from each other by pointing in opposite directions (Nzeta -Nzeta distance = 8.1 Å). Such favorable close approach of charged amino groups in water, due to polarization of the intervening water, has been demonstrated by ab initio quantum mechanical calculations (Cho et al., 1998) and by observations (Magalhaes et al., 1994) of proteins in the Protein Data Bank (Bernstein et al., 1977).


View larger version (88K):
[in this window]
[in a new window]
 
FIGURE 3   Conformation with the second-lowest energy (E = -166.46 kcal/mol) found for the sequence KKKK. A ribbon is used to trace the backbone of the oligopeptide, and the amino terminus of the chain is indicated with the letter N. This conformation contains three residues in the A region (Zimmerman et al., 1977) of the phi -psi map forming an alpha -helical turn and a single hydrogen bond between the NH of the carboxyl-terminal group (-NMe) and the CO of residue LYS-1. Strong hydrophobic interactions and good solvent polarization effects bring the LYS side chains of residues LYS-1 and LYS-4 together. To achieve this close packing, the backbone of LYS-1 is forced out of the A region into the A* region. The solvent-accessible surface (Connolly, 1983a,b) is displayed by dots to indicate the compactness of the conformation. The NH3+ groups at the tips of the side chains show good solvent exposure. The interactions among LYS side chains placed the NH3+ groups at distances between closest neighbors in space within the range of 12-14 Å in three of four cases; the NH3+ groups of LYS-1 and LYS-4 are at a distance of ~8 Å. The conformation is 0.12 kcal/mol higher in energy than the full alpha -helix, which is the lowest-energy conformation found for the sequence.

For six LYS residues, the Zimmerman et al. (1997) codes for residues LYS-1 to LYS-5 are in the A region of the phi -psi map; i.e., the lowest-energy conformation is mostly alpha -helical, as shown in Table 2. The Boltzmann-averaged values for the vicinal coupling constants, < 3JHNalpha > , indicate the presence of other conformers close in energy with LYS-1 in conformations other than the A region.

Analysis of the lowest-energy conformation for the KKKKKK sequence shows that arrangements of the lysine side chains, similar to those found for the sequence with four lysine residues, are the result of optimization of both hydrophobic and charge-charge solvent-mediated interactions. The packing between the lysine side chains of residues 1-5 and 2-6 can be seen in Fig. 4. It is worth noting that residues 3 and 4 remain in the alpha -helical region with the side chains well exposed to the solvent, as shown in Fig. 4. This conformational preference of the six-residue sequence is associated with a large Delta pKa of all the ionizable groups (in the range of 1.5-2.5 pK units). Even though the Delta pKa values are not very accurate because the calculations at pH 2 were carried out by assuming zero salt concentration, nevertheless, these values indicate repulsive charge-charge interactions among the ionizable groups that are counterbalanced by increasing the hydrophobic packing in conformations other than the alpha -helix. It is important to point out that we have not found large shifts in the pKa values of the lysine residues in the lowest-energy conformations from runs with shorter sequences of lysine residues. The fact that conformations other than the alpha -helix are the most probable for the sequence KKKKKK is consistent with experimental observations showing that poly-lysine is not helical at this pH (Applequist and Doty, 1962).


View larger version (25K):
[in this window]
[in a new window]
 
FIGURE 4   Top (a) and side (b) views of the lowest-energy conformation for the sequence KKKKKK. A ribbon is used to trace the backbone of the oligopeptide, and the amino terminus of the chain is indicated with the letter N. This conformation has an 83% helix content (5/6 of all residues are in the A region (Zimmerman et al., 1977) of the phi -psi map). The interactions among LYS side chains position the NH3+ groups at distances within the ranges 9.0-13.0 Å and 5.3-7.0 Å.

As already mentioned, the two-lysine oligopeptide has a tendency to adopt conformations compatible with the alpha -helical structure (both left- and right-handed). The sequence KKK, shown in Table 2, behaves similarly. This behavior is altered by inclusion of one alanine residue between two lysine residues (sequence KAK, shown in Table 2). In the latter case, only the right-handed alpha -helical conformations is preferred. Inclusion of four alanine residues between two lysine residues (see results for the sequence KAAAAK in Table 2) shows reduced alpha -helix content compared with the observations for the KAK, KKKK, and KKKKKK sequences. The analysis of the pattern KAAAAK is important for three reasons. First, this pattern appears in the sequences identified as 3K(I), 3K(II), and 4K in the studies by Marqusee et al. (1989) on short 16-residue alanine-based peptides. Second, a similar pattern has been used in more than 50 lysine-containing peptides in studies by Chakrabartty et al. (1994) aimed at determining helix propensities of the naturally occurring amino acids. Furthermore, this pattern is present in synthetic peptides used to determine the amino- and carboxyl-capping preferences of all 20 amino acids (Doig and Baldwin, 1995). And third, based on experimental studies of a series of alanine-rich peptides containing a single lysine residue (conjugates Ac-Hel1-(ALAn-LYS-ALAm)-NH2), Groebke et al. (1996) proposed a plausible structural model for the stabilization of the alpha -helical conformation by the lysine residue.

Among all the accepted conformations obtained in our simulations for the sequence KAAAAK, there are some that correspond to full alpha -helical structures. A graphical inspection of these alpha -helical conformations shows at least one, displayed in Fig. 5, in which one of the lysine residues (K6) interacts with the alpha -helical backbone in a pattern that quite closely resembles the one described by Groebke et al. (1996). However, this conformation, having a total energy E = -139.39 kcal/mol, is not the lowest energy found. In fact, the lowest-energy conformation (E = -143.16 kcal/mol) for the sequence KAAAAK, shown in Fig. 6, is not fully alpha -helical (see Zimmerman et al. (1977) code in Table 2). Perhaps the most important observation is the tendency of lysine side chains to adopt packed conformations with optimal hydrophobic and charge-charge solvent-mediated interactions, as observed for the other oligopeptides previously described. The interactions between the two lysine residues provide the extra stabilization energy that makes this conformation preferred over the full alpha -helix.


View larger version (31K):
[in this window]
[in a new window]
 
FIGURE 5   An alpha -helical conformation with relatively low energy (E = -139.39 kcal/mol) encountered in the conformational search for the sequence KA4K, with the amino terminus of the chain indicated with the letter N. A ribbon is used to trace the backbone of the oligopeptide. Residue LYS-6 interacts with the alpha -helical backbone in the manner suggested by Groebke et al. (1996) based on NOEs observed in 1H NMR spectra of selectively deuterated analogs of Ac-Hel1-(ALAn-LYS-ALAm)-NH2. The distances 4.5 and 3.9 Å between Cbeta of LYS-6 and Calpha of ALA-3 (a) and Nzeta of LYS-6 and the carbonyl O of ALA-2 (b), respectively, are indicated.


View larger version (84K):
[in this window]
[in a new window]
 
FIGURE 6   Lowest-energy conformation (E = -143.16 kcal/mol) encountered in the EDMC run for the sequence KA4K. This sequence is the repeating unit in the peptide 3K(I) of Marqusse et al. (1989). The backbone of the oligopeptide is traced with a ribbon, and the amino terminus of the chain is indicated with the letter N. The solvent-accessible surface (Connolly, 1983a,b) is displayed by dots to indicate the compactness of the conformation. The distance between Nzeta of LYS-1 and Nzeta of LYS-6 is 9.4 Å. The conformation has a helix content of 67% using the definition for FA (or FJcoup) given by Eq. 3. The characteristic pattern of side-chain/side-chain interactions between LYS residues is again observable. The LYS side chains show a preference for packed conformations instead of being extended and completely exposed to the solvent.

The Boltzmann-averaged values of the vicinal coupling constants, < 3JHNalpha > , for the residues of the sequence KAAAAK (Table 2) indicate that four of six residues are in the alpha -helical region of the phi -psi map. Right-handed alpha -helix contents of 67% are obtained for both FA and FJcoup, using the definitions given by Eq. 3. These results are consistent with a weak helix-forming tendency of L-alanine (Vila et al., 1992).

Runs for the homo-oligopeptides of alanine show a lesser tendency to adopt alpha -helical conformations than those sequences in which lysine was included. As shown in Table 2, the lowest-energy conformation for the sequence AAAAAA has only 33% and 0% helix content, using the definitions for FA and FJcoup, respectively. For the sequence AAA, on the other hand, the lowest-energy conformation is fully alpha -helical, but a significant number of conformers similar in energy but with residues in non-alpha -helical regions are present. These conformers contribute to produce a helix content, FJcoup, of zero for the AAA sequence.

Table 1 shows that runs for the alanine homo-oligopeptides were terminated after 1000 accepted conformations whereas the runs for oligopeptides containing lysine residues show a smaller number of accepted conformations. The reason for these differences is that the former runs required considerably less CPU time. For example, the CPU requirements for one of the runs with the sequence KKK is more than 100 times that of the run for the sequence AAA, whereas the former run led to five times fewer accepted conformations.

    DISCUSSION
Top
Abstract
Introduction
Methods
Results
Discussion
Summary
References

The novel and distinctive effects observed in our simulations are the following. First, the LYS side chains in the lowest-energy conformations for all these sequences show a clear tendency to pack among themselves. As the conformations are generated through energy minimization with ECEPP/3 (or ECEPP/3 plus a surface solvation term), the nonbonded interactions (that contribute largely to Eint (rp)) play a major role in defining the side-chain packing. The lysine side-chain arrangements show both good hydrophobic contact and good solvated conformations for the NH3+ groups. Typical distances between NH3+ groups fall into the two ranges: 5.5-8.0 Å and 9.5-13 Å.

Second, the low-energy conformations of these oligopeptides consistently show a preference for the lysine side chains to interact among themselves rather than adopt conformations far from each other, as is commonly assumed. Although this could be seen as an artifact of the simulations, we note that there are unusual arginine-arginine contacts (Magalhaes et al., 1994) and also lysine-lysine contacts, similar to the ones observed for lysine residues in our simulations, in many protein molecules from the Protein Data Bank (Bernstein et al., 1977). Calculations carried out by Magalhaes et al. (1994) on the guanidinium/guanidinium ion pair and by Cho, K.-W., K. T. No, and H. A. Scheraga (unpublished) on pairs of ionized methylamines reveal that a possible explanation for these arrangements is the bridging role of the water molecules that keeps the positively charged groups close to each other. Fig. 7 illustrates one example of lysine-lysine pairs in a protein, similar to the pair arrangements found in the simulations presented here.


View larger version (25K):
[in this window]
[in a new window]
 
FIGURE 7   Fragment from the x-ray crystallographic structure of staphylococcal nuclease (Wynn et al., 1997), containing two interacting pairs of lysine residues. The backbone of the fragment, involving residues G55 to I72, is highlighted with a red ribbon. The side chains of the lysine residues are shown in white. In close resemblance with the conformations encountered in our simulations, the aliphatic portions of the lysine side chains form strong hydrophobic contacts bringing the Nzeta atoms of neighboring side chains at distances of 4.1 and 6.2 Å for the K63-K64 and K70-K71 pairs, respectively. In particular, the pair K70-K71 is well exposed to the solvent, and all the side-chain heavy atoms have low temperature factors. The x-ray crystallographic structure of staphylococcal nuclease shows 11 pairs of lysine residues with distances less than 10 Å between the NH3+ groups.

Third, short alanine sequences containing three or six residues do not show a strong preference for alpha -helical conformations (as seen in Table 2), when compared with the behavior of lysine oligopeptides, in complete agreement with our previous experimental (Ingwall et al., 1968) and theoretical (Vila et al., 1992) results. Although a beta 1-alpha 4 interaction free energy (a hydrophobic bond formed between the Cbeta of the ith residue and the Calpha of the (i + 3)th residue, with i increasing toward the amino terminus) of -0.3 kcal/mol exists in a poly(L-alanine) alpha -helix (Némethy and Scheraga, 1962), the strength of this interaction overcomes the non-hydrogen-bonded defects at the ends of an alpha -helix only when the helix becomes sufficiently long (Bixon et al., 1963); this is consistent with the absence of helix content in a 10-residue poly(L-alanine) chain (Ingwall et al., 1968) but the presence of helix to the extent of ~75% (at 0°C) when the chain length attains a value of 160 (Ingwall et al., 1968).

Fourth, for the pattern KAAAAK included in the 3K(I) peptide of Marqusee et al. (1989), only a fraction of the chain remains alpha -helical. Thus, according to our calculations, inclusion of alanine residues does not appear to stabilize the helical conformation in short oligopeptides. An outstanding feature of the lowest-energy conformation for the sequence KAAAAK is the arrangement of a pairwise interaction between the LYS-LYS side chains similar to the one described in the first point above (see Fig. 6).

It is worth mentioning with respect to the fourth point above that studies of conjugates Ac-Hel1-(ALAn-LYS-ALAm)-NH2 (with n = 2-5 and m = 1-7) and analogs in aqueous solution by Groebke et al. (1996) show that the intrinsic helix-forming tendency of L-alanine is close to 1.0 (in the range of 1.01-1.02) at 25°C. Besides being in close agreement with the intrinsic helix-forming tendency of L-alanine proposed by Scheraga and co-workers (Platzer et al., 1972; Ingwall et al., 1968) and with model predictions by Vila et al. (1992), the results of Groebke et al. (1996) also indicate a significant population of packed conformations compatible with interactions between lysine and the helix backbone; i.e., the CH2 groups of the side chain of lysine (at position i) interact with the alpha -CH of alanine at position (i-3), and the protons of the NH3+ group of lysine interact with the carbonyl oxygen of residue (i-4), with i increasing toward the alpha -carboxyl terminus. Based on these results, Groebke et al. (1996) concluded that "helix stability is controlled primarily by interactions of the lysine side chain with the helix barrel, and only passively by the alanine matrix." The type of interaction described by Groebke et al. (1996) was observed in our simulations for the sequence KAAAAK (Fig. 5). However, our lowest-energy conformation for KAAAAK is not fully helical. The presence of a second LYS residue in KAAAAK led to other conformations with close packing between the lysine side chains, resulting in lower energy. This side-chain packing seems to be due to both strong hydrophobic interactions and optimal solvent polarization effects (Fig. 6). A theoretical investigation of the behavior of alanine-based sequences with a single ionizable residue, i.e., lysine, glutamic acid, arginine, etc., such as the one studied by Groebke et al. (1996), is currently in progress.

With regard to the interaction of lysine with the helix backbone, although quite important, this may not be the only explanation for the unusual helix formation in the short alanine-based peptides studied by Marqusee et al. (1989), as substitution of three LYS residues in the sequence of 3K(I) by GLU residues (with the sequence identified as 3E) leads to an oligopeptide with almost the same helix content as 3K(I). However, as the side chain of a GLU residue is shorter than that of a LYS residue, the stabilization effects due to side-chain/helix/backbone interactions are expected to be smaller.

Our discrimination between the conformational preferences for alanine and lysine residues leads us to infer that the unusual high helix content observed by Marqusee et al. (1989) in short alanine-based peptides is due, mainly, to solvent polarization effects and possibly hydrophobic side-chain/side-chain interactions between the lysine residues.

    SUMMARY
Top
Abstract
Introduction
Methods
Results
Discussion
Summary
References

In sequences containing less than seven lysine residues, two or more lysine side chains seem to prefer conformations in which they interact with each other, optimizing hydrophobic interactions as well as the solvent polarization effects due to the ionizable groups. These interactions force these oligopeptides to adopt specific conformational patterns; in particular, the alpha -helix is among the most commonly (low-energy) found. As a consequence, a significant reduction in the number of allowed conformations appears to take place when charged groups are included in the sequences. On the other hand, sequences that include alanine residues show a weaker tendency to adopt alpha -helical conformations. These results are in agreement with experimental evidence (Ingwall et al., 1968; Wojcik et al., 1990; Kemp et al., 1991a,b, 1995, 1996; Groebke et al., 1996) showing that alanine is not a strong helix former, contrary to the conclusions of Marqusee et al. (1989). The source of the unusual stability of the helical conformation in short 16-residue alanine-based peptides seems to be associated with the conformational preference of ion pairs of charged lysine residues introduced into the sequences to make them water soluble. The stability of the ion pairs of charged lysines is due, mainly, to hydrophobic interactions and solvent polarization effects that contribute with low free energy of solvation to bring the lysine side chains close together.

    ACKNOWLEDGMENTS

We thank F. Aversa for help with searches of the PDB and A. Liwo and K. Gibson for helpful discussion.

This research was supported by grants from the National Institutes of Health (GM-14312 and 1 R03 TW00857-01), the National Institutes of Health National Center for Research Resources (P41RR-04293 and RR-08102), the National Science Foundation (MCB95-13167), the Agencia Nacional de Promoción Científica y Tecnológica Argentina (PICT0030), and Project P-328402 of the Universidad Nacional de San Luis. M.E.V. acknowledges support from CONICET. Support was also received from the National Foundation for Cancer Research. This research was conducted using the resources of the Cornell Theory Center, which receives funding from Cornell University, New York State, the National Center for Research Resources at the National Institutes of Health, and members of the Center's Corporate Partnership Program.

    FOOTNOTES

Received for publication 6 July 1998 and in final form 27 August 1998.

Address reprint requests to Dr. Harold A. Scheraga, Dept. of Chemistry and Chemical Biology, Cornell University, Baker Laboratory, Ithaca, NY 14853-1301. Tel.: 607-255-4034; Fax: 607-254-4700; E-mail: has5{at}cornell.edu.

Dr. Scheraga is on leave from the Novosibirsk Institute of Bioorganic Chemistry, Novosibirsk, 630090, Russia.

    REFERENCES
Top
Abstract
Introduction
Methods
Results
Discussion
Summary
References

Biophys J, December 1998, p. 2637-2646, Vol. 75, No. 6
© 1998 by the Biophysical Society   0006-3495/98/12/2637/10  $2.00



This article has been cited by other articles:


Home page
Protein Sci.Home page
H. D. Nguyen, A. J. Marchut, and C. K. Hall
Solvent effects on the conformational transition of a model polyalanine peptide
Protein Sci., November 1, 2004; 13(11): 2909 - 2924.
[Abstract] [Full Text] [PDF]


Home page
Biophys. JHome page
J. A. Vila, H. A. Baldoni, D. R. Ripoll, A. Ghosh, and H. A. Scheraga
Polyproline II Helix Conformation in a Proline-Rich Environment: A Theoretical Study
Biophys. J., February 1, 2004; 86(2): 731 - 742.
[Abstract] [Full Text] [PDF]


Home page
Biophys. JHome page
S. Noinville, F. Bruston, C. El Amri, D. Baron, and P. Nicolas