help button home button Biophys. J.
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS

This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Jang, H.
Right arrow Articles by Zhou, Y.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Jang, H.
Right arrow Articles by Zhou, Y.

Biophys J, February 2002, p. 646-659, Vol. 82, No. 2

Folding Thermodynamics of Model Four-Strand Antiparallel beta -Sheet Proteins

Hyunbum Jang,* Carol K. Hall,* and Yaoqi Zhoudagger

 *Department of Chemical Engineering, North Carolina State University, Raleigh, North Carolina 27695-7905; and  dagger Department of Physiology and Biophysics, State University of New York at Buffalo, 124 Sherman Hall, Buffalo, New York 14214


    ABSTRACT
TOP
ABSTRACT
INTRODUCTION
MODELS AND SIMULATION METHOD
RESULTS AND DISCUSSION
CONCLUSIONS
REFERENCES

The thermodynamic properties for three different types of off-lattice four-strand antiparallel beta -strand protein models interacting via a hybrid Go-type potential have been investigated. Discontinuous molecular dynamic simulations have been performed for different sizes of the bias gap g, an artificial measure of a model protein's preference for its native state. The thermodynamic transition temperatures are obtained by calculating the squared radius of gyration Rg2, the root-mean-squared pair separation fluctuation Delta B, the specific heat Cv, the internal energy of the system E, and the Lindemann disorder parameter Delta L. Despite these models' simplicity, they exhibit a complex set of protein transitions, consistent with those observed in experimental studies on real proteins. Starting from high temperature, these transitions include a collapse transition, a disordered-to-ordered globule transition, a folding transition, and a liquid-to-solid transition. The high temperature transitions, i.e., the collapse transition and the disordered-to-ordered globule transition, exist for all three beta -strand proteins, although the native-state geometry of the three model proteins is different. However the low temperature transitions, i.e., the folding transition and the liquid-to-solid transition, strongly depend on the native-state geometry of the model proteins and the size of the bias gap.


    INTRODUCTION
TOP
ABSTRACT
INTRODUCTION
MODELS AND SIMULATION METHOD
RESULTS AND DISCUSSION
CONCLUSIONS
REFERENCES

We have been engaged in a computational study aimed at understanding the basic physical principles underlying protein aggregation, a phenomenon with serious biomedical problems that include a host of ultimately fatal protein deposition diseases (Clark and Steele, 1992; Eaton and Hofrichter, 1990; Gallo et al., 1996; Massry and Glasscock, 1983; Moore and Melton, 1997; Selkoe, 1991; Simmons et al., 1994). The long-term goal of our work is to reveal molecular-level mechanisms underlying protein aggregation and fibril formation. Because protein aggregation often involves the association of beta -strands, or the conversion of alpha -helices to beta -strands (Benzinger et al., 1998, 2000; Burkoth et al., 1998; Esler et al., 1996, 2000; Lazo and Cowning, 1998; Lynn and Meredith, 2000; Sunde et al., 1997; Zhang et al., 2000), we have focused part of our efforts on the development of models of beta -strand proteins. These models must be capable of capturing the essential physical features of real beta -strand proteins and at the same time be simple enough to allow the simulation of multi-protein systems with current computer capability. In this paper we introduce one such model, a four-strand protein interacting via a hybrid Go-type potential (Go and Taketomi, 1978, 1979; Taketomi et al., 1975; Ueda et al., 1978) and describe the folding properties of an isolated model chain as a function of temperature. This sets the stage for a later paper in which we examine the folding of several of these model beta -strand proteins into fibrils.

Low-resolution or simplified protein models are best suited to our purpose because they allow us to study the behavior of single or multi-protein systems over relatively long time scales. These models provide numerous insights into the thermodynamic and kinetic properties of protein folding. The low-resolution models used in simulation studies of protein folding can be divided into two classes: on-lattice models (Bratko and Blanch, 2001; Chan and Dill, 1994; Dill and Stigter, 1995; Go and Taketomi, 1978, 1979; Gupta and Hall, 1997, 1998; Gupta et al., 1999; Harrison et al., 1999, 2001; Kolinski et al., 1995; Lau and Dill, 1989; Miller et al., 1992; Skolnick and Kolinski, 1991; Taketomi et al., 1975; Ueda et al., 1978) and off-lattice models (Dokholyan et al., 1998, 2000; Guo and Thirumalai, 1995, 1996; Guo and Brooks, 1997; Guo et al., 1997; Nymeyer et al., 1998; Pande and Rokhsar, 1998; Shea et al., 2000; Takada et al.,. 1999; Zhou and Karplus, 1997a, 1997b, 1999). The first low-resolution lattice models of proteins were introduced by Go et al. (Go and Taketomi, 1978, 1979; Taketomi et al., 1975; Ueda et al., 1978) in the late 1970s. In their models, the protein was treated as a sequence of beads on a two- or three-dimensional lattice with the native tertiary structure being determined by a set of preassigned attractive residue-residue pairs at the nearest-neighbor lattice sites. Another class of lattice model used to describe protein folding is the HP lattice model proposed by Lau and Dill (1989). To mimic the hydrophobic effect, the protein is modeled as a chain of polar (P) and hydrophobic (H) residues with an attractive potential between nonbonded H beads and zero potential between nonbonded H and P or P and P residues. Simulation results for H-P models have provided useful information on protein folding-refolding pathways for isolated chains (Chan and Dill, 1994; Gupta and Hall, 1997; Gupta et al., 1999; Miller et al., 1992). In recent years these models have been extended to the treatment of protein aggregation for multichain systems (Bratko and Blanch, 2001; Dill and Stigter, 1995; Gupta and Hall, 1998; Harrison et al., 1999, 2001).

Recently, off-lattice models have gained attention as they can more accurately represent the real features of proteins. Guo et al. (Guo and Thirumalai, 1995, 1996; Guo and Brooks, 1997; Guo et al., 1997) have investigated the thermodynamics and kinetics of protein folding using Langevin simulations on off-lattice protein models. Their heteropolymer models consist of N connected beads, which correspond to three types of residues: hydrophobic, hydrophilic, and neutral. They found two characteristic temperatures, a collapse transition temperature Ttheta and a folding transition temperature Tf in their thermodynamic studies of beta -barrel and four-helix bundle proteins. Zhou and Karplus (1997a, 1997b, 1999) recently introduced an off-lattice heteropolymer model for a three-helix bundle protein with a hybrid Go-type potential. They used a bias gap parameter, g, to vary the strength of native contacts relative to that of nonnative contacts. Despite the model's simplicity, thermodynamic studies on this protein-like model exhibit complex protein transitions that have been observed experimentally including a collapse transition, a disordered globule to ordered globule transition, an ordered globule to native transition, and a transition to a surface frozen inactive state (Ptitsyn, 1995).

In this paper we investigate the thermodynamic phase behavior of three different four-strand antiparallel beta -strand peptides: the beta -sheet, the beta -clip, and the beta -twist. These beta -strand peptides each have 39 connected residues (beads) and have different native state conformations as shown in Fig. 1. Nonbonded beads can interact through a hybrid Go-type potential (Go and Taketomi, 1978, 1979; Taketomi et al., 1975; Ueda et al., 1978) modeled as a square-well or square-shoulder potential depending on the value of the bias gap parameter g. Discontinuous molecular dynamic (DMD) simulations (Alder and Wainwright, 1959; Rapaport, 1978; Smith et al., 1996) were performed on model protein systems with bias gaps ranging from 0.3 to 1.5. The bias gap measures the difference in interaction strength between the native and nonnative contacts: the larger it is the more the native state is favored over the nonnative state. Intermediate values of the bias gap are thought to be the most representative of a real protein in their equilibrium and dynamic behavior. By exploring how variations in our model protein's bias gap, an artificial measure of its "preference" for the native state, influence the types of phase behavior observed, we can get a feeling for how a real protein's preference for its native state, as measured for example by the energy difference between the denatured and native state, is manifested in the protein's phase diagram and vice versa.



View larger version (34K):
[in this window]
[in a new window]
 
FIGURE 1   Global energy minimal structures for: (a) the beta -sheet, (b) the beta -clip, and (c) the beta -twist model proteins.


    MODELS AND SIMULATION METHOD
TOP
ABSTRACT
INTRODUCTION
MODELS AND SIMULATION METHOD
RESULTS AND DISCUSSION
CONCLUSIONS
REFERENCES

We consider three different off-lattice protein models whose native states are four-strand antiparallel beta -strand peptides. The global energy minimal structures for the three different beta -strand models are shown in Fig. 1. The "zig-zag" shaped native protein in Fig. 1 a is called a beta -sheet; sometimes it is known as betabellin (Lim et al., 2000). The paper clip and helix-shaped native structures in Fig. 1, b and c are called beta -clip and beta -twist, respectively (Kolinski et al., 1995). A qualitative difference between the beta -sheet structure and the other two structures is immediately apparent. In the native state, the beta -sheet is two-dimensional (planar), whereas the beta -clip and beta -twist have well-defined three-dimensional four-barrel structures. The reduced squared radius of gyration per bead, R<UP><SUB><IT>g</IT></SUB><SUP><IT>2</IT></SUP></UP>/sigma 2N, the total number of native contacts, N<UP><SUB><IT>native</IT></SUB><SUP><IT>total</IT></SUP></UP>, and the reduced squared end-to-end distance, r<UP><SUB><IT>1–39</IT></SUB><SUP><IT>2</IT></SUP></UP>/sigma 2, for the three global energy minimal structures are summarized in Table 1. Here N is the number of beads and sigma  is the diameter of each bead along the chain. Note that chains with a large number of native contacts are generally more compact and consequently have a small value of the squared radius of gyration.


                              
View this table:
[in this window]
[in a new window]
 
TABLE 1   Reduced squared radius of gyration per bead, R<UP><SUB>g</SUB><SUP>2</SUP></UP>/sigma 2N, the total number of native contacts, N<UP><SUB>native</SUB><SUP>total</SUP></UP>, and the reduced squared distance between beads number 1 and 39, r<UP><SUB>1–39</SUB><SUP>2</SUP></UP>/sigma 2, in the global energy minimal structure for the three different beta -strand models

Each model protein contains 39 connected beads. Each bead represents an amino acid residue that can be regarded as being localized at the Calpha atom. Nonbonded beads can interact with each other through a square-well or square-shoulder potential (Zhou et al., 1996, 1997),
u<SUB>ij</SUB>(r)=<FENCE><AR><R><C>∞</C></R><R><C>B<SUB>ij</SUB>&egr;,</C></R><R><C>0,</C></R></AR> <AR><R><C>r≤&sfgr;</C></R><R><C>&sfgr;<r≤&lgr;&sfgr;</C></R><R><C>r>&lgr;&sfgr;</C></R></AR></FENCE> (1)
in which sigma  is the bead diameter, epsilon  is an energy parameter with epsilon  > 0, lambda sigma is the square-well diameter with lambda  = 1.5 throughout, and r is the distance between residues i and j. The quantity Bijepsilon , the square-well depth or square-shoulder height represents the interaction strength between nonbonded residue pair i and j and is defined as
B<SUB>ij</SUB>&egr;=<FENCE><AR><R><C>B<SUB>N</SUB>&egr;</C></R><R><C>B<SUB>O</SUB>&egr;</C></R></AR> <AR><R><C><UP>Native contact</UP></C></R><R><C><UP>Nonnative contact,</UP></C></R></AR></FENCE> (2)
in which BN and BO are measures of the relative strengths of the energies associated with the native and nonnative pair interactions in this Go-type potential (Go and Taketomi, 1978, 1979; Taketomi et al., 1975; Ueda et al., 1978). Nonbonded pairs of beads that are in contact in the global energy minimal structure experience an attractive interaction, i.e., BN < 0, when their square-wells overlap. On the other hand, nonbonded pairs of beads that are not in contact in the global energy minimal structure experience either an attractive interaction (BO < 0) or a repulsive interaction (BO > 0). The parameter BO can be either positive or negative depending on the size of the bias gap parameter g,
B<SUB>O</SUB>=(1−g)B<SUB>N</SUB>, (B<SUB>N</SUB><0, g>0), (3)
in which g is the bias gap (Zhou and Karplus, 1997a, 1997b, 1999). The bias gap measures the ratio of the interaction strength between the native contacts and nonnative contacts. Note that for g > 1, BO > 0; in this case nonnative contacts are repulsive so that the native state structure is strongly favored over any nonnative state structure. For 0 < g < 1, BO < 0; all nonbonded contact pairs are attractive, but native contacts are always more favorable than nonnative contacts. For g = 0, native and nonnative contacts are equally favorable, BO = BN; the model reduces to a homopolymer. The bias gap is an artificial measure of a model protein's preference for its native state; in a real protein this preference for the native state might be measured for example by the energy difference between the native and nonnative state.

The interaction between two bonded beads, i and i + 1, is given by an infinitely deep square-well potential,
u<SUP>bond</SUP><SUB>i,i+1</SUB>(r)=<FENCE><AR><R><C>∞,</C></R><R><C>0,</C></R><R><C>∞,</C></R></AR> <AR><R><C>r<(1−&dgr;)&sfgr;</C></R><R><C>(1−&dgr;)&sfgr;≤r≤(1+&dgr;)&sfgr;</C></R><R><C>r>(1+&dgr;)&sfgr;,</C></R></AR></FENCE> (4)
in which delta  is a flexibility parameter that controls the bond length. The bond length between neighboring beads can be varied over a small distance between the infinitely high potential barriers at (1 - delta )sigma and (1 + delta )sigma . This method for creating a flexible bond length was introduced by Bellemans et al. (1980). The flexible bond-length parameter is set to delta  = 0.1 through the paper.

Simulations were performed using the DMD algorithm (Alder and Wainwright, 1959; Rapaport, 1978; Smith et al., 1996). All simulations were started from randomly generated configurations, which were obtained from self-avoiding random walks. The initial velocities were chosen at random from a Maxwell-Boltzmann distribution at the simulation temperature. This DMD algorithm is very efficient for calculating the properties of systems containing hard spheres or square-well spheres (Smith et al., 1996; Zhou et al., 1997) and is significantly faster than standard molecular dynamics on Lennard-Jones chains. The algorithm proceeds by searching for the next collision time and collision pair, advancing all beads in time to the next collision event, and then calculating the velocity change of the colliding pair. The DMD simulation was conducted in the canonical ensemble with a constant number of particles, volume, and temperature. To maintain constant temperature, the Anderson stochastic collision method (Anderson, 1980) was used. In this method the system's particles collide with imaginary or ghost particles, which serve as an effective heat bath. In the collision with the imaginary particles, the velocity of each bead is reassigned from a Maxwell-Boltzmann distribution at the simulation temperature. A reduced ghost particle density of ng* triple-bond  ngsigma 3 = 0.1, in which ng is the number density of ghost particles, was used to ensure that 1% ~ 10% of the collisions were ghost particle collisions (Zhou and Karplus, 1999; Zhou et al., 1997).

To determine the location of the collapse transition, the mean-squared radius of gyration, Rg2, was determined in which
R<SUP>2</SUP><SUB>g</SUB>≡<FENCE><FR><NU>1</NU><DE>N</DE></FR> <LIM><OP>∑</OP><LL>i=1</LL><UL>N</UL></LIM> [(x<SUB>i</SUB>−x<SUB>c</SUB>)<SUP>2</SUP>+(y<SUB>i</SUB>−y<SUB>c</SUB>)<SUP>2</SUP>+(z<SUB>i</SUB>−z<SUB>c</SUB>)<SUP>2</SUP>]</FENCE><SUB>cf</SUB>, (5)
with N equal to the number of beads, xi, yi, zi equal to the coordinates of bead i, xc, yc, zc equal to the center of mass coordinates of the chain, and < > cf denoting a configurational average. To save computing time, the summation is taken over configurations sampled every 1000 collisions after equilibration. Other transitions were determined by calculating the reduced specific heat,
C<SUP>*</SUP><SUB>v</SUB>≡<FR><NU>C<SUB>v</SUB></NU><DE>k<SUB>B</SUB></DE></FR>=<FR><NU>⟨E<SUP>2</SUP>⟩−⟨E⟩<SUP>2</SUP></NU><DE>k<SUP>2</SUP><SUB>B</SUB>T<SUP>2</SUP></DE></FR> (6)
and the reduced internal energy,
E*≡<FR><NU>⟨E⟩</NU><DE>&egr;</DE></FR> (7)
in which kB is Boltzmann's constant and <  >  denotes an average over all collisions after equilibration. In calculating the specific heat and energy according to Eqs. 6 and 7, the weighted histogram method (Ferrenberg and Swendsen, 1989; Zhou et al., 1997) was used. In this method, temperature-independent degeneracy factors are extracted from the energy probability distribution in simulation runs at different temperatures. Once the degeneracy factors are known, a partition function is obtained for any temperature, and consequently all thermodynamic quantities can be calculated. Details of the method were reported elsewhere (Zhou et al., 1997). By plotting Cv* versus T*, in which T* is the reduced temperature, T* triple-bond  kBT/epsilon , one can determine the equilibrium transitions from the locations of the peaks and plateaus.

The degree of bead mobility can be characterized by the root-mean-squared pair separation fluctuation, Delta B, defined by
&Dgr;<SUB>B</SUB>≡<FR><NU>2</NU><DE>N(N−1)</DE></FR> <LIM><OP>∑</OP><LL>i<j</LL></LIM> <FENCE><FR><NU>⟨r<SUB>ij</SUB><SUP>2</SUP>⟩<SUB>cf</SUB></NU><DE>⟨r<SUB>ij</SUB>⟩<SUB>cf</SUB><SUP>2</SUP></DE></FR>−1</FENCE><SUP>½</SUP>, (8)
in which rij denotes the separation between beads i and j, and <  > cf denotes the configurational average. To characterize the liquid-to-solid transition of the system, the root-mean-squared fluctuation for bead i, Delta Li, defined to be
&Dgr;<SUB>Li</SUB>≡<FENCE><FR><NU>⟨(‖<B><UP>r</UP></B><SUB>i</SUB>‖−⟨‖<B><UP>r</UP></B><SUB>i</SUB>‖⟩<SUB>cf</SUB>)<SUP>2</SUP>⟩<SUB>cf</SUB></NU><DE>&sfgr;<SUP>2</SUP></DE></FR></FENCE><SUP>½</SUP>, (9)
yields the Lindemann disorder parameter,
&Dgr;<SUB>L</SUB>=<FENCE><LIM><OP>∑</OP><LL>i</LL></LIM> <FR><NU>&Dgr;<SUB>Li</SUB><SUP>2</SUP></NU><DE>N</DE></FR></FENCE><SUP>½</SUP> (10)
in which bead positions, ri (i = 1 to N), are obtained by shifting the center of mass coordinates to the origin. The quantity Delta Li in Eq. 9 defines the Lindemann disorder parameter for an individual bead, and Delta L in Eq. 10 defines the Lindemann disorder parameter. The Lindemann criterion (Lindemann, 1910) is applied to the system to determine the solid or liquid phase. A system is regarded as a solid if its Lindemann disorder parameter, Delta L, is in the range of 0.1 to 0.15 (Bilgram, 1987; Löwen, 1994; Stillinger, 1995; Zhou et al., 1999b), whereas a substance with Delta L > 0.15 is considered liquid.

The progression of a conformation toward the native state can be monitored by introducing the fraction of native contacts formed (Lazaridis and Karplus, 1997; Sali et al., 1994), Q, defined by
Q=<FR><NU>N<SUB>native</SUB></NU><DE>N<SUB>native</SUB><SUP>total</SUP></DE></FR>, (11)
in which Nnative represents the number of native contacts in a given conformation and N<UP><SUB><IT>native</IT></SUB><SUP><IT>total</IT></SUP></UP> is the total number of native contacts in the global energy minimal structure. The fraction of native contacts Q has a value between 0 and 1. When Q = 1 the model system can be regarded as being in the native structure, whereas when Q right-arrow 0 the chain becomes a random coil that indicates the denatured state.

Simulations were performed for up to 109 collisions to ensure equilibration. Each simulation was started from a different initial configuration at the temperature of interest. Equilibrium averages were typically taken by discarding the first one-half of the simulation data. All equilibrium results were averaged over at least four independent runs. For simulations at low temperature, a random initial configuration was generated at a relatively high temperature of T* = 1.0 and then gradually annealed toward the target temperature after which the equilibrium simulation was performed. This procedure can prevent the system from becoming trapped in a metastable state, i.e., a local free-energy minimum. Simulations were performed for bias gaps in the range of g = 0.3 to 1.5 at reduced temperatures in the range T* = 0.07 to 5.0.


    RESULTS AND DISCUSSION
TOP
ABSTRACT
INTRODUCTION
MODELS AND SIMULATION METHOD
RESULTS AND DISCUSSION
CONCLUSIONS
REFERENCES

Thermodynamic properties of beta -sheet

DMD simulations were performed for the beta -sheet protein to investigate phase behavior as a function of the size of the bias gap. At bias gaps of g = 0.3, 0.7, 0.9, and 1.3, thermodynamic averages including the squared radius of gyration Rg2, the root-mean-squared pair separation fluctuation Delta B, the specific heat Cv, and the internal energy E were calculated as a function of temperature. These results for g = 0.3, 0.7, 0.9, and 1.3 are shown in Fig. 2 (a-d), respectively. All quantities are described in terms of reduced units in the figures. Statistical averages were obtained over at least four independent simulations. The error bars are the standard deviation in the measured values and are only shown for values larger than the size of the symbol. The results for the specific heat and the energy were obtained using the same weighted histogram method (Ferrenberg and Swendsen, 1989) that was introduced in the study of homopolymers (Zhou et al., 1997). The equilibrium transition temperatures of the model can be identified from the maximal value of the temperature derivative of the squared radius of gyration, dRg2/dT*, and/or from peaks or plateaus in the specific heat.



View larger version (19K):
[in this window]
[in a new window]
 
FIGURE 2   Average values of the reduced squared radius of gyration per bead, < Rg2/sigma 2N> , the root-mean-squared pair separation fluctuation, < Delta B> , the reduced specific heat per bead, < Cv*/N> , and the reduced internal energy per bead, < E*/N> , as a function of the reduced temperature, T*, for the beta -sheet at four selected values of the bias gap, (a) g = 0.3, (b) g = 0.7, (c) g = 0.9, and (d) g = 1.3.

To determine the collapse transition, the temperature derivative of the squared radius of gyration, dRg2/dT*, is calculated along a spline fit of the data in Rg2 versus T* graphs as shown in Fig. 2. For a small bias gap, g = 0.3, in Fig. 2 a the maximal value of dRg2/dT* occurs at T* = 1.47, indicating the collapse transition temperature. For g = 0.7, 0.9, and 1.3 in Fig. 2, b, c, and d, the collapse transitions occur at T* = 1.0, 0.95, and 0.88, respectively. The chain has a conformational change from a random coil to the disordered globule state at the collapse transition. The disordered globule has more native contacts than the random coil, but due to the disordered nature of the state, none of the structures appear to be well ordered or native-like. The properties of disordered globule are similar to those observed experimentally for a premolten globule (Uversky and Ptitsyn, 1996a). Additional evidence for the existence of the collapse transition at g = 0.3 is that the < Delta B> versus T* curve possesses a broad, but clear, maximum located at a temperature just above T* = 1.47. In addition, a plateau in the specific heat is observed near this temperature. In contrast to the broad peak for g = 0.3 in the < Delta B> curve, for g = 0.7, 0.9, and 1.3, there is a well-defined peak in the < Delta B> curve that evidently supports the collapse transition. The collapse transition peak in the < Delta B> versus T* curve can be explained in the following way. The root-mean-squared pair separation fluctuation < Delta B> increases, as expected, as T* increases but then decreases above the collapse transition temperature because the stretched chain in the random coil state is constrained by chain connectivity, resulting in less fluctuations in the bead-bead separations.

The transition from the disordered globule state to the ordered globule state is associated with a distinct peak in the specific heat. This transition is strong with large energy change indicating a large conformational change. For g = 0.3, the peak is found at a temperature of T* = 0.58, far below the collapse transition. This indicates that the disordered globule is a stable state over a wide temperature range, a trend characteristic of smaller values of g. However, for g = 1.3 the peak is found at a temperature of T* = 0.80, only slightly below the collapse transition temperature. The collapse transition will eventually coincide with the disordered-to-ordered transition for g > 1.5. The ordered globule is similar to the molten globule, a liquid-like phase that exists at temperatures between the native and the unfolded coil state. This molten globule is a thermodynamic (not kinetic) intermediate that has a native-like secondary structure but has deficient side-chain interactions (Ptitsyn, 1995; Ptitsyn and Uversky, 1994; Schulman and Kim, 1996; Uversky and Ptitsyn, 1996b).

The folding transition from the ordered globule state to the native state can be identified from the plateaus in the specific heat. At this transition, there are no major changes in structure, but the chain has a subtle conformational change toward the native state. For the smaller gap models, g = 0.3, no evidence for the folding transition was observed. This is because for g < 0.7, a well-ordered native state does not appear to exist; instead kinked or rolled structures were observed. Consequently, no plateaus associated with the folding transition were observed for the smaller gap models. For the g = 0.7 model, although a peak in the specific heat is found at a temperature of T* = 0.35, the peak is not related to the folding transition. It is associated with the transition from the ordered globule state to a more-collapsed ordered globule state that we will call the partially ordered globule state. For the larger gap models, g = 0.9 and 1.3, the plateaus at T* = 0.40 in the specific heat are associated with the folding transition. This folding transition temperature is much lower than the disordered-to-ordered globule transition. This indicates that the ordered globule is a stable state over a wide temperature range for the larger gap models.

The thermodynamic results presented in Fig. 2 differ from those found in real proteins in that the specific heat in the unfolded state (random coil) is lower than in the folded state (ordered globule) even for the larger gap models, g > 0.9; the opposite is true for real proteins (Makhatadze and Privalov, 1995). The low value for the random-coil specific heat is due to the absence of protein-solvent interactions (Privalov, 1992) in this model, which diminishes the size of fluctuations in the energy in the unfolded state and results in a more stable energy state (Zhou et al., 1999a).

The results for the beta -sheet that we have just described are summarized in Fig. 3, which shows the phases that occur in the space spanned by the reduced temperature T* and the bias gap g. The diagram shows a complex set of protein transitions that is qualitatively similar to those observed for a model three-helix bundle protein (Zhou and Karplus, 1997a, 1997b, 1999) and for the protein-like lattice models (Dinner et al., 1994; Doniach et al., 1996; Pande and Rokhsar, 1998). However, some quantitative differences between the beta -sheet results and the three-helix bundle results are immediately apparent. This is indicated in the figure by drawing solid lines for transitions that are qualitatively the same transitions as those observed for the three-helix bundle protein and dotted lines for transitions which are not observed in the three-helix bundle protein. The upper line in the figure indicates the collapse transition; it separates the random coil phase from the disordered globule phase. Moving down in the figure, the second line from the top indicates the transition from the disordered globule to the ordered globule; it is obtained from the first peak in the specific heat. The third line from the top is a folding transition that separates the ordered globule from the native state for g > 0.7, and a collapse transition to a partially ordered globule (different from the native state) for 0.3 < g < 0.7. As described later in this paper the partially ordered globule state contains kinked or rolled structures that are distinctly different from their native state conformation. Interestingly, for 0.7 < g < 0.9, we observe a transition from the native state to the partially ordered globule state that is not observed in the three-helix bundle protein. This transition is associated with the specific heat peak for g = 0.7 at T* = 0.35 and the small specific heat peak for g = 0.9 at T* = 0.115. This transition and the partially ordered globule phase are not observed for real proteins. Because they occur at small values of the bias gap, they can be regarded as remnants of the model's homopolymer behavior (as g right-arrow 0, the model reduces to a homopolymer) in which a transition to a high-density configuration occurs at low temperatures (Zhou et al., 1996, 1997). For g < 0.9 a transition occurs at low temperature to a solid phase; this is obtained through application of the Lindemann criterion as will be discussed later. No solid phase was observed for g > 0.9 at any temperature investigated. Two triple points are found: one at g = 0.3 and T* = 0.58 where the disordered, ordered, and partially-ordered globule phases meet, and one at g = 0.7 and T* = 0.35 where the native, ordered, and partially-ordered globule phases meet.



View larger version (21K):
[in this window]
[in a new window]
 
FIGURE 3   Phase diagram for the beta -sheet model as a function of the reduced temperature, T*, and the bias gap, g.

Folding of the beta -sheet strongly depends on the bias gap and temperature. A set of sample structures for the beta -sheet characteristic of the final states observed during the simulations is presented in Fig. 4 a for g = 0.7 at T* = 0.3, Fig. 4 b for g = 0.7 at T* = 0.6, Fig. 4 c for g = 1.3 at T* = 0.5, and Fig. 4 d for g = 1.3 at T* = 0.2. The structure in Fig. 4 a corresponds to the partially ordered globule, the kinked or rolled structure that appears at low temperature for g < 0.7. (Recall that no native state structures appear at any temperatures for g < 0.7.) The partially ordered globule state occurs for smaller gap models because the attraction between nonnative contacts disturbs the tendency to order into the native state at low temperatures. The structures in Fig. 4, b and c, are examples of the ordered globule structure. The ordered globule has more native contacts than the disordered globule. As g increases, the structure corresponding to the ordered globule becomes more native-like (compare Fig. 4, b and c). The structure in Fig. 4 d is the native state structure of the beta -sheet. Note that for the off-lattice beta -sheet models with g <=  0.7 none of the structures correspond to the global energy minimal state, but for 0.7 < g < 1.0 the geometry in the native state is very similar to the global energy minimal structure shown in Fig. 1 a so that the number of native contacts is very close to that of the global energy minimal structure. For highly optimized models, g >=  1.0, the native state is the global energy minimal state (Zhou and Karplus, 1999).



View larger version (36K):
[in this window]
[in a new window]
 
FIGURE 4   Final chain conformations of the beta -sheet (a) for g = 0.7 at T* = 0.3 (the partially ordered globule state), (b) for g = 0.7 at T* = 0.6 (the ordered globule state), (c) for g = 1.3 at T* = 0.5 (the ordered globule state), and (d) for g = 1.3 at T* = 0.2 (the native state).

It is interesting to consider the variation in structural properties as folding progress. Fig. 5 shows the average value of the fraction of global energy minimal contacts formed (Lazaridis and Karplus, 1997; Sali et al., 1994), < Q> , as a function of temperature T * and bias gap g. The collapse transition occurs at < Q>  ~ 0.2 for g = 0.3 and at < Q>  ~ 0.3 for g = 1.3. The transition from the disordered globule to the ordered globule occurs at < Q> values in the range of < Q>  = 0.4 ~ 0.5 for all values of g. In the native state, we find that < Q>  > 0.9 for g > 0.7, whereas for the partially ordered globule, < Q> has values in the range from 0.6 to 0.9 depending on the size of g. In the three-dimensional graph, the flat surface at the top of the rear edge corresponds to the native state with < Q>  ~ 1 for g > 0.7. 



View larger version (58K):
[in this window]
[in a new window]
 
FIGURE 5   Average value of the fraction of global energy minimal contacts, < Q> , for the beta -sheet as a function of the reduced temperature, T *, and the bias gap, g.

Further information on the nature of the transition at low temperature can be obtained by calculating the Lindemann disorder parameter (Lindemann, 1910). This parameter is often used to analyze the liquid-to-solid transition (Bilgram, 1987; Löwen, 1994; Stillinger, 1995; Zhou et al., 1999b). The solid phase corresponds to the inactive glassy state that can be observed in many proteins (Ferrand et al., 1993; Frauenfelder and McMahon, 1998; Green et al., 1994; Reat et al., 1998; Tilton et al., 1992). Fig. 6 a shows the average value of the Lindemann disorder parameter, < Delta L> , as a function of temperature for two selected values of the bias gap, g = 0.3 and 1.3. Each result for Delta L at given temperature is a sum of the average of the root-mean-squared fluctuations over all beads. A form of the Lindemann criterion for melting is adopted here in which systems with Delta L < 0.15 are considered solid-like, whereas systems with Delta L > 0.15 are considered liquid. The dotted line in Fig. 6 a indicates the criterion for the liquid-to-solid transition. It can be seen that for g = 1.3, as was seen in the phase diagram, no solid phase is found at any investigated temperatures because Delta L > 0.15, whereas, for g = 0.3 the liquid-to-solid transition occurs at T* = 0.19. This agrees well with the temperature at which the inflection in the root-mean-squared pair separation, < Delta B> , curve is found in Fig. 2 a.



View larger version (13K):
[in this window]
[in a new window]
 
FIGURE 6   (a) Lindemann disorder parameter, < Delta L> , for the beta -sheet as a function of the reduced temperature, T*, for two selected values of the bias gap, g = 0.3 and 1.3. (b) Lindemann disorder parameter for an individual bead, Delta Li, for the beta -sheet as a function of the reduced distance of bead i from the origin, |ri|/sigma , for g = 0.3 at T* = 0.1, 0.3, and 0.6. (c) Same as (b) but for g = 1.3 at T* = 0.2, 0.6, and 0.8.

Fig. 6, b and c, shows the Lindemann disorder parameter for the individual beads i, Delta Li, as a function of the reduced distance of bead i from the origin, |ri|/sigma , for Fig. 6 b, g = 0.3 at T* = 0.1, 0.3, and 0.6, and Fig. 6 c, for g = 1.3 at T* = 0.2, 0.6, and 0.8. Note that the bead positions, ri, are defined with respect to the origin at the chain's center of mass. In Fig. 6 b, which corresponds to g = 0.3, the three selected temperatures correspond to the three different phases of the chain: the solid-like phase, partially ordered globule, and disordered globule, respectively. At high temperature, T* = 0.6, a large degree of freedom in the bead motion is observed. At low temperature, T* = 0.1, the small bead fluctuations and small bead-bead separations give rise to a solid-like phase for the chain. In Fig. 6 c, which corresponds to g = 1.3, the three selected temperatures correspond to the three different phases of the chain: the native state, ordered globule, and disordered globule, respectively. At high temperature, T* = 0.8, the large bead fluctuations in the Delta Li graph indicate that the chain undergoes a transition from the disordered globule to the ordered globule. At low temperature, T* = 0.2, the results of the individual bead fluctuations correspond to the native state. The native protein is regarded as a surface-molten solid, which means that the interior of native protein is solid-like, but the surface is liquid-like phase (Zhou et al., 1999b). However we find that the native beta -sheet at T* = 0.2 is not a surface-molten solid, because all the values of Delta Li are greater than 0.15, which means that the system is considered liquid. Therefore, the native beta -sheet is more like a molten globule because all of the beads exhibit liquid-like motion. The reason that the native beta -sheet is not a surface-molten solid is that it is planar, and consequently core beads have considerable freedom in their motion; they can move perpendicular to the molecular plane even in the native state. This can be regarded as a case in which the topology determines the phase behavior.

It is of interest to determine the extent to which this model displays thermodynamic two-state cooperativity (Ptitsyn, 1995). The most direct way to do this is to examine the shape of the free energy distribution in the transition region to see if it is bimodal. Accordingly, the energy distributions in the vicinity of transitions displayed in Fig. 3 have been investigated. Fig. 7 shows the energy distribution in Fig. 7 a for g = 0.9 at T* = 0.7949 (the disordered-to-ordered globule transition), in Fig. 7 b for g = 0.9 at T* = 0.3998 (the folding transition), and in Fig. 7 c for g = 0.3 at T* = 0.5848 (the disordered-to-partially-ordered globule transition). Fig. 7 a seems to indicate that the g = 0.9 model has a bimodal free energy distribution. As pointed out by Zhou and Karplus (1999), the logarithm of the energy distribution as calculated via the weighted histogram method is directly proportional to the free energy, and the existence of a bimodal shape in the free energy distribution indicates the presence of a two-state transition (Zhou et al., 1996, 1997). Nevertheless, to further verify the existence of a two-state transition, it is useful to also apply the calorimetric criterion for chain models of protein. The criterion says that to have two-state cooperativity in protein folding, Delta HvH/Delta Hcal approx  1 around the transition, where Delta HvH is the van't Hoff enthalpy around the peak of the specific heat after baseline subtraction and Delta Hcal is the calorimetric enthalpy change associated with the entire transition (Jackson and Brandts, 1970; Freire, 1995). We evaluate the criterion for the collapse transition of the large bias gap models using two different equations for the van't Hoff enthalpy, Delta HvH(T) = 2T<RAD><RCD><IT>k</IT><SUB>B</SUB><IT>C</IT><SUB>p</SUB><IT>(T)</IT></RCD></RAD> (Chan, 2000; Kaya and Chan, 2000) and Delta H<UP><SUB><IT>vH</IT></SUB><SUP><IT>+</IT></SUP></UP>(T) = 4kBT2Cp(T)/Delta Hcal (Zhou et al., 1999a). For the transition in Fig. 7 a, we find that Delta HvH/Delta Hcal ~ 0.48 and Delta H<UP><SUB>vH</SUB><SUP>+</SUP></UP>/Delta Hcal ~ 0.23 at the midpoint temperature. These ratios are significantly smaller than unity, indicating that either the collapse transition is not calorimetrically two-state or the baseline for the specific heat curve is not accurate (Zhou et al., 1999a). The ratios increase with the size of the bias gap, indicating that the collapse transition becomes more cooperative, but none of them becomes equal to or close to unity for the beta -sheet models we investigated. We suspect that for highly optimized models, i.e., even larger bias gaps, the transition will become more protein-like, displaying the two-state "all-or-none" character associated with temperature-induced protein denaturation.



View larger version (15K):
[in this window]
[in a new window]
 
FIGURE 7   Energy distribution in the transition region for the beta -sheet as a function of energy (a) for g = 0.9 at T* = 0.7949 (the disordered-to-ordered globule transition), (b) for g = 0.9 at T* = 0.3998 (the folding transition), and (c) for g = 0.3 at T* = 0.5848 (the disordered-to-partially-ordered globule transition).

The folding transition in Fig. 7 b for g = 0.9 is not two-state because no bimodal energy distribution occurs. This indicates that the ordered globule and native states of the beta -sheet are geometrically similar to each other. Hence, the folding transition is weak with small energy change and is continuous between the two states. Note that this transition is associated with the plateaus in the specific heat as seen in Fig. 2, c and d. For the transition from the disordered globule state to the partially ordered globule state in Fig. 7 c at g = 0.3, the transition is continuous even though it is associated with a distinct peak in the specific heat as seen in Fig. 2 a. The lack of a bimodal energy distribution for g = 0.3 suggests that the disordered and partially ordered globule structures are similar to each other.

Thermodynamic properties of beta -clip and beta -twist

The thermodynamic averages, < Rg2/sigma 2N> , < Delta B> , < Cv*/N> , and < E*/N> for the beta -clip and beta -twist models, are shown as a function of T* in Fig. 8, a and b (beta -clip) and Fig. 8, c and d (beta -twist). For brevity the figures only show results for two selected values of the bias gap, g = 0.3 and 1.3. For the small bias gap, g = 0.3, the results in Fig. 8 a for the beta -clip and Fig. 8 c for the beta -twist are qualitatively similar to those in Fig. 2 a for the beta -sheet model at g = 0.3. However, for the large bias gap, g = 1.3, the thermodynamic averages for both the beta -clip model and the beta -twist model (Fig. 8, b and d, respectively) are qualitatively different from those of the beta -sheet model at g = 1.3 in Fig. 2 d. The differences are that kinks in the < Rg2/sigma 2N> and < Delta B> curves at low temperatures and a second peak in the specific heat in Fig. 8, b and d are observed for both the beta -clip model and the beta -twist model, but not for the beta -sheet model in Fig. 2 d. The small peak in the specific heat is related to the folding transition, whereas the kinks in the < Rg2/sigma 2N> and < Delta B> curves at low temperatures are connected with the liquid-to-solid transition. The reason for the increases in < Rg2/sigma 2N> at low temperatures for both the beta -clip model and the beta -twist model at g = 1.3 is that the Rg2 value of the native protein is larger than that of the ordered globule, indicating that the collapsed chain undergoes a subtle conformational change from the ordered globule to the native state by elongating its strand.



View larger version (19K):
[in this window]
[in a new window]
 
FIGURE 8   Average values of the reduced squared radius of gyration per bead, < Rg2/sigma 2N> , the root-mean-squared pair separation fluctuation, < Delta B> , the reduced specific heat per bead, < Cv*/N> , and the reduced internal energy per bead, < E*/N> , as a function of the reduced temperature, T*, for the beta -clip model at (a) g = 0.3 and (b) g = 1.3, and for the beta -twist model at (c) g = 0.3 and (d) g = 1.3.

The Lindemann disorder parameters are shown as a function of temperature and the bias gap for the beta -clip and beta -twist in Fig. 9, a and b, respectively. It can be seen that for both models the locations of the kinks in the < Delta L> curves at low temperatures and large g are consistent with those at low temperatures found in the < Rg2/sigma 2N> and < Delta B> curves as seen in Fig. 8, b and d. The solid-like phase with Delta L < 0.15 is prevalent at low temperature for the beta -clip and beta -twist models. Sharp increases at high temperatures in the < Delta L> curves for the large gap models are also consistent with those found in the < Rg2/sigma 2N> curves and with the peaks in the < Delta B> curves in Fig. 8, b and d, indicating the collapse transition.



View larger version (74K):
[in this window]
[in a new window]
 
FIGURE 9   Lindemann disorder parameter, < Delta L> , as a function of the reduced temperature, T*, and the bias gap, g, for (a) the beta -clip and (b) the beta -twist.

Fig. 10, a and b, show the Lindemann disorder parameter for an individual bead, Delta Li, a for the beta -clip model with g = 0.9 at T* = 0.1, 0.26, and 0.5, and b for the beta -twist model with g = 0.7 at T* = 0.1, 0.28, and 0.6. For both the beta -clip and beta -twist models, the three selected temperatures correspond to the three different phases of the chain: the solid-like phase, the native state, and the ordered globule, respectively. In the native states near the liquid-to-solid transition (beta -clip with g = 0.9 at T* = 0.26 in Fig. 10 a and beta -twist with g = 0.7 at T* = 0.28 in Fig. 10 b), we find that the interiors of the chains are solid-like with Delta Li < 0.15, but the surfaces are liquid-like with Delta Li > 0.15. Thus the native proteins for the beta -clip and beta -twist are surface-molten solids. At low temperature, T* = 0.1, for both the beta -clip and beta -twist models, the bead motions are frozen with all Delta Li < 0.1, indicating an inactive solid-like phase.



View larger version (15K):
[in this window]
[in a new window]
 
FIGURE 10   Lindemann disorder parameter for an individual bead, Delta Li, as a function the reduced distance of bead i from the origin, |ri|/sigma , for (a) the beta -clip with g = 0.9 at T* = 0.1, 0.26, 0.5, and (b) the beta -twist with g = 0.7 at T* = 0.1, 0.28, 0.6.

Fig. 11 a for the beta -clip and Fig. 11 b for the beta -twist summarize the protein phases that occur in the space spanned by the reduced temperature T* and the bias gap g. The qualitative differences between the phase behavior results for the beta -clip and beta -twist models and those for the beta -sheet model (Fig. 3) are immediately apparent. For example, the native states for the beta -clip and beta -twist models exist at low temperatures for g > 0.5, whereas the native state for the beta -sheet exists at low temperatures for g > 0.7. For the beta -clip and beta -twist models at even lower temperatures, the liquid-to-solid transition exists for all values of g, but for the beta -sheet it exists only for g < 0.9. For the beta -clip and beta -twist models at g >=  0.9, although not shown in the figures, the energy distribution for the collapse transition is bimodal, indicating that the transition is two-state-like. However, the calorimetric criterion shows that at g = 1.3, Delta HvH/Delta Hcal ~ 0.8 for the beta -clip and beta -twist models, a ratio that is still smaller than the value of Delta HvH/Delta Hcal ~ 0.96 observed in experimental studies on small proteins (Privalov, 1979). This indicates that the collapse transition is only weakly cooperative.



View larger version (18K):
[in this window]
[in a new window]
 
FIGURE 11   Phase diagram as a function of the reduced temperature, T*, and the bias gap, g, for (a) the beta -clip model and (b) the beta -twist model.


    CONCLUSIONS
TOP
ABSTRACT
INTRODUCTION
MODELS AND SIMULATION METHOD
RESULTS AND DISCUSSION
CONCLUSIONS
REFERENCES

We have studied the folding thermodynamics for three different types of off-lattice four-strand antiparallel beta -strand protein models: the beta -sheet, the beta -clip, and the beta -twist. DMD simulations of these models have been performed for different sizes of the bias gap g, a measure of the strength of the native contacts relative to that of the nonnative contacts. Although all three beta -strand model proteins are simple, they undergo a complex set of protein transitions as observed in experimental studies on real proteins (Ptitsyn, 1995