help button home button Biophys. J.
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS

This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Jang, H.
Right arrow Articles by Zhou, Y.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Jang, H.
Right arrow Articles by Zhou, Y.

Biophys J, August 2002, p. 819-835, Vol. 83, No. 2

Protein Folding Pathways and Kinetics: Molecular Dynamics Simulations of beta -Strand Motifs

Hyunbum Jang,* Carol K. Hall,* and Yaoqi Zhoudagger

 *Department of Chemical Engineering, North Carolina State University, Raleigh, North Carolina 27695; and  dagger Department of Physiology and Biophysics, State University of New York at Buffalo, Buffalo, New York 14214 USA


    ABSTRACT
TOP
ABSTRACT
INTRODUCTION
MODELS AND SIMULATION METHOD
RESULTS
DISCUSSION
REFERENCES

The folding pathways and the kinetic properties for three different types of off-lattice four-strand antiparallel beta -strand protein models interacting via a hybrid Go-type potential have been investigated using discontinuous molecular dynamics simulations. The kinetic study of protein folding was conducted by temperature quenching from a denatured or random coil state to a native state. The progress parameters used in the kinetic study include the squared radius of gyration R<UP><SUB>g</SUB><SUP>2</SUP></UP>, the fraction of native contacts within the protein as a whole Q, and between specific strands Qab. In the time series of folding, the denatured proteins undergo a conformational change toward the native state. The model proteins exhibit a variety of kinetic folding pathways that include a fast-track folding pathway without passing through an intermediate and multiple pathways with trapping into more than one intermediate. The kinetic folding behavior of the beta -strand proteins strongly depends on the native-state geometry of the model proteins and the size of the bias gap g, an artificial measure of a model protein's preference for its native state.


    INTRODUCTION
TOP
ABSTRACT
INTRODUCTION
MODELS AND SIMULATION METHOD
RESULTS
DISCUSSION
REFERENCES

In our laboratory we are using computer simulation to understand the basic physical principles of protein aggregation, a cause or associated symptom of a number of lethal disorders including Alzheimer's, Parkinson's, and prion diseases (Clark and Steele, 1992; Eaton and Hofrichter, 1990; Gallo et al., 1996; Massry and Glasscock, 1983; Moore and Melton, 1997; Selkoe, 1991; Simmons et al., 1994). Our long-term goal is to learn if there are certain universal molecular-level mechanisms responsible for protein aggregation and fibril formation. Because the association of beta -strands, or the conversion of alpha -helices to beta -strands often precedes or accompanies protein aggregation (Benzinger et al., 1998, 2000; Burkoth et al., 1998; Esler et al., 1996, 2000; Lazo and Cowning, 1998; Lynn and Meredith, 2000; Sunde et al., 1997; Zhang et al., 2000), we have focused part of our efforts on the development of models of beta -strand proteins. These models must be capable of capturing the essential physical features of real beta -strand proteins and at the same time be simple enough to allow the simulation of multi-protein systems with current computer capability.

Low-resolution or simplified protein models are best suited for the construction of a beta -strand model to be used in simulations of protein folding because they allow us to study the folding behavior over relatively long time scales. These models provide numerous insights into the thermodynamic and kinetic properties of protein folding (Chan and Dill, 1994; Dokholyan et al., 1998, 2000; Guo and Brooks, 1997; Guo and Thirumalai, 1995, 1996; Gupta and Hall, 1997, 1998; Kolinski et al., 1995, 1999; Lau and Dill, 1989; Miller et al., 1992; Nymeyer et al., 1998; Pande and Rokhsar, 1998; Shea et al., 2000; Skolnick and Kolinski, 1991; Zhou and Karplus, 1997a,b, 1999). We recently introduced three minimalist models of four-strand antiparallel beta -strand peptides: the beta -sheet, the beta -clip, and the beta -twist (Jang et al., 2002). Discontinuous molecular dynamics (DMD) simulations (Alder and Wainwright, 1959; Rapaport, 1978; Smith et al., 1996) on these three models were performed to determine how the thermodynamic properties of an isolated peptide vary with temperature. Despite these models' simplicity, they undergo a complex set of protein transitions similar to those observed in experimental studies on real proteins (Ptitsyn, 1995). Starting from high temperature, these transitions include a collapse transition, a disordered-to-ordered globule transition, a folding transition, and a liquid-to-solid transition. The thermodynamics results presented in our previous paper and the kinetics results presented here set the stage for a later paper in which we will examine the folding and assembly of several of these model beta -strand proteins into fibrils.

The folding of beta -strand proteins is of current interest not only because isolated beta -strand peptides tend to adopt a beta -sheet fibril conformation when they aggregate, but also because beta -strands are often secondary structure elements in proteins. Unfortunately, unlike the formation of alpha -helices, the formation of even isolated beta -strand peptides is not easy to study experimentally due to difficulties associated with the propensities of the largely insoluble beta -structures to precipitate. This has prompted the development of designed beta -strand peptides that are more amenable to fundamental protein folding investigations. Experimental studies of such peptides include analyses of the structure and stability for de novo designed two-strand beta -sheets known as beta -hairpin (Blanco et al., 1998; Ramírez-Alvarado et al., 1996), three-strand beta -sheets known as betanova (de Alba et al., 1999; Koepf et al., 1999; Kortemme et al., 1998), and four-strand beta -sheets known as betabellin (Lim et al., 2000). Computer simulation studies have been conducted on the thermodynamics and kinetics of formation of 16-residue beta -hairpins (Dinner et al., 1999; García and Sanbonmatsu, 2001; Lee and Shin, 2001; Pande and Rokhsar, 1999; Kolinski et al., 1999; Zagrovic et al., 2001; Zhou and Linhananta, 2002; Zhou et al., 2001) taken from the C-terminal fragment (41-56) of the B1 domain of protein G (Muñoz et al., 1997), the folding kinetics of betanova (Bursulaya and Brooks, 1999; Ferrara and Caflisch, 2000), the folding thermodynamics and kinetics of a 46-mer beta -barrel (Guo and Brooks, 1997; Guo and Thirumalai, 1995), and the dynamics of a six-stranded lattice model of Greek key beta -barrel (Kolinski et al., 1995; Skolnick et al., 1989).

In this paper we investigate the folding kinetics of the three different four-strand antiparallel beta -strand peptides: the beta -sheet, the beta -clip, and the beta -twist, whose thermodynamic properties were studied previously by us (Jang et al., 2002). These beta -strand peptides each have 39 connected residues (beads) and different native state conformations as shown in Fig. 1. Nonbonded beads can interact through a hybrid Go-type potential (Go and Taketomi, 1978, 1979; Taketomi et al., 1975; Ueda et al., 1978) modeled as a square-well or square-shoulder potential depending on the value of the bias gap parameter g. DMD simulations were performed on these three model systems at intermediate values of the bias gaps ranging from 0.7 to 1.1. The progress parameters, the squared radius of gyration R<UP><SUB>g</SUB><SUP>2</SUP></UP>, the fraction of native contacts within the protein as a whole Q, and between specific strands Qab were monitored over the course of the kinetic simulations. The bias gap measures the difference in interaction strength between the native and non-native contacts; the larger it is, the more the native state is favored over the non-native state. Intermediate values of the bias gap are thought to be the most representative of real proteins in their equilibrium and dynamic behavior. The model protein's bias gap is an artificial measure of its preference for the native state. By exploring how variations in the bias gap influence the types of dynamic phase behavior observed, we can get a feeling for how a real protein's preference for its native state, as measured for example by the energy difference between the denatured and native state, is manifested in the protein's dynamic behavior and vice versa.



View larger version (36K):
[in this window]
[in a new window]
 
FIGURE 1   Stick (top) and topology (bottom) diagrams of the global energy minimum structures: (A) the beta -sheet; (B) the beta -clip; and (C) the beta -twist model proteins.

Highlights of our results are the following. We find that the beta -sheet exhibits a fast-track folding pathway without becoming trapped in any intermediates for all the bias gaps investigated. Its kinetic folding mechanism can be described by the diffusion collision model (Karplus and Weaver, 1979, 1994). In contrast, the beta -clip and beta -twist exhibit multiple folding pathways that include trapping in intermediates and direct folding to the native state depending on the size of the bias gap. For the beta -clip and beta -twist models at g = 0.7, two intermediates I1 and I2 are observed in the folding pathways. The intermediate I1 is a partially ordered globule, and the intermediate I2 is an ordered globule that can be regarded as a molten globule as described in the thermodynamic study. The intermediate I1 can fold either directly into the native state or via the intermediate I2, which then folds into the native state by elongating its strands. No fast-track folding pathway is observed. The kinetic folding mechanism for the beta -clip and beta -twist at g = 0.7 can be described by the hydrophobic-collapse theory (Dill et al., 1995). For the beta -clip and beta -twist models at g = 0.9, the kinetic folding trajectories include the fast-track folding pathway and trapping in an intermediate I. The intermediate I is an ordered globule as was seen in the g = 0.7 model result. The random coil can fold directly to the native state or via the intermediate I. We expect that as g increases, more of the trajectories will exhibit the fast-track kinetic folding pathway. The kinetic folding mechanism for the beta -clip and beta -twist at g = 0.9 can be described by a combination of the diffusion collision model and the hydrophobic-collapse theory. The folding speed of the model proteins with different native state topologies strongly depends on the contact order in the native state (Plaxco et al., 1998). Models with a high topological complexity exhibit slower folding to the native state than models with a topologically simple structure. Moreover, the former is prone to kinetic intermediates.

In the following section, a full description of the three models and the simulation method are presented. Next is the results section, which is divided into three subsections; the first presents the results for the folding kinetics, the second presents a description of intermediates and folding pathways, and the third discusses how the folding speed depends on the topology of the model proteins. The paper concludes with a discussion of the key findings in the last section.


    MODELS AND SIMULATION METHOD
TOP
ABSTRACT
INTRODUCTION
MODELS AND SIMULATION METHOD
RESULTS
DISCUSSION
REFERENCES

We consider three different off-lattice protein models whose native states are four-strand antiparallel beta -strand peptides. The global energy minimum structures for the three different beta -strand models are shown in Fig. 1. Stick diagrams for the beta -sheet, beta -clip, and beta -twist are shown at the top of the figure. Topology diagrams representing a top view of the three proteins are shown at the bottom of the figure. In the topology diagrams, thick arrows indicate chain connectivity at the top of the strands, and thin arrows indicate chain connectivity at the bottom of the strands. It is immediately apparent that in the native state, the beta -sheet is two-dimensional (planar), whereas the beta -clip and beta -twist have well-defined three-dimensional four-barrel structures. The total number of native contacts, N<UP><SUB>native</SUB><SUP>total</SUP></UP>, for each model chain and the total number of inter-strand native contacts, N<UP><SUB>ab</SUB><SUP>total</SUP></UP>, for each model where a and b are the strand number, are summarized in Table 1. Note that N<UP><SUB>native</SUB><SUP>total</SUP></UP> for the beta -sheet equals the sum of all N<UP><SUB>ab</SUB><SUP>total</SUP></UP>, but N<UP><SUB>native</SUB><SUP>total</SUP></UP> for the beta -clip and beta -twist does not, because native contacts on turn residues are not inter-strand native contacts.


                              
View this table:
[in this window]
[in a new window]
 
TABLE 1   Numbers of native contacts

Each model protein contains 39 connected beads, each representing an amino acid residue that can be regarded as being localized at the Calpha atom. The details of the model potential and the bias gap g are described in an earlier publication (Jang et al., 2002).

The folding kinetics of the three different beta -strand peptides are studied here using the DMD algorithm (Alder and Wainwright, 1959; Rapaport, 1978; Smith et al., 1996). Proteins are quenched from the denatured or random coil state equilibrated at high temperature to the temperature of interest. After quenching, the proteins undergo a conformational change toward the energy minimum state, i.e., the native state. We investigated the time series of folding behavior from the random coil state at T* = 2.0 to the native state at T* = 0.2, 0.24, and 0.28 for the beta -sheet, the beta -clip, and the beta -twist, respectively. The temperature used in this paper is a dimensionless reduced temperature, T* = kBT/epsilon , where kB is Bolzmann's constant, T is the absolute temperature, and epsilon  is the energy parameter. Our thermodynamic study on the same models (Jang et al., 2002) shows that at T* = 2.0, the three beta -strand peptides are completely random coils. The three target temperatures for each model are selected to ensure that the native state is highly populated at that temperature. The initial configurations for the kinetic simulations were generated from equilibrium simulations at T* = 2.0. The coordinates and velocities of the beads in the random coil state were obtained every 105 collisions during the equilibrium simulation to create independent initial conformations for the quenching studies. This large collision interval ensures that the sampled configurations are independent. Simulations were performed for three different values of the bias gaps, g = 0.7, 0.9, and 1.1, except for the beta -sheet where the simulations were performed at g = 0.9 and 1.1. For each bias gap model, statistical averages were obtained over 100 independent simulations.

The total time duration for the simulation strongly depends on the topology of the model and the bias gap. The time used in the simulations is expressed in terms of reduced time, t* = t<RAD><RCD><IT>&egr;/M&sfgr;<SUP>2</SUP></IT></RCD></RAD> (Zhou and Karplus, 1999), where t is the real time, epsilon  is the energy parameter, M is the mass of the bead, and sigma  is the bead diameter. For all three beta -strands, simulations were performed for ~108 collisions, which is equivalent to t* = 4 × 105 for the beta -sheet model and t* = 2 × 105 for the beta -clip and beta -twist models. This indicates that the simulation speed for the beta -sheet is twice as fast as that of the other two models. The simulation speed at large bias gaps is slightly faster than at small bias gaps for any given model, indicating that the larger the bias gap, the faster the protein folds. To highlight the folding events that occur early in the simulation, a logarithmic timescale was used to examine early folding behavior of proteins as well as folded behavior in the native state. Simulation results were recorded at linearly spaced time intervals of Delta t* = 0.1 for t* < 10 and at logarithmically spaced time intervals of Delta log t* = 0.01 for t* > 10.

To determine the compactness of the chains during the simulations, the time-dependent squared radius of gyration, R<UP><SUB>g</SUB><SUP>2</SUP></UP> (t) was determined where
R<SUP><UP>2</UP></SUP><SUB><UP>g</UP></SUB>(t)≡<FR><NU>1</NU><DE>N</DE></FR> <LIM><OP>∑</OP><LL><UP>i=1</UP></LL><UL><UP>N</UP></UL></LIM> <B><UP>r</UP></B><SUP><UP>2</UP></SUP><SUB><UP>i</UP></SUB>(t), (1)
with N equal to the number of beads. The time-dependent bead positions, ri(t) (i = 1 to N), are obtained by shifting the center of mass coordinates to the origin. The radius of gyration is widely used as an order parameter in studies of the polymer order-disorder collapse transition. At low temperatures, collapsed polymers tend to form a high-density conformation yielding a small value of the radius of gyration. In contrast, the squared radius of gyration for native beta -strand peptides is larger than that in the collapsed state, because the native beta -strands are long and have an elongated molecular shape. This indicates that a minimum value of the squared radius of gyration does not imply that the beta -strands are in the native state. Thus, to characterize the nativeness of beta -strands as collapsing progress, an alternative measure of the time-dependent squared radius of gyration is considered. This is the normalized squared radius of gyration, F*R(t), which is defined by
F<UP><SUP>*</SUP><SUB>R</SUB></UP>(t)≡1−<FR><NU>R<SUP><UP>2</UP></SUP><SUB><UP>g</UP></SUB>(<UP>eq</UP>)</NU><DE>R<SUP><UP>2</UP></SUP><SUB><UP>g</UP></SUB>(t)</DE></FR>, (2)
where R<UP><SUB>g</SUB><SUP>2</SUP></UP>(eq) is the native value of the squared radius of gyration. The normalized squared radius of gyration measures the departure of the chain's squared radius of gyration from its value in the native state. It is slightly less than 1 when the chain is loosely packed or a random coil (R<UP><SUB>g</SUB><SUP>2</SUP></UP>(t) > R<UP><SUB>g</SUB><SUP>2</SUP></UP>(eq)), equals 0 when the chain is in the native state (R<UP><SUB>g</SUB><SUP>2</SUP></UP>(t) R<UP><SUB>g</SUB><SUP>2</SUP></UP>(eq)), and is less than 0 when the chain collapses to very compact structure (R<UP><SUB>g</SUB><SUP>2</SUP></UP>(t) R<UP><SUB>g</SUB><SUP>2</SUP></UP>(eq)). Each beta -strand peptide has different values of R<UP><SUB>g</SUB><SUP>2</SUP></UP>(eq) depending on the temperature and the size of the bias gap. The native values of R<UP><SUB>g</SUB><SUP>2</SUP></UP> for the three beta -strands at different bias gaps are obtained from the equilibrium simulations at T* = 0.2, 0.24, and 0.28 for the beta -sheet, the beta -clip, and the beta -twist, respectively. These values are summarized in Table 2 along with those for the global energy minimum structures. The R<UP><SUB>g</SUB><SUP>2</SUP></UP>(eq) values for the native beta -strands are smaller than the R<UP><SUB>g</SUB><SUP>2</SUP></UP> values in the global energy minimum structures, R<UP><SUB>g</SUB><SUP>2</SUP></UP>(GM), because the beta -strands are slightly bent or kinked in the native state, especially for the smaller bias gap models. This is due to the attraction between non-native contacts for g < 1, which disturbs the propensity of the beta -strands to be straight. Note that the ratio of the native R<UP><SUB>g</SUB><SUP>2</SUP></UP> value to that in the global energy minimum structure, R<UP><SUB>g</SUB><SUP>2</SUP></UP>(eq)/R<UP><SUB>g</SUB><SUP>2</SUP></UP>(GM), is much smaller for the beta -sheet than for the other two models. This indicates that the planar shape of the beta -sheet can be easily bent due to the non-native attractions, giving the native state a slightly rolled molecular structure. In fact, most real beta -sheet structures are twisted as opposed to being perfectly planar or elongated molecular structures (Clothia, 1973; Pauling and Corey, 1951).


                              
View this table:
[in this window]
[in a new window]
 
TABLE 2   Native values of the reduced squared radius of gyration

The progression of a conformation toward the native state can be monitored by introducing the time-dependent fraction of native contacts formed (Lazaridis and Karplus, 1997; Sali et al., 1994b), Q(t), defined by
Q(t)=<FR><NU>N<SUB><UP>native</UP></SUB>(t)</NU><DE>N<SUP><UP>total</UP></SUP><SUB><UP>native</UP></SUB></DE></FR>, (3)
and further breaking down Q(t) into the time-dependent fraction of inter-strand native contacts formed between strands a and b, Qab(t), defined by
Q<SUB><UP>ab</UP></SUB>(t)=<FR><NU>N<SUP><UP>native</UP></SUP><SUB><UP>ab</UP></SUB>(t)</NU><DE>N<SUP><UP>total</UP></SUP><SUB><UP>ab</UP></SUB></DE></FR>, (4)
where Nnative represents the number of native contacts for the whole chain and N<UP><SUB>ab</SUB><SUP>native</SUP></UP> is the number of inter-strand native contacts between strands a and b in a given conformation. The fraction of native contacts Q has a value between 0 and 1. When Q = 1 the model system can be regarded as being in the native structure, whereas when Q right-arrow 0 the chain becomes a random coil that signifies the denatured state.


    RESULTS
TOP
ABSTRACT
INTRODUCTION
MODELS AND SIMULATION METHOD
RESULTS
DISCUSSION
REFERENCES

Characteristics of folding kinetics

We performed DMD simulations to investigate the folding kinetics of the three different beta -strand peptides. The systems were quenched from the denatured state at T* = 2.0 to the native states at T* = 0.2, 0.24, and 0.28 for the beta -sheet, the beta -clip, and the beta -twist, respectively. The time-dependent normalized squared radius of gyration, < F*R> , is shown in Fig. 2 as a function of the reduced time on a logarithmic scale, log(1 + t*): for the beta -sheet at g = 0.9 and 1.1 (Fig. 2 A); for the beta -clip at g = 0.7, 0.9, and 1.1 (Fig. 2 B); and for the beta -twist at g = 0.7, 0.9, and 1.1 (Fig. 2 C). The quantity < F*R> is an average over 100 independent trajectories that start from different initial configurations. Simulation results are presented up to the reduced time of t* = 105, because for t* > 105 the beta -peptides are very stable and remain in the folded state. The degree of compactness of the system can be monitored by examining how the F*R values change as folding progress. Recall that when 0 < F*R < 1, the chain has a larger squared radius of gyration, R<UP><SUB>g</SUB><SUP>2</SUP></UP>, than the native value of R<UP><SUB>g</SUB><SUP>2</SUP></UP>(eq) (see Table 2), indicating that the system is loosely packed or a random coil. For F*R = 0, the system has the same R<UP><SUB>g</SUB><SUP>2</SUP></UP> value as in the native state, so that the system can be regarded as being in the native state. For F*R < 0, the system exhibits a highly collapsed state, yielding a kinked or rolled molecular structure with smaller value of R<UP><SUB>g</SUB><SUP>2</SUP></UP> than in the native state.



View larger version (17K):
[in this window]
[in a new window]
 
FIGURE 2   The average values of the normalized squared radius of gyration, < F*R(t)> , as a function of the reduced time in logarithmic scale, log(1 + t*): (A) the beta -sheet at g = 0.9 and 1.1; (B) the beta -clip at g = 0.7, 0.9, and 1.1; and (C) the beta -twist at g = 0.7, 0.9, and 1.1.

The < F*R(t)> versus log(1 + t*) graph in Fig. 2 A for the beta -sheet shows that < F*R>  > 0 initially, that it oscillates around zero when the native state is reached, and that the folding events for the g = 0.9 and g = 1.1 models are qualitatively the same. The initial conformations of the chains begin to collapse at t* = 10 and fold into the native state by t* = 100. The < F*R(t)> versus log(1 + t*) graphs in Fig. 2, B and C, for the beta -clip and the beta -twist, respectively, are strikingly different. In contrast to the beta -sheet, the < F*R> values for the beta -clip and the beta -twist rapidly decrease to a negative minimum value as the chain collapses and then gradually increase to zero as the native state is reached. The negative value for F*R means that the R<UP><SUB>g</SUB><SUP>2</SUP></UP> value of the collapsed chain is smaller than that of the native state at the given time. This suggests that the folding process for the beta -clip and beta -twist goes through two distinct stages. In the first stage, the initial random coil collapses to a compact structure. In the second stage, strand extension takes place, recovering the elongated molecular shape of the beta -strand peptide. For the beta -clip in Fig. 2 B, the minimum value of < F*R> decreases as g decreases, indicating that the smaller the value of g, the more compact the collapsed structure is. This is due to the relatively high overall strength of the non-native attractions among the beads in the smaller gap models. The strand extension is much quicker in the larger gap model. Similar behavior is seen in Fig. 2 C for the beta -twist.

The time-dependent fraction of native contacts, < Q> , is shown in Fig. 3 as a function of log(1 + t*): for the beta -sheet at g = 0.9 and 1.1 (Fig. 3 A); for the beta -clip at g = 0.7, 0.9, and 1.1 (B); and for the beta -twist at g = 0.7, 0.9, and 1.1 (C). The quantity< Q> is an average over 100 independent simulations with different independent initial configurations. For the beta -sheet, no difference in the formation of native contacts between the g = 0.9 and g = 1.1 models is observed. The formation of native contacts and the chain collapse (compare to Fig. 2 A) are concurrent. This indicates that the collapse time and the folding time are identical for the beta -sheet. The native states for both the bias gap models are located at < Q>  approx  1. In contrast, for the beta -clip in Fig. 3 B and for the beta -twist in Fig. 3 C, 60-70% of the total native contacts (depending on the size of the bias gap) are built up when < F*R> reaches its negative minimum at around t* = 100 as seen in Fig. 2, B and 2 C. The < Q> values continuously increase thereafter and begin to saturate slowly to the native value during the strand extension process as < F*R> goes to zero. This reflects the result that for the beta -clip and beta -twist the collapse time and the folding time are different. For the beta -clip, the native states are located at < Q>  approx  0.82, 0.92, and 0.95 for g = 0.7, 0.9, and 1.1, respectively. As g increases, the < Q> value in the native state increases, and the speed of native contact formation is much faster. For the beta -twist, the native states are located at < Q>  approx  0.88, 0.92, and 0.93 for g = 0.7, 0.9, and 1.1, respectively. The reason that the native < Q> values are all different for the three beta -strand models with the same bias gap is that the kinetic simulations are performed at different target temperatures.



View larger version (14K):
[in this window]
[in a new window]
 
FIGURE 3   The average values of the fraction of native contacts, < Q(t)> , as a function of the reduced time in logarithmic scale, log(1 + t*), for: (A) the beta -sheet at g = 0.9 and 1.1; (B) the beta -clip at g = 0.7, 0.9, and 1.1; and (C) the beta -twist at g = 0.7, 0.9, and 1.1.

More detailed information on the formation of native contacts can be obtained by considering the fraction of inter-strand native contacts, < Qab> , between strands a and b where the strand number runs from 1 to 4. The beta -sheet model has three possible strand-to-strand native interactions: 1-2, 2-3, and 3-4 (see the topology diagrams in Fig. 1 A). The time-dependent fraction of inter-strand native contacts, < Qab> , for the beta -sheet model is shown in Fig. 4 as a function of log(1 + t*) for the bias gaps: g = 0.9 (Fig. 4 A) and g = 1.1 (Fig. 4 B). Each value of < Qab> is averaged over 100 independent simulations. For g = 0.9, < Q12> and < Q34> (the fraction of inter-strand native contacts experienced by the outer strands) increase before < Q23> (the fraction of inter-strand native contacts experienced by the inner strands) increases as the system collapses. This indicates that assembly of the native state for the beta -sheet at g = 0.9 starts with the formation of native contacts between strands 1 and 2 and between strands 3 and 4, followed by the formation of native contacts between strands 2 and 3. This is due to the fact that strands 1 and 4 are the chain's terminals, which are more flexible and easier to rearrange than the other strands. We find that < Q12>  approx  < Q23>  approx  1 at t* = 160, and < Q34>  approx  1 at t* = 200. However, for g = 1.1 all < Qab> values increase concurrently and < Q12>  approx  < Q23>  approx  < Q34>  approx  1 at t* = 100. The native secondary structure formation for the beta -sheet at g = 1.1 starts at the same time as the chain collapses.



View larger version (14K):
[in this window]
[in a new window]
 
FIGURE 4   The average values of the fraction of inter-strand native contacts, < Qab(t)> , where a and b are the strand number, for the beta -sheet as a function of the reduced time in logarithmic scale, log(1 + t*), for the bias gaps: (A) g = 0.9; (B) g = 1.1.

In marked contrast to the beta -sheet, the formation of inter-strand native contacts for the beta -clip is more complex as shown in Fig. 5 A at g = 0.7 and Fig. 5 B at g = 0.9. As shown in Fig. 1 B (top view topology diagram), there are six possible strand-to-strand native interactions for the beta -clip: Q12, Q14, Q23, and Q34 correspond to the four peripheral strand-to-strand interactions, and Q13 and Q24 correspond to the two diagonal strand-to-strand interactions. For g = 0.7, the formation of beta -clip native contacts begins to develop at the ends of the chain; i.e., < Q12> and < Q34> increase first, as was the case in Fig. 4 A for the beta -sheet at g = 0.9, because the chain's terminals (strands 1 and 4) are very flexible and easy to rearrange. All values of < Qab> increase initially as the chain collapses and saturate at their respective native values of < Qab> , except for < Q14> which continuously increases and reaches a value of < Q14>  approx  0.85 at t* = 105. The native state is located at t>=  105, the time at which < Q14> starts to oscillate about 0.85. In the native state, the four peripheral values, < Q12> , < Q14> , < Q23> , and < Q34> oscillate about their average value of < Q>  approx  0.82, and the two diagonal values, < Q13> and < Q24> oscillate about 0.6, which is much smaller. Because the number of native contacts in the four peripheral strand-to-strand interactions is much larger than that in the two diagonal strand-to-strand interactions (see Table 1), this indicates that the larger the number of strand-to-strand native contacts, the faster the formation of native contacts in the strand-to-strand interactions. For g = 0.9, Fig. 5 B shows similar information on the formation of inter-strand native contacts. The speed of the formation of native contacts is faster than in the g = 0.7 case. The native state is located at t>=  104 when < Q14> reaches its native value of 0.95. Note that in the native state the diagonal strand interactions < Q13> and < Q24> oscillate at around 0.8, which is significantly closer to the native value for the whole chain of < Q>  approx  0.92 than occurs in the g = 0.7 case. This is based on the fact that the larger the bias gap, the more the native state is favored over the non-native state. In fact, the large values of < Q13> , < Q14> , and < Q24> are essential to maintain the beta -clip structure, because without these strand-to-strand native interactions, the model beta -clip reduces to the beta -sheet.



View larger version (20K):
[in this window]
[in a new window]
 
FIGURE 5   The average values of the fraction of inter-strand native contacts, < Qab(t)> , where a and b are the strand number, for the beta -clip as a function of the reduced time in logarithmic scale, log(1 + t*), for two selected values of the bias gap: (A) g = 0.7 and (B) g = 0.9.

The formation of inter-strand native contacts for the beta -twist is shown in Fig. 6 A at g = 0.7 and Fig. 6 B at g = 0.9. As shown in Fig. 1 C (top view topology diagram), there are five possible strand-to-strand native interactions for the beta -twist: 1-2, 1-3, 2-3, 2-4, and 3-4. For g = 0.7 in Fig. 6 A, native contacts between strands 1 and 2 and between strands 3 and 4 form rapidly as folding progresses, but native contacts between strands 1 and 3 and between strands 2 and 4 form slowly. Native contacts between strands 2 and 3 form at an intermediate speed. After the initial rapid increase, < Q12> and < Q34> oscillate about 0.88 in the native state, whereas < Q13> and < Q24> continuously increase to 0.88. The native state is located at t>=  3.2 × 104 when < Q13> and < Q24> begin to oscillate about 0.88. For g = 0.9 in Fig. 6 B, it is immediately apparent that there are three fast rates in the formation of inter-strand native contacts (Q12, Q23, and Q34) and two slow rates in the formation of inter-strand native contacts (Q13 and Q24). The quantities < Q13> and < Q24> increase faster than they do in the g = 0.7 model. The native state is located at t>=  104 when < Q13> and < Q24> oscillate about 0.92. For the beta -twist, the large values of < Q13> and < Q24> are key factors in the conformation of the native beta -twist structure, because without these strand-to-strand native interactions, the model beta -twist reduces to the beta -sheet.



View larger version (19K):
[in this window]
[in a new window]
 
FIGURE 6   The average values of the fraction of inter-strand native contacts, < Qab(t)> , where a and b are the strand number, for the beta -twist as a function of the reduced time in logarithmic scale, log(1 + t*), for two selected values of the bias gap: (A) g = 0.7; (B) g = 0.9.

The evolution of the formation of inter-strand native contacts for the beta -sheet model is reflected in the snapshots in Fig. 7 at selected times that display typical folding events for: g = 0.9 (Fig. 7 A) and g = 1.1 (Fig. 7 B). As mentioned above, for g = 0.9 native contacts start forming at the ends of the chain as can be clearly seen in the structure at t* = 32. The native state is observed at t>=  100. For g = 1.1, the chain collapse and the formation of native contacts are concurrent. An example of the collapsed chain is shown at t* = 32. A native-like structure is observed at t* = 63. A well-ordered native beta -sheet appears to exist at t>=  100. 



View larger version (34K):
[in this window]
[in a new window]
 
FIGURE 7   Snapshots of the kinetic chain conformation of the beta -sheet at selected times, t* = 0.2, 32, 63, and 100 for the bias gaps: (A) g = 0.9; (B) g = 1.1.

Fig. 8 shows snapshots for the beta -clip at selected times as the chain evolves toward the native state. For g = 0.7 in Fig. 8 A, the four snapshots are taken after < F*R> reaches its minimum as seen in Fig. 2 B. The structure at t* = 100 corresponds to the collapsed chain at the minimum value of < F*R> . It exists for a very short time. This collapsed chain is a transient in protein folding (Silow and Oliveberg, 1997) and can be regarded as a kinetic disordered globule because it has a Q value, < Q>  approx  0.6 (see Fig. 3 B), that is similar to that of a thermodynamic disordered globule (Jang et at., 2002) but with a smaller R<UP><SUB>g</SUB><SUP>2</SUP></UP>. The existence of a disordered compact structure is also reported in the kinetic study of a model three-helix bundle protein (Zhou and Karplus, 1997b, 1999). At t* = 1.6 × 103 and at t* = 1.3 × 104, we see the structures of two typical intermediates; these are intermediate I1 and intermediate I2, respectively (see below). These intermediates are found in the strand extension process after the initial chain collapse and have very long lifetimes. An example of the native structure is shown at t* = 105. For g = 0.9, the evolution of the chain toward the native state is totally different. No kinetic disordered globule state is observed when < F*R> goes to the minimum at tapprox  63. Instead, a slightly bent beta -sheet structure occurs as shown in Fig. 8 B at t* = 63. This beta -sheet-like structure is a transient in folding trajectories toward the beta -clip. Examples of the intermediate I and the native structure are observed at t* = 160 and t* = 104, respectively.



View larger version (38K):
[in this window]
[in a new window]
 
FIGURE 8   Snapshots of the kinetic chain conformation of the beta -clip for two values of the bias gap: (A) g = 0.7 at times t* = 100, 1.6 × 103, 1.3 × 104, and 105; (B) g = 0.9 at times t* = 25, 63, 160, and 104.

Fig. 9 shows snapshots for the beta -twist at selected times as the chain evolves toward the native state. The behavior displayed in Fig. 9 A for g = 0.7 is similar to that shown in Fig. 8 A for the beta -clip. The collapsed chain at the minimum value of < F*R> is shown at t* = 63; this is a transient in protein folding (Silow and Oliveberg, 1997) and can be regarded as a kinetic disordered globule. The kinetic disordered globule has more non-native interactions than the thermodynamic disordered globule giving a smaller R<UP><SUB>g</SUB><SUP>2</SUP></UP>. Due to the disordered nature of this state, no native-state secondary structure is observed. As time increases, the native contacts begin to form as part of the strand extension process, leading to the native structure. This process is very slow because the collapsed chain undergoes a subtle conformational change to the native state via intermediates that have long lifetimes. Examples of two typical intermediates, the intermediate I1 and the intermediate I2 are observed at t* = 250 and at t* = 1.3 × 103, respectively. The native structure is observed at t* = 4 × 104. For g = 0.9 in Fig. 9 B, no kinetic disordered globule state is observed when < F*R> goes to the minimum at tapprox  63. (This was also the case for the beta -clip at g = 0.9). Instead, a slightly bent beta -sheet structure is observed at t* = 63. The structures of the intermediate I (see below) are shown at t* = 320 and at t* = 3.2 × 103. The native structure is observed at t* = 104.



View larger version (42K):
[in this window]
[in a new window]
 
FIGURE 9   Snapshots of the kinetic chain conformation of the beta -twist for two values of the bias gap: (A) g = 0.7 at times t* = 63, 250, 1.3 × 103, and 4.0 × 104; (B) g = 0.9 at times t* = 63, 320, 3.2 × 103, and 104.

Intermediates and folding pathways

It is of interest to examine the intermediates that occur as the folding progresses in more detail. The contour graph in Fig. 10 A shows the population distribution of the fraction of native contacts, Q, as a function of reduced time on a logarithmic scale, log(1 + t*), for the beta -sheet at g = 0.9. The population distribution of Q is calculated at selected logarithmic time intervals of log(1 t*) = 0.1 for all 100 trajectories. The graph clearly shows highly populated states as those enclosed by many contour lines. The beta -sheet at g = 0.9 is a fast-folding protein, and the folding trajectories clearly show a fast-track or bottleneck pathway of kinetic folding between two highly populated states: the initial random coil state at t* < 10 and Q approx  0.08 and the native state at t* > 100 and Q approx  1. More evidence of fast-track folding is shown in Fig. 10 B, which shows the population distribution of F*R versus Q averaged over all 100 trajectories and all times. The peaks indicate the highly populated states. The peak at F*R approx  0.8 and Q approx  0.1 corresponds to the initial random coil state and the other peak at F*R approx  0 and Q approx  1 corresponds to the native state. The kinetic folding mechanism for the beta -sheet at g >=  0.9 is represented in Fig. 10 C.



View larger version (29K):
[in this window]
[in a new window]
 
FIGURE 10   (A) The population distribution of the fraction of native contacts, Q, as a function of the reduced time in logarithmic scale, log(1 + t*); (B) the population distribution of the normalized squared radius of gyration, F*R, versus the fraction of native contacts, Q; (C) the kinetic folding mechanism for the beta -sheet at g = 0.9.

The contour graph for the distribution of Q versus log(1 + t*) for the beta -clip at g = 0.7 is shown in Fig. 11 A, and the corresponding population distribution of F*R versus Q from all 100 trajectories is shown in Fig. 11 B. In the contour graph of Fig. 11 A, the random coil state is observed in the range 0 < log(1 + t*) < 0.8 and 0 < Q < 0.1. Two long-lived populations are observed in the range 2.6 < log(1 + t*) < 3.4 at Q approx  0.76 and 3.4 < log(1 + t*) < 5 at Q approx  0.8; these correspond to the intermediates I1 and I2, respectively. Because the native state is located at t>=  105 as shown in Fig. 5 A for g = 0.7 and the results are shown within the time scale up to log(1 + t*) = 5, the population distribution of Q for the native state is not observed in the contour graph. In Fig. 11 B, the peak at F*R approx  -1 and Q approx  0.76 corresponds to the intermediate I1, which can be regarded as a partially-ordered globule as shown in the thermodynamic study of the same model (Jang et al., 2002). It has a small value of R<UP><SUB>g</SUB><SUP>2</SUP></UP>, a large value of Q, and the rolled or kinked structure seen in Fig. 8 a at t* = 1.6 × 103. This partially-ordered globule state occurs because the chain becomes trapped in a conformation dominated by the attraction between non-native contacts, thus disturbing the tendency to order into the native state. Note that the intermediate I1 is highly populated at tapprox  103 and has a value F*R approx  -1, which is much smaller than the value of < F*R>  = -0.4 at tapprox  103 seen in Fig. 2 B, which is the average value of F*R for all trajectories. This suggests that not all 100 trajectories fold via the intermediate I1. In fact, 40% of the trajectories are trapped in the intermediate I1, and the rest of the trajectories fold via the other intermediate, I2, which is located at F*R approx  -0.2 and Q approx  0.8 in Fig. 11 B. The peak for intermediate I2 is obscured by the peak of the highly populated native state at F*R approx  0 and Q approx  0.82. The intermediate I2 is an ordered globule; it has a larger R<UP><SUB>g</SUB><SUP>2</SUP></UP> than the intermediate I1 and has almost the same Q as the native state. The intermediate I2 has a loosely packed and slightly bent structure as seen in Fig. 8 A at t* = 1.3 × 104. There are two possible folding pathways for the collapsed chain from the state in which < F*R> is at its minimum. The first is that it could further collapse to become trapped in the intermediate I1, which can fold either directly into the native state or via the intermediate I2 by elongating its strands. The other pathway is that it can fold via the intermediate I2 to the native state. The kinetic folding scheme for the beta -clip at g = 0.7 is represented in Fig. 11 C.



View larger version (24K):
[in this window]
[in a new window]
 
FIGURE 11   (A) The population distribution of the fraction of native contacts, Q, as a function of the reduced time in logarithmic scale, log(1 + t*); (B) the population distribution of the normalized squared radius of gyration, F*R, versus the fraction of native contacts, Q; (C) the kinetic folding mechanism for the beta -clip at g = 0.7.

The folding kinetics for the beta -clip at g = 0.9 is very different than that at g = 0.7. Fig. 12 A shows the contour plot of the population distribution of Q versus log(1 + t*). The initial random coil state is located at 0 < log(1 + t*) < 0.6 and 0.05 Q < 0.1. There is a short-lived intermediate I located at 0.75 Q < 0.85 and 2.1 < log(1 + t*) < 2.7. The highly populated native state is located at t>=  104 with Q approx  0.9. The properties of intermediate I are the same as those of the intermediate I2 for the g = 0.7 model. In Fig. 12 B, it is located at F*R approx  -0.2 and Q approx  0.8, but its peak is obscured by the native peak at F*R approx  0 and Q approx  0.9. The intermediate I is an ordered globule with a native-like secondary structure as seen in Fig. 8 B at t* = 160. Almost 80% of the trajectories become trapped in the intermediate I, and the rest fold directly to the native state via a fast-track folding pathway. The kinetic folding scheme for the beta -clip at g = 0.9 is represented in Fig. 12 C.



View larger version (29K):
[in this window]
[in a new window]
 
FIGURE 12   (A) The population distribution of the fraction of native contacts, Q, as a function of the reduced time in logarithmic scale, log(1 + t*); (B) the population distribution of the normalized squared radius of gyration, F*R, versus the fraction of native contacts, Q; (C) the kinetic folding mechanism for the beta -clip at g = 0.9.

The folding kinetics for the beta -twist at g = 0.7 is qualitatively similar to that of the beta -clip at g = 0.7 (not shown in figures). The kinetic folding of the beta -twist involves two intermediates I1 and I2. However, unlike the beta -clip at g = 0.7 the intermediate I1 is short-lived and relatively unpopulated in the F*R versus Q plane. Approximately, 20% of the trajectories are trapped in intermediate I1, most of the trajectories fold into intermediate I2, and none of the trajectories fold via the fast-track folding pathway.