| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||


* INFM-CNR Istituto dei Sistemi Complessi, Rome, Italy;
Centro Interdipartimentale per lo Studio delle Dinamiche Complesse, Sezione INFN di Firenze, Florence, Italy; and
Dipartimento di Fisica Università di Firenze, Centro Interdipartimentale per lo Studio delle Dinamiche Complesse, Sezione INFN di Firenze e INFM, Florence, Italy
Correspondence: Address reprint requests to Dr. Fabio Cecconi, Tel.: 39-06-4993-7452; E-mail: cecconif{at}roma1.infn.it.
| ABSTRACT |
|---|
|
|
|---|
model correctly reproduces the cooperative, two-state, folding mechanism of the WW-domain, while the SHG model predicts a transition occurring in two stages: a collapse, followed by a structural rearrangement. The lack of a cooperative folding in the SHG simulations appears to be related to the nonfunnel shape of the energy landscape featuring a partitioning of the native valley in subbasins corresponding to different chain chiralities. However, the SHG approach remains more reliable in estimating the
-values with respect to G
-like description. This may suggest that the WW-domain folding process is stirred by energetic and topological factors as well, and it highlights the better suitability of chemically based models in simulating mutations. | INTRODUCTION |
|---|
|
|
|---|
The structure of this domain, resolved through NMR (4) and x-ray diffraction (3
) (Fig. 1), is characterized by hydrophobic clusters providing the largest contribution to the thermodynamic stability (5
). Cluster 1 (CL1) involves residues Leu7, Trp11, Tyr24, and Pro37, the second cluster (CL2) comprises Tyr23, Phe25, and Arg14. The stability of the molecule also derives from a network of hydrogen bonds whose central element is the highly conserved Asn26 located on strand ß2 and acting both as donor and acceptor in bonds with Pro9, Trp11, Ile28, and Thr29, thus linking strands ß1 and ß3. Two loops are present: Loop I (L1) plays a key role in substrate recognition (3
), since it binds to the phosphate of the pS-P motif of the Proline-rich ligands; Loop II (L2), on the other hand, gives an important contribution to thermal stability (5
). Thermal denaturation experiments (5
) and simplified statistical physics approaches (6
) have shown that the Pin1 WW domain folds following a cooperative two-state mechanism at the temperature TM = 332 K. The mutagenesis analysis performed by the same authors (5
) identified the mutations on Ser16, Ser18, and Ser19 in Loop I as maximally destabilizing for the transition state, so that the formation of L1 appears to be the rate-limiting step in the folding/unfolding process. Loop II (L2) is involved in the formation of the transition state only at high temperatures (5
). Due to the ability of inducing conformational changes in Proline-rich, phosphorylated substrates, Pin1 is a potential regulator of the cell cycle, and may be involved in pathologies like Liddle's syndrome, muscular dystrophy, and Alzheimer's disease (7
,8
).
|
-model (9
The G
-force field is independent of the amino-acid sequence, and it requires the knowledge of the tertiary structure of native states to identify native interactions. Accordingly, the native centric approach cannot be used for ab initio predictions of native folds, even if recent works (22
26
) provide growing evidence that it can be confidently used for the characterization of transition states of real small, fast-folding (submillisecond) (27
,28
) proteins that are characterized by a low level of energetic frustration. However, topological models might not correctly reproduce the folding process when chemical interactions play a relevant role. The SHG model, which is instead based on the chemical and physical properties of amino acids such as hydrophobicity, is, in principle, better applicable to proteins with a higher level of energetic frustration. Moreover, requiring the knowledge of primary and secondary structures only, the model has a greater predictive power, and in this sense, could be considered closer to an ab initio representation. The above arguments motivate a detailed comparison between the two protein models to assess their applicability and potentialities in the study of biomolecules.
| THEORY AND METHODS |
|---|
|
|
|---|
-carbon atoms. The G
energy function, mimicking a perfect funnel landscape, assigns to the native state the lowest energy by simply promoting the formation of native interactions. Here we employ the force field proposed by Clementi et al. (22
atoms relative to residues i and j (|i j|
3) is less than Rc in the native state. This pair undergoes an attractive LJ-interaction
![]() | (1) |
![]() |
= 4.5 Å. These nonnative interactions, besides ensuring the self-avoidance of the chain, generally enhance the cooperativity of the overall folding process (30
) formed by three consecutive residues
![]() |
= 20
. The most important determinant of the secondary structure is the dihedral potential arising from the torsional energy. Each dihedral angle, identified by four consecutive beads, contributes to the potential with the terms
![]() |
is the value of angle i in the native structure, k
(1
and k
(3
. Finally, consecutive residues interact with each other through the potential harmonic in their distance ri, i+1,
![]() | (2) |
Therefore, the global G
-potential reads
![]() |
ij = 1 (0) if the contact is native (nonactive). G
models of the type just outlined may produce a gradual folding behavior incapable of reproducing the typical kinetic cooperativity of two-state folders. Experimental studies suggest (31
![]() | (3) |
= 2 is the scaling factor. The force rescaling determines a higher free energy barrier between the folded and unfolded states in correspondence to the folding temperature, which results in a higher cooperativity. Therefore, the residence times in the folded and unfolded state are expected to be significantly longer.
The SHG model is an off-lattice minimal model that generalizes a previous model introduced by Thirumalai and co-workers (34
,35
). This approach represents
-carbons with beads of three possible types: hydrophobic (B), hydrophilic (L), and neutral (N), according to Table 1. The driving force responsible for the collapse onto a compact structure is the attraction between B-beads, whereas the repulsion between L and N beads determines the rearrangements of the compact structure into the native topology. The long-range interaction between residues, which may be far apart in sequence space, is modeled through the potential
![]() | (4) |
h (1.65 Kcal mol1 see below) sets the energy scale and
= 4.0 Å. The attractive forces between hydrophobic residues is attained by setting S1 = S2 = 1 for BB pairs, while the interactions involving the LL and LB pairs are characterized by S1 = 1/3 and S2 = 1. This interaction is repulsive and the r6 term, which accounts for the hydration shell around the hydrophilic residues, makes the potential longer-ranged than the usual r12. The forces involving neutral residues are also repulsive and amount to an excluded volume potential by setting S1 = 0 and S2 = 0. The secondary structure arises as a result of bending and dihedral interactions, which subrogate side-chain packing and hydrogen-bonding. The analytic expression of the dihedral potential is
![]() | (5) |
i indicates the angle between the two adjacent planes identified by the positions of four consecutive beads. The information on secondary structures is systematically stored in the coefficients A, B, C, and D, which determine a bias on the angles reflecting the propensity of residues to form a specific secondary motif. Indeed, each dihedral in the chain is defined to be either Helical (H: Ai = 0, Bi = Ci = Di = 1.2
h), or Extended (E: Ai = 0.9
h, Ci = 1.2
h, Bi = Di = 0), or Turn (T: Ai = Bi = Di = 0, Ci = 0.2
h). Therefore, the primary structure must be complemented with the auxiliary sequence of E,H,T symbols assigning the appropriate set of coefficients. The decoupling between primary and dihedral sequence, not present in similar models (40
![]() | (6) |
= 20
h/(rad)2, so that large deviations from the equilibrium value
0 = 1.8326 rad are unlikely, and bond angles result basically fixed. Also in this model, stiff springs (2
h/R = 1070.96 K, time in units of t =
(
h/M)1/2= 4.44 ps (
= 4.0 Å is the equilibrium length of Lennard-Jones interactions, M = 110 is the average amino-acid mass), energy in units
h, specific heat in units R = 1.9872 x 103 Kcal mol1 K1, and the radius of gyration in units
. The energy scale
h was set to 1.65 Kcal mol1 to reach a denaturation temperature compatible with experimental data (5
-model, the same units apply, except for the energy scale set to
= 0.66 Kcal mol1. During the simulations, we monitored the difference from the native state or reference state through the overlap Q, representing the fraction of formed native contacts as
![]() |
(u) indicates the unitary step function. A value Q
1, indicates that the conformation is nativelike, while values close to zero refer to denatured states. We also considered, as further reaction coordinates of the folding/unfolding process, the gyration radius and the root mean-square distance (RMSD) between the current and reference conformations after an optimal superposition performed according to Kabsch's algorithm (39
(E, Q), which, in turn, will be used to compute the thermodynamics of the system. The knowledge of
(E, Q) can be also employed to derive the probability that, at temperature T, the protein states are characterized by energy E and reaction coordinate Q,
![]() |
![]() |
|
![]() |
···
is taken over time, assuming the dynamics to be ergodic. The value Pij(T) typically features a sigmoidal shape, keeping values close to 1 at low temperatures and decreasing to zero at high temperatures. The knowledge of probabilities Pij(T) allows for a classification and ranking of native contacts according to their thermodynamic relevance (42
-values
The comparison of the G
and SHG models on the WW domain provide the opportunity to study the relevance of topological versus energetic frustration (47
) in the folding mechanism. This can be accomplished by
-values computation and by the further comparison with experimental data. The
-values (15
) measure the perturbation effects of a site-directed mutation which, by altering the free energy difference among native, transition, and unfolded states, may affect the thermodynamics and the kinetics of the reaction. A prevalence of topological or energetic frustration may be argued from a better fit with the experiments of the G
-derived or SHG-derived
-values, respectively (22
,45
). The
-values can be computed through a kinetic approach from the folding and unfolding rates of the mutant and wild-type protein (48
),
![]() | (7) |

G0. The use of Eq. 7 is computationally demanding, as it requires a simulation for each mutation. This motivates the use of a thermodynamic strategy for the
-value evaluation (48
![]() | (8) |

G
is the change in stability of the free-energy barrier between the native and denatured state. Equation 8 is equivalent to Eq. 7 when Kramer-like theory applies (47
If the effect of the mutations is sufficiently small, then, following Clementi et al. (22
), the
-values can be derived by a free-energy perturbation approach,
![]() | (9) |
In this article, the
-values are computed according to Eq. 9, using a method developed in Clementi et al. (22
) that can be summarized in the following steps:
-values.
Structural information about the native-like-ness of the transition state was also gained from the so-called structural
-values (49
),
![]() | (10) |
| RESULTS |
|---|
|
|
|---|
-model and then we discuss the corresponding scenario in the SHG-model approach. Since the implementation of the SHG model requires a well-designed sequence, we employed the 640 truncated sequence already optimized in Brown et al. (11
-simulations, the corresponding fragment was extracted from the NMR structure stored in the PDB file 1NMV (4
G
-model
A folding simulation was performed through a gradual cooling of a random coil structure from a temperature T = 1.5 down to 0.5 in 40 steps. The specific heat profile, Fig. 2, is characterized by a single narrow peak at temperature Tf = 1.0, suggesting a possible two-state process. The same conclusion can be drawn from the ratio of van 't Hoff over the calorimetric enthalpy changes amounting to 0.74 without and 0.99 with standard baseline subtraction (50
). The folding/unfolding processes are reversible in temperature, as shown by the agreement between specific heat plots. The other observables used to characterize the folding transition such as, RMSD, overlap, and gyration radius exhibit an abrupt change in correspondence to the folding temperature Tf (Fig. 3). Free energy profile (Fig. 4) as a function of the overlap, around the folding temperature, clearly features two distinct wells identifying the folded and unfolded ensembles separated by a barrier corresponding to the transition-state conformations. The shape of the free-energy plot suggests a choice of overlap windows for the sampling of conformations in the three ensembles F, U, and TS (see Fig. 4 legend) for the computation of
-values (Methods). In Fig. 5 we compare our single-site simulated
-values (Eq. 9) with the experimental data by Gruebele (5
). In the G
-like approach, a mutation can be modeled as the removal of a single native contact (22
) or in alternative, as an average over all possible removals of contacts involving the same residue. We followed the second strategy, considering only contacts |i j|
3. In this scheme, we cannot evaluate
-value of Ser18 because it lacks such contacts. The theoretical
-values in Fig. 5 vary in the range [0.0, 0.5], whereas the experimental ones are distributed in a much wider interval. This feature is an expected result of the very limited energetic frustration of the G
-force fields (47
). The discrepancy is reflected by the modest value of the linear correlation coefficient r = 0.54 (see regression line in the Fig. 5 a, inset). Of course we cannot exclude that a possible improvement of
-value accuracy might be achieved either by employing other mutation implementations or by using alternative contact maps accounting for the high flexibility of the native structure of peptides and small proteins (51
). Despite this not-high correlation, the theoretical
-values provide a qualitative indication about the molecule regions that are still nativelike in the transition state. The plot in fact indicates that the sites most sensitive to mutations are those in the region of loops L1 and L2, in agreement with experimental results (see Fig. 5 b).
|
|
|
|
-values (Fig. 5 c) is consistent with that derived from the perturbation method. In fact, in this case, the highest
-values also correspond to residues located in L1 (Ser19), L2 (Thr29), or in the neighborhood of the first hydrophobic cluster CL1 (Pro8). The low
-values pertain mainly to residues in strands ß1 and ß2, suggesting that these two regions are unlikely to be in contact in the transition state.
SHG-model
Ten independent folding simulations starting from random-coil conformations were performed through a gradual cooling schedule from temperature T = 1.0 to T = 0.01 in 40 steps. The final structures were further relaxed by a steepest-descent cycle until the maximal total force per monomer reached a value smaller than 108 Kcal mol1 Å1. We obtained different folds, and chose the conformation with lowest energy (E = 19.0035
) and lowest RMSD (4.74 Å) from the PDB structure as the reference structure
0 (Fig. 1). However, the simulations revealed also the existence of another degenerate minimum with the same energy and specular to
0 resulting in much higher RMSD.
Despite the large value of RMSD,
0 correctly displays the topology of a triple-stranded, antiparallel ß-sheet; this lacks the typical twist of the PDB structure, though, making loop L2 almost perpendicular to loop L1 (see Fig. 1). As a result, the folded structure is much more compact than the real protein and has a much larger number of native contacts (71 vs. 41). The fact that 22 out of the 41 PDB contacts are also present in the folded structure is an indication of the satisfactory structural performance of the SHG simulation.
Structure
0 was then denatured through 10 independent runs with the same but inverse temperature schedule, involving a thermalization stage of 6 x 106 time steps (
t = 0.005) at each temperature, followed by a run over the same length, where control parameters were measured to assess the unfolding progress. The course of both folding and unfolding simulations was monitored through the analysis of the energy, the specific heat, RMSD from
0, the overlap, and the radius of gyration, Rg.
|
The existence of the shoulder in the folding Cv plot is a signature of a noncooperative folding mechanism in which an initial collapse is followed by a structural chain rearrangement, characterized by a significant increase in the number of native contacts unaffecting the overall compactness of the molecule (see Fig. 7). This is confirmed by the thermal fluctuation of the structural overlap
![]() |
|
-collapse, whereas the shoulder corresponds to the folding transition. In fact, kinetic simulations at temperature T = 0.237 show that the time evolution of Q(t) exhibit jumps between the two free energy wells (upper panel of Fig. 8). The double-well shape of the free-energy profile again allows us to sample conformations in the folded (F), transition state (TS), and unfolded (U) ensemble used to implement the perturbation technique for
-value computation (Methods). The plot in Fig. 9 shows the
-values restricted to the set of residues mutated by Gruebele (5
-values feature two major peaks in correspondence with loop L1 and loop L2, which is a qualitative resemblance with experiments. A more quantitative comparison is provided by the correlation coefficient between theoretical and experimental data amounting to r = 0.65.
|
|
0. This is a further indication of the high level of frustration of the free-energy landscape associated to the sequence, and it is in agreement with the findings by Miller and Wales (52
force field, indicates that this feature is mainly peculiar to the model, rather than to this specific protein.
|
To analyze the folding/unfolding process, we considered three temperature windows corresponding to different regions of the specific heat plots. The first window (T < 0.15) refers to the pre-transition baseline of the Cv plot, the second window (0.15
T
0.30) insists on the region of the shoulder, and the third window (T > 0.30) includes the main peak. The contacts appearing or disappearing in correspondence of the three windows are shown in black, red, and green, respectively, in the contact maps (Fig. 11) summarizing the main events of the pathway. Shaded symbols represent weak interactions with probability of formation below 50% at the lowest simulation temperature T = 0.01.
|
0, so that their formation probabilities are always below 50%. The folding is completed by the appearance of a few contacts between residues very far from each other along the protein chain.
During the unfolding reaction, no native (with respect to
0) contact breaks down in the low-temperature window because the heating schedule enables the protein to escape easily from kinetic traps, making the process much less gradual than folding. This reflects on the smaller number of contacts broken in the shoulder region as compared to the number of contacts formed in the same temperature range during folding. In particular, the cleavage occurs of ß1-ß3, ß2-tail and head-ß2 contacts, whereas the dissolution of the contacts of loop 1 and loop 2 is delayed to the region of the peak of the Cv plot where most ß1-ß2 and ß2-ß3 contacts also disappear.
The comparison of the two contact maps thus reveals that the sequences of the molecular events in the folding and unfolding processes are basically reverse to each other, even if the unfolding is a more abrupt phenomenon occurring in a narrower temperature window.
| DISCUSSION |
|---|
|
|
|---|
and SHG models are appropriate to simulate complementary features of the folding process. The G
model, in fact, being based on the influence of the native-state topology on the folding process, is independent from the amino-acid sequence and it completely disregards the chemical properties of the molecule. The SHG model, on the other hand, is a minimal model, where the chemical features of amino acids are partially included, determining the folding driving force.
Our simulations showed that the G
model with angular bias (22
) and rescaling (33
) can correctly reproduce the reversible, cooperative, two-state mechanism of folding of hPin1 WW domain (5
). The reversibility, indeed, appears from the almost perfect superposition of the CV plots of folding and unfolding. Several elements, on the other hand, suggest a cooperative, two-state mechanism: the CV plots show a single sharp peak, the ratio of the van 't Hoff to calorimetric enthalpy is close to 1 (
) (53
), all the indicators used to monitor the similarity with the native state exhibit a sharp sigmoidal thermal behavior, and the barrier between the two free-energy wells at the folding temperature is very high (54
,55
). The results from the simulation using the SHG model were rather ambiguous. The simulated thermograms featured not only a peak, but also a shoulder at lower temperature. This is the signature of a noncooperative folding involving a collapse into a compact, only partially structured globule, followed by a rearrangement into a native conformation. This scenario is confirmed by the thermal behavior of the structural parameters (overlap, gyration radius, and RMSD from native structure) used to monitor the folding reaction. The results are consistent with the findings by Nymeyer et al. (56
) and by Guo and Brooks III (41
), in their simulations on the model by Honeycutt and Thirumalai (34
). The SHG formulation, although being an improvement of the latter model, still retains some of its drawbacks. Indeed, the conformations of the native ensemble (F), sampled at the folding temperature, can be clustered in two groups with nonoverlapping RMSD distributions and opposite chiralities (57
). The existence of two distinct clusters of nativelike conformations can be easily explained by examining the low-temperature free-energy profiles as a function of the RMSD (from reference structure
0): the native basin appears to be partitioned in subbasins separated by barriers. The partitioning of the native basin is likely a feature that the SHG model inherited from the Thirumalai model. Miller and Wales (52
), in fact, analyzed the disconnectivity graph of the potential energy surface of Thirumalai's force field, drawing the conclusion that the energy hypersurface is not a single funnel, but it contains low-energy minima separated by high barriers.
Presumably, the reason for the degeneration of the native state of the SHG model relies on the symmetry of the dihedral potential V
(Eq. 5). In particular, the sequence designed to represent the hPin1 WW domain, contains only Extended or Turn symbols, so that V
is a polynomial in cos(
) and becomes symmetric for the inversion
. The symmetry of the V
term, however, is not the only reason for the poor performance of the SHG model. In fact, we find that the energy histograms of the folded- and unfolded-state ensembles are significantly overlapping, thus suggesting the existence of many low-energy, nonnative conformations (41
).
We suspect that this is an effect of the only approximated maximization of the energy gap between the native conformation and the decoy set used in the sequence optimization procedure (11
). This would call for further refinements of the threading procedure.
Despite the several drawbacks, the SHG model enabled the computation of perturbation
-values in qualitative agreement with experimental data. The linear correlation coefficient between theoretical and experimental
-values (r = 0.65) is actually better than the one yielded by the G
simulation (r = 0.54). The explanation of these results must be sought in the partial incorporation of the chemistry in the SHG description. Indeed, real mutations are chemical transformations of the molecule and they are better simulated by a chemically based model such as the SHG rather than by a topological model. In the G
model, in fact, mutations are generally simulated by the removal of native contacts (22
); however, they may affect all the interactions in which a residue is involved. The SHG model, conversely, offers the possibility to treat mutations in a more realistic way because it implements shifts in the hydrophobic character of residues or changes in the secondary structural bias of dihedral angles. Moreover, the better agreement of experimental data with the SHG-computed
-values may show that the folding mechanism of hPin1 WW domain is controlled not only by topological but also by energetic factors.
The significant differences, beyond statistical errors, between the
-values profiles yielded by the two models, in our opinion, reflect the different strategies upon which the two models are built.
A final issue that deserves some discussion is the quality of the structural prediction using the SHG model. The SHG is a minimal model based on chemical properties of the system and a good outcome of the simulation is not a-priori guaranteed. The simulations show that, apart from chirality problems, the best final structure
0 (Fig. 1) presents the correct topology of a three-stranded antiparallel ß-sheet of the hPin1 WW domain, even if the structure appears to be more compact. This, however, does not prevent the correct formation of both hydrophobic clusters. Moreover,
0 shares 22 of the 41 native contacts of the PDB structure.
| CONCLUSIONS |
|---|
|
|
|---|
and SHG models that represent two different strategies to the folding problem. Our simulations indicated that for the specific WW domain considered in this work, the G
model, with angular bias and rescaling, correctly reproduces the cooperative, two-state, reversible folding mechanism, whereas the SHG model does not. The reasons for the limitations of the SHG model must be sought in the insufficient optimization the sequence and in the nonfunnel shape of the landscape. As a consequence, the present version of the SHG model does not allow reliable predictions of the folding mechanism. The satisfactory performance of the SHG model in the computation of
-values, however, clearly shows the importance of incorporating the chemical properties of the sequence in a protein model. Our work, highlighting the limits of the SHG model, is thus intended to be a starting point for a further refinement of the model, in the firm belief that coarse-grained, minimal models represent viable alternatives to computationally demanding all-atom simulations in investigations of large-sized, slow-folding proteins. Submitted on June 22, 2005; accepted for publication April 6, 2006.
| REFERENCES |
|---|
|
|
|---|
2. Karanicolas, J., and C. L. Brooks III. 2003. Structural basis for biphasic kinetics in the folding of the WW domain from a formin-binding protein: lessons for protein design? Proc. Natl. Acad. Sci. USA 100:39543959.
3. Verdecia, M. A., M. E. Bowmann, K. P. Lu, T. Hunter, and J. P. Noel. 2000. Structural basis for phosphoserine-proline recognition by WW domains. Nat. Struct. Biol. 7:639643.[CrossRef][Medline]
4. Bayer, E., S. Goettsch, J. W. Mueller, B. Griewel, E. Guiberman, L. M. Mayr, and P. Bayer. 2003. Structural analysis of the mitotic regulator hPin1 in solution: insights into domain architecture and substrate binding. J. Biol. Chem. 278:2618326193.
5. Jäger, M., H. Nguyen, J. C. Crane, J. W. Kelly, and M. Gruebele. 2001. The folding mechanism of a ß-sheet: the WW domain. J. Mol. Biol. 311:373393.[CrossRef][Medline]
6. Bruscolini, P., and F. Cecconi. 2005. Analysis of Pin1 WW domain through a simple statistical mechanics model. Biophys. Chem. 115:153158.[CrossRef][Medline]
7. Garnier, L., J. W. Wills, M. F. Verderame, and M. Sudol. 1996. WW domains and retrovirus budding. Nature. 381:744745.[CrossRef][Medline]
8. Sudol, M. 1996. Structure and function of the WW domain. Prog. Biophys. Mol. Biol. 65:113132.[Medline]
9. G
, N., and H. A. Scheraga. 1976. On the use of classical statistical mechanics in the treatment of polymer chain conformations. Macromolecules. 9:535542.[CrossRef]
10. Sorenson, J. M., and T. Head-Gordon. 2000. Matching simulation and experiment: a new simplified model for simulating protein folding. J. Comp. Biol. 7:469481.
11. Brown, S., N. J. Fawzi, and T. Head-Gordon. 2003. Coarse-grained sequences for protein folding and design. Proc. Natl. Acad. Sci. USA. 100:1071210717.
12. Micheletti, C., J. R. Banavar, A. Maritan, and F. Seno. 1999. Protein structures and optimal folding from a geometrical variational principle. Phys. Rev. Lett. 82:33723375.[CrossRef]
13. Munõz, V., E. R. Henry, J. Hofrichter, and W. A. Eaton. 1998. A statistical mechanical model for ß-hairpin kinetics. Proc. Natl. Acad. Sci. USA. 95:58725879.
14. Alm, E., and D. Baker. 1999. Prediction of protein-folding mechanisms from free-energy landscapes derived from native structures. Proc. Natl. Acad. Sci. USA. 96:1130511310.
15. Fersht, A. R. 2000. Transition-state structure as a unifying basis in protein-folding mechanisms: contact order, chain topology, stability, and the extended nucleus mechanism. Proc. Natl. Acad. Sci. USA. 97:15251529.
16. Baker, D. 2000. A surprising simplicity to protein folding. Nature. 405:3942.[CrossRef][Medline]
17. Martinez, J. C., M. T. Pisabarro, and L. Serrano. 1998. Obligatory steps in protein folding and the conformational diversity of the transition state. Nat. Struct. Biol. 5:721729.[CrossRef][Medline]
18. Martinez, J. C., and L. Serrano. 1999. The folding transition state between SH3 domains is conformationally restricted and evolutionarily conserved. Nat. Struct. Biol. 6:10101016.[CrossRef][Medline]
19. Riddle, D. S., V. P. Grantcharova, J. V. Santiago, E. Alm, I. Ruczinski, and D. Baker. 1999. Experiment and theory highlight role of native state topology in SH3 folding. Nat. Struct. Biol. 6:10161024.[CrossRef][Medline]
20. Chiti, F., N. Taddei, P. M. White, M. Bucciantini, F. Magherini, M. Stefani, and C. M. Dobson. 1999. Mutational analysis of acylphosphatase suggests the importance of topology and contact order in protein folding. Nat. Struct. Biol. 6:10051009.[CrossRef][Medline]
21. Plaxco, K. W., K. T. Simons, and D. Baker. 1998. Contact order, transition state placement and the refolding rates of single domain proteins. J. Mol. Biol. 277:985994.[CrossRef][Medline]
22. Clementi, C., H. Nymeyer, and J. N. Onuchic. 2000. Topological and energetic factors: what determines the structural details of the transition state ensemble and "en-route" intermediates for protein folding? J. Mol. Biol. 298:937953.[CrossRef][Medline]
23. Galzitskaya, O. V., and A. V. Finkelstein. 1999. A theoretical search for folding/unfolding nuclei in three-dimensional protein structures. Proc. Natl. Acad. Sci. USA. 96:1129911304.
24. Zhou, Y., and M. Karplus. 1999. Folding of a model three-helix bundle protein: a thermodynamic and kinetic analysis. J. Mol. Biol. 293:917951.[CrossRef][Medline]
25. Ding, F., N. V. Dokholyan, S. V. Buldyrev, H. E. Stanley, and E. I. Shakhnovich. 2002. Direct molecular dynamics observation of protein folding transition state ensemble. Biophys. J. 83:35253532.
26. Ozkan, S. B., K. A. Dill, and I. Bahar. 2003. Computing the transition state populations in simple protein model. Biopolymers. 68:3546.[CrossRef][Medline]
27. Kubelka, J., J. Hofrichter, and W. A. Eaton. 2004. The protein folding "speed limit". Curr. Opin. Struct. Biol. 14:7688.[CrossRef][Medline]
28. Gruebele, M. 1999. The fast protein folding problem. Annu. Rev. Phys. Chem. 50:485516.[CrossRef][Medline]
29. Miyazawa, S., and R. L. Jernigan. 1985. Estimation of effective interresidue contact energies from protein crystal structures: quasi-chemical approximation. Macromolecules. 18:534552.[CrossRef]
30. Clementi, C., and S. S. Plotkin. 2004. The effects of nonnative interactions on protein folding rates: theory and simulations. Protein Sci. 13:17501766.
31. Northey, J. G. B., A. A. D. Nardo, and A. R. Davidson. 2002. Hydrophobic core packing in the Sh3 domain folding transition state. Nat. Struct. Biol. 9:126130.[CrossRef][Medline]
32. Di Nardo, A. A., D. M. Korzhnev, P. J. Stogios, A. Zarrine-Afsar, L. E. Kay, and A. R. Davidson. 2004. Dramatic acceleration of protein folding by stabilization of a nonnative backbone conformation. Proc. Natl. Acad. Sci. USA. 101:79547959.
33. Kaya, H., Z. Liu, and H. S. Chan. 2005. Chevron behavior and isostable enthalpic barriers in protein folding: successes and limitation of simple G
-like modeling. Biophys. J. 89:520535.
34. Honeycutt, J. D., and D. Thirumalai. 1992. The nature of folded states of globular proteins. Biopolymers. 32:695709.[CrossRef][Medline]
35. Veitshans, T., D. Klimov, and D. Thirumalai. 1996. Protein folding kinetics: timescales, pathways and energy landscapes in terms of sequence-dependent properties. Fold. Des. 2:122.
36. Sorenson, J. M., and T. Head-Gordon. 1999. Redesigning the hydrophobic core of a model ß-sheet protein: destabilizing traps through a threading approach. Proteins Struct. Funct. Gen. 37:582591.
37. Shakhnovich, E. I. 1998. Protein design: a perspective from simple tractable models. Fold. Des. 3:R45R58.[CrossRef][Medline]
38. Evans, D. J., W. G. Hoover, B. H. Failor, B. Moran, and A. J. C. Ladd. 1983. Non-equilibrium molecular dynamics via Gauss's principle of least constraint. Phys. Rev. A. 28:10161021.[CrossRef]
39. Kabsch, W. 1976. Solution for best rotation to relate two sets of vectors. Acta Crystallogr. 32:922923.[CrossRef]
40. Ferrenberg, A. M., and R. H. Swendsen. 1989. Optimized Monte Carlo data analysis. Phys. Rev. Lett. 63:11951198.[CrossRef]