help button home button Biophys. J.
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS

Originally published as Biophys J. BioFAST on April 7, 2006.
doi:10.1529/biophysj.105.077057
This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow All Versions of this Article:
biophysj.105.077057v1
91/1/263    most recent
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Geierhaas, C. D.
Right arrow Articles by Clarke, J.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Geierhaas, C. D.
Right arrow Articles by Clarke, J.
Biophysical Journal 91:263-275 (2006)
© 2006 The Biophysical Society

Structural Comparison of the Two Alternative Transition States for Folding of TI I27

Christian D. Geierhaas *, Robert B. Best *, Emanuele Paci {dagger}, Michele Vendruscolo {ddagger} and Jane Clarke *

* Department of Chemistry, Medical Research Council Centre for Protein Engineering, University of Cambridge, Cambridge CB2 1EW, United Kingdom; {dagger} Institute of Molecular Biophysics, School of Physics and Astronomy, University of Leeds, Leeds LS2 9JT, United Kingdom; and {ddagger} Department of Chemistry, University of Cambridge, Cambridge CB2 1EW, United Kingdom

Correspondence: Address reprint requests to Jane Clarke, Tel.: 44-1223-336426; Fax: 44-1223-336362; E-mail: jc162{at}cam.ac.uk; or Michele Vendruscolo, Tel.: 44-1223-763848; Fax: 44-1223-336362, E-mail: mv245{at}cam.ac.uk.


    ABSTRACT
 TOP
 ABSTRACT
 INTRODUCTION
 METHODS
 RESULTS
 DISCUSSION
 CONCLUSIONS
 ACKNOWLEDGEMENTS
 REFERENCES
 
TI I27, a ß-sandwich domain from the human muscle protein titin, has been shown to fold via two alternative pathways, which correspond to a change in the folding mechanism. Under physiological conditions, TI I27 folds by a classical nucleation-condensation mechanism (diffuse transition state), whereas at extreme conditions of temperature and denaturant it switches to having a polarized transition state. We have used experimental {Phi}-values as restraints in ensemble-averaged molecular dynamics simulations to determine the ensembles of structures representing the two transition states. The comparison of these ensembles indicates that when native interactions are substantially weakened, a protein may still be able to fold if it can access an alternative transition state characterized by a much larger entropic contribution. Analysis of the probability distribution of {Phi}-values derived from ensemble averaged simulations, enables us to identify residues that form contacts in some members of the ensemble but not in others illustrating that many interactions present in transition states are not strictly required for the successful completion of the folding process.


    INTRODUCTION
 TOP
 ABSTRACT
 INTRODUCTION
 METHODS
 RESULTS
 DISCUSSION
 CONCLUSIONS
 ACKNOWLEDGEMENTS
 REFERENCES
 
Considerable experimental evidence suggests that most of the proteins that fold by a two-state mechanism reach their native state via a single transition state (1Go–3Go). These proteins populate only two dominant free-energy minima and also often allow only one pathway to interconvert between these states. Such an evolutionary design may be a consequence of the necessity to minimize the tendency to misfold (4Go–8Go). It appears to be remarkably effective; indeed only a few proteins are known to fold by multiple folding pathways (9Go), the majority of those result from cis/trans prolyl isomerization (10Go,11Go) or from the presence of intermediates that might be en route to the native state or a nonproductive off-pathway trap (11Go–15Go). The existence of multiple unfolding pathways has also been proposed on the basis of theoretical considerations, e.g., (4Go,16Go–18Go). One of the most unambiguous experimental examples of a protein having two alternative folding pathways is the case of the immunoglobulin domain TI I27 (7Go,8Go).

In the case of two-state proteins, important insights about the determinants of the folding process have been provided by the evidence that the folding nucleus can be changed by mutations (e.g., protein G and protein L) (19Go) and by circular permutation (e.g., S6) (6Go). It is therefore of great interest to analyze in detail the case in which changes in the thermodynamic conditions, not simply in the sequence, can promote folding along alternative pathways.

It has recently been suggested that folding mechanisms fall on a continuum between strictly hierarchical, with early formation of secondary structure, to purely nucleation-condensation (concomitant formation of secondary and tertiary structure) and that the folding mechanism for any given protein will depend on the secondary structure propensity of the molecule (20Go). Where secondary structure propensity is high, a hierarchical mechanism (such as diffusion-collision) is likely, as has been observed in the engrailed homeodomain and protein G (20Go,21Go). By contrast, where secondary structure propensity is low, a pure nucleation-condensation mechanism will be observed, as shown classically for CI2 (22Go,23Go). The transition states observed in nucleation condensation mechanisms are diffuse, with a characteristic pattern of high and low fractional {Phi}-values distributed throughout the molecule.

Polarized transition states have been observed in a number of proteins, probably representing a hybrid between nucleation-condensation and diffusion-collision folding mechanisms (20Go,24Go,25Go). High {Phi}-values are not evenly distributed over the molecule but usually clustered at one position, with the rest of the structure exhibiting mainly low {Phi}-values (hence they may also be called "localized" transition states). A polarized TSE is observed in SH3 domains, in which fully established contacts are mostly observed in turns that have to be formed to bring the remaining chain together (26Go–30Go). A polarized transition state is not necessarily unstructured: Garcia-Mira et al. recently examined the transition state for folding of CspB using {Phi}-value analysis (25Go). CspB folds via a strongly polarized transition state with a ßT of 0.9, thus most regions of the protein are folded in the TS—but most native interactions are not yet fully established (25Go). The most polarized {Phi}-value distribution, and thus transition state, has been observed in the circular permutant P13-14 of S6 (6Go). This result is particularly interesting because wild-type S6 usually folds via a nucleation-condensation mechanism (31Go,32Go). In the circular permutant, the folding nucleus is polarized toward the artificially linked C- and N-termini.

In this work we investigate whether the different patterns of {Phi}-values observed for TI I27 under different experimental regimes indicate a switch in folding mechanism, brought about by changes in conditions, rather than a change in sequence or chain connectivity. We analyze the two transition states for folding of TI I27 that have been characterized individually by {Phi}-value analysis (7Go). We use the experimental {Phi}-values of either transition state as restraints in molecular dynamics simulations. The simultaneous enforcement of a large number of restraints derived from experimental {Phi}-values in molecular dynamics simulations not only results in an ensemble of conformations that represents the transition state, but also avoids possible problems in the interpretation of individual {Phi}-values (33Go,34Go). The fact that a single polypeptide chain is required to satisfy all the restraints simultaneously considerably reduces the number of alternative explanations that should be considered when one considers individual {Phi}-values on their own. The remarkable success of this type of restrained simulation is a consequence of the nucleation mechanism, which requires the formation of a specific set of interactions in the transition state. Therefore the topology of the transition state is determined when just a small number of {Phi}-values are specified (3Go,35Go–38Go).

The comparison of the two corresponding transition state ensembles of TI I27 indicates that at physiological conditions (denoted as pathway L), this protein folds by a classical nucleation-condensation mechanism, characterized by a tightly packed folding nucleus with the remaining structure in a nativelike topology, whereas at high concentrations of denaturant (denoted as pathway H) it follows a different mechanism, characterized by a polarized transition state.


    METHODS
 TOP
 ABSTRACT
 INTRODUCTION
 METHODS
 RESULTS
 DISCUSSION
 CONCLUSIONS
 ACKNOWLEDGEMENTS
 REFERENCES
 
Protein system
The 89-residue protein TI I27, the 27th immunoglobulin domain from human cardiac titin, exhibits a ß-sandwich structure with two ß-sheets. The A-B-E-D sheet is the N-terminal sheet and the C-F-G-A' sheet is the C-terminal sheet. As starting structure for all simulations we used the NMR solution structure of TI I27 (67Go), 1TIT in the Protein Data Bank (see Fig. 1).


Figure 1
View larger version (32K):
[in this window]
[in a new window]
 
FIGURE 1  Structure of TI I27 and experimental {Phi}-values. (A) TI I27 has an Ig-like fold (67Go). The native structure has one ß-sheet containing the A, B, D, and E strands and a second ß-sheet with the C, F, G, and A' strands. This picture was generated using the programs RASMOL (74Go), MOLSCRIPT (75Go), and RENDER (76Go). Experimental {Phi}-values of TSEH (B) (7Go), (C) TSEL (7Go,41Go), and (D) TNfn3 (52Go).

 
The {Phi}-values were measured by Wright et al. using unfolding kinetics and extrapolated to 0 M GdmCl (7Go). For the pathway dominant at moderate conditions, i.e., via TSEL, the same set of 22 {Phi}-values as used in our previous study were taken as input in simulations. The {Phi}-value of I23A was measured to be 1.22(±0.03), but in the simulations this value was set to 1.0 because a {Phi}-value >1.0 cannot be interpreted in our model using fractions of native contacts. Similarly, G32A was neglected because {Phi}-values for Gly are undefined in our simulations (38Go). In the pathway dominant at extremes of denaturant the negative {Phi}-values of V4A, L8A, H56A, and A75 were set to zero because the negative {Phi}-values cannot be interpreted in the model of calculated fractions of native contacts. We expect these residues to be nonnative in the transition state. The {Phi}-values for I23A, G32A, A82G, and V86A were neglected because they are either nonclassical {Phi}-values or associated with large errors and I2A was also set to zero because we expect it to be zero in TSEH. Finally a set of 19 experimental {Phi}-values were used as input in simulations for the pathway dominant at extremes of denaturant, i.e., via TSEH.

For TNfn3, residues 803–891 in the original Protein Data Bank entry 1TEN were renumbered 1–89 with L803 as residue 1. It is the same numbering as used in our previous simulations (35Go,36Go). This numbering differs from that used by Hamill et al. (52Go); they numbered residues 1–90 with the incompletely resolved R802 as residue 1.

Equilibration of the native state
We used the same protocol for the equilibration of the native state described in Paci et al. (36Go). Simulations were performed with the program CHARMM (68Go) and with an all-atom energy function (EEF1) that implicitly includes the effects of the solvent (69Go).

Starting from the experimental structure (1TIT), we first performed a short steepest descent minimization to remove possible steric clashes. We then heated the protein to 300 K and equilibrated it for 0.5 ns. During equilibration the protein remained relatively close to the initial conformation. The resulting structure is the actual starting structure for the simulations to determine the TSE. The integration step was 2 fs for all of the simulations performed. Temperature was kept constant using the Nose-Hoover thermostat.

Molecular dynamics simulations restrained with {Phi}-values
A pseudoenergy term was imposed on the MD simulations to guide the sampling toward transition state structures. In the simulations, the {Phi}-value of residue i in a particular configuration was defined as the fraction of native contacts made by that residue, i.e., Formula. When more than one replica was used, the {Phi}-value was defined to be the average {Phi}-value over all replicas. To drive the simulations toward satisfying the experimental data, the biased molecular dynamics method of Paci and Karplus (70Go) was adapted for the replica method as follows: a reaction coordinate {rho} was defined as the mean-squared difference between the Ne experimental {Phi}-values, Formula, and the simulated {Phi}-values, Formula (Eq. 1):

Formula 1(1)

To avoid forcing the system toward the desired low values of {rho}, an energy was added to the Hamiltonian only when {rho} exceeded the lowest value, {rho}0, attained up to that point in the simulation, as defined by Eq. 2:

Formula 2(2)

In this expression, the constant {alpha} controls the weight of the bias relative to the force field. The factor M, corresponding to the number of replicas, ensures that the restraints will have the same weight relative to the force-field energy, regardless of the number of replicas. Information was shared between the replicas using MPI routines, as described in more detail in Best et al. (71Go). Transition state ensembles consisting of one, two, or four replicas were generated by this method (72Go).

Selection of the sampling temperature
A harmonic potential is applied to keep the reaction coordinate {rho} close to zero, so that the {Phi}exp values are restrained around the {Phi}cal values and the TSE is sampled at different temperatures. The prefactor {alpha} of the harmonic term was chosen so that {rho}(t) is kept at <0.001. Then the temperature is increased to enhance sampling efficiency and to favor the presence of higher energy, nonnative structures.

We have used the ßT-value to estimate SASA of the TSE of both TSL and TSH/TNfn3 (C. D. Geierhaas,. A. A. Nickson, K. Lindorff-Larsen, J. Clarke, and M. Vendruscolo, unpublished data). The results are 7000 ± 800 and 5500 ± 500 Å2 for TSH/TNfn3 and TSL, respectively. In the multireplica simulations for TSL, the structures at 360 K showed the best agreement with the expected value of 5500 ± 500 Å2. For TSH and TNfn3, the structures sampled at 500 K were in the range of the predicted value (7000 ± 800 Å2). In the single replica case structures were sampled at 300, 360, 430, 500, 640, and 780 K and accepted if their solvent-accessible surface area was within the predicted region.

Calculation of the interaction maps
The all-atom energy function (EEF1) that implicitly includes the effects of the solvent (69Go) can be decomposed into the sum of pairwise interactions between residues (69Go,73Go). Thus, the effective energy, which includes the solvent contribution, can be written as a sum over pairs of residues as:

Formula 3(3)

The energy map is a graphical representation of interaction matrices where the element i, j is the EEF1 interaction energy between residues i and j, averaged in the TSE and in the equilibrated native state. Only noncovalent interactions between amino acids that are at least two residues apart are considered. The B-score is calculated as defined by Vendruscolo et al. (39Go).


    RESULTS
 TOP
 ABSTRACT
 INTRODUCTION
 METHODS
 RESULTS
 DISCUSSION
 CONCLUSIONS
 ACKNOWLEDGEMENTS
 REFERENCES
 
The transition state ensemble of pathway H (TSEH)
The general properties of the ensembles of structures representing the transition state ensemble for the pathway dominant at high concentration of denaturant, TSEH, are reported in Table 1. The four-replica ensemble is slightly more heterogeneous in terms of RMSD than the one- and two-replica ensembles, but they share other overall features, such as average solvent accessible surface area (SASA) and radius of gyration (Rg). The data that are discussed in the following sections were gathered from four-replica simulations, but similar results are obtained from the one- and two-replica simulations.


View this table:
[in this window]
[in a new window]
 
TABLE 1  General properties of the transition state ensembles

 
A total of 4000 structures were generated using 19 experimental {Phi}-values as restraints (7Go). Simulations were performed at a pseudotemperature of 500 K, because at this temperature the structures sampled have a ß-Tanford value (ßT) close to the experimental value of 0.7 (see Methods). The TSEH is highly heterogenous, with an average C{alpha}-RMSD of 11.3 ± 2.2 Å (see Table 1). The average SASA of the TSEH is 7200 ± 500 Å2, an increase of 41% compared to the native state. In parallel to this increase in SASA, the Rg increases to 14.5 ± 0.8 Å, 12% larger than the native state. The average over all residues (measured experimentally or not) of the calculated {Phi}-values, <{Phi}cal>, is ~0.18, which is close to the experimental average <{Phi}exp> = 0.24. A correlation coefficient of 0.99 between the experimental and the calculated {Phi}-values indicated that the procedure used to determine the transition state ensemble was self-consistent. Fig. 2 A shows the {Phi}-values for individual residues. The red curve is the average calculated value, whereas the blue dashed boundaries represent mean ±1 SD, respectively. The area between the dashed lines in the plot, therefore, characterizes the level of confidence with which {Phi}-values can be determined in the simulations. As expected from the low values of <{Phi}cal> and ßT, the TSEH is rather heterogeneous and contains several nonnative interactions, except in the region of residues L58–V71 (E strand, E-F loop, and F strand) and residue I49 (D strand), which have higher experimental {Phi}-values (>0.46) than the other residues. The residues in the G strand are also predicted to be rather structured, although reliable experimental {Phi}-values in this strand could not be measured. Thus we do not have sufficient confidence in these results for the G strand. (Note that in the absence of {Phi}-values to constrain the system away from the native state the force field may tend to maintain native contacts). The standard deviation of the calculated {Phi}-values is large in most regions of the protein as a consequence of the heterogeneous nature of TSEH.


Figure 2
View larger version (29K):
[in this window]
[in a new window]
 
FIGURE 2  Experimental and calculated {Phi}-values for the TSEH of TI I27. (A) TSEH determined using the initial set of 19 {Phi}exp. The diamonds show the experimental {Phi}-values. The red line is the average calculated {Phi}-value, and the blue dashed lines represent the average ±1 SD within the TSEH. The size of the white area between the dashed lines represents the confidence with which the {Phi}-values are predicted at any position where no experimental {Phi}-values are available. Most of the regions of the protein are highly heterogeneous. (B) Comparison of {Phi}-values calculated using either all 19 experimental {Phi}-values (black curve), or subset L58, L60, C63, and M67 (red curve).

 
The average C{alpha}-RMSD between pairs of structures is 12.5 Å indicating a substantial structural heterogeneity in the transition state ensemble. To analyze the structures in more detail, they were clustered together using a 3 Å cutoff (38Go). We obtained 430 groups, of which only nine have more than 20 members, as expected for a heterogeneous ensemble of structures. The cluster centers have very different properties, in particular {Delta}SASA ranges from +22% to +74% and {Delta}Rg ranges from 0% to +39%. Yet, all these structures fulfill the restraints of the experimental {Phi}-values. Representative cluster centers are shown in Fig. 3.


Figure 3
View larger version (56K):
[in this window]
[in a new window]
 
FIGURE 3  Representative structures of the transition state ensembles of TSEH and TSEL. The structures that make up the TSEH were clustered using a RMSD of 3 Å as a cutoff. (A) Stereographic view of four representative structures (cluster centers) of the TSEH (blue, thin lines) and the native state (red, thick line). The structures are significantly unfolded and less structured than the native state. (B) Stereographic view of the eight highest populated representative structures (cluster centers) of the TSEL (blue, thin lines) and the native state (red, thick line). The structures are very nativelike compared to those of TSEH. The figures were drawn with the program MOLMOL (77Go).

 
Secondary structure and hydrogen bond patterns
To analyze the secondary structure content of the TSEH in detail we characterized the hydrogen bond patterns and the secondary structure content of the nine highest populated cluster centers—each of them with more than 20 members. This analysis (Fig. 4) suggests that TSEH consists of a partially formed B-E-D ß-sheet and a partially formed C-F(-G) ß-sheet. In the B-E-D sheet the contacts between strands E and D are more fully formed with an average of three native H-bonds between them. The B strand is attached more loosely (an average of one native H-bond). A number of nonnative H-bonds between the strands are also observed between stands B and E. The ß-sheets are held together by the intersheet loops, especially the E-F loop (see also "The structure of the loop regions" in the Discussion section).


Figure 4
View larger version (21K):
[in this window]
[in a new window]
 
FIGURE 4  The secondary structure content of the TSEs. Probability of ß-strand formation (in %) for TSEH (dashed line) and TSEL (solid line). The secondary structure was calculated using the program DSSPcont (78Go). The F and G strands appear to have the highest probability in TSEH but the results concerning the G strand may depend on the force field used because reliable experimental {Phi}-values in this strand could not be measured. Thus it is not possible to benchmark calculated {Phi}-values in this strand. The E and D strands are predicted to have a significant probability of formation, whereas the A, A', and B strands are relatively unlikely to be formed. In comparison to TSEH, in TSEL all strands are almost fully formed.

 
Network of interactions in the transition state
To analyze the network of interactions in detail we calculated the pairwise interaction energies between all possible residue pairs for both transition states and for the native state. These energy maps are shown in Fig. 5, where the native states are represented above the main diagonal and the transition states below. Although the TSEH is rather unstructured, the overall topology of the network of interactions is still nativelike. Most of the nonnative interactions are weak compared to the interactions that establish the native topology of the Ig-like fold. From an analysis of hydrogen bonding patterns we proposed that in the B-E-D sheet contacts are more fully formed between strands E and D than between strands B and E. This result is also illustrated by the interaction map. Interactions between the E and D strands are strong, whereas interactions between the B and E strands are formed but significantly weaker. Weak C-F interactions are evident from the interaction maps. Again we ignore the G-strand contacts because we have no confidence that the simulations are correct. It is interesting to note that in the TSEH most of the short-range interactions are still strong, but the long-range interactions tend to be considerably weakened. Most of the A-B and A'-B interactions are lost, confirming the unstructured nature of A and A'.


Figure 5
View larger version (59K):
[in this window]
[in a new window]
 
FIGURE 5  Interactions in the TSEs and equilibrated native states. Energy maps in the equilibrated native state (upper part of the matrix) and the transition state ensemble (lower part of the matrix) for (A) TSEH and (B) TSEL. The energy map is a graphical representation of interaction matrices where the element i, j is the EEF1 (69Go) interaction energy between residues i and j, averaged in the TSE and in the equilibrated native state. Energies are in kcal/mol.

 
Key residue simulations
To determine if at extreme concentrations of denaturant the protein folds via a mechanism that involves key residues, we investigated whether a set of key residues according to Vendruscolo et al. (3Go) could be defined. Predictions made by using the network betweenness, the so-called B-score (39Go), indicated that residues F21, W34, L58, L60, M67, V71, F73, and L84 are the most central residues in the network of interactions in the TSEH. But in key residue simulations (i.e., using only these residues as restraints) the {Phi}-value profile (the whole set of 19 experimental {Phi}-values) could not be reproduced accurately (data not shown). In contrast the set consisting of residues L58, L60, C63, and M67 can reproduce the {Phi}-value profile with high confidence. The coefficient of correlation to the full set of experimental {Phi}-values (excluding the key residues used as restraints) is 0.7 (and the coefficient of correlation including the key residues is 0.9). The agreement with the full set of simulated {Phi}-values is also good (correlation coefficients of 0.8 and 0.9 without and with the key residues, respectively). Major differences between {Phi}cal from the key residue simulation and {Phi}cal calculated from a simulation using the full set of 19 experimental {Phi}-values appear in the B strand (see Fig. 2 B). Additional properties such as SASA, Rg, and RMSD can be determined with confidence (see Table 1). Two other simulations were performed using different sets of four residues each excluding one of the key residues identified above, but including another residue with a high {Phi}-value (58,60, 63 plus 49, and 60,63,67 plus 71). In these cases the correlation with the experimental {Phi}-values was low (0.3 and 0.5, respectively).

Ensemble averaged molecular dynamics simulations
The use of ensemble averaged molecular dynamics simulations allows comparison of the distribution of probabilities of {Phi}-values for each residue. The results are shown in Fig. 6 A for both TSEL and TSEH.


Figure 6
View larger version (39K):
[in this window]
[in a new window]
 
FIGURE 6  Distribution of {Phi}-value probabilities in the ensembles. (A) Probability distributions for {Phi}-values were calculated for single (sr) and multi- (mr) replica systems and for both TSE. The position of the strands is indicated in the first row. The second and third rows represent single and multireplica systems for TSL, respectively. The fourth and fifth rows represent single and multireplica systems for TSH, respectively. Residues with a nonunimodal distribution of probabilities for {Phi}-values are highlighted. All other residues have a unimodal distribution. Interestingly there are more residues in TSL with a nonunimodal distribution of probabilities for {Phi}-values than in TSH. (B) {Phi}I49 and {Phi}V71 were calculated for each structure of the ensemble (two-replica system). They are anticorrelated with a correlation coefficient of –0.75; 80% of the structures are in one of the ellipsoids. This clearly shows that in most cases only I49 or V71 play an important role in folding.

 
Several residues have a bimodal distribution of {Phi}-values in TSEH. Of particular interest are the cases of residues I49 and V71, both of which could be considered key according to their B-score (39Go). A simulation with the key residues L58, L60, C63, and M67 plus I49 and V71 produced similar results to the simulation that included L58, L60, C63, and M67 alone, suggesting that I49 and V71 do not belong to the key residue set. Both I49 and V71 have a unimodal distribution of {Phi}-values in the single-replica system, with a maximum at ~0.4. In the multireplica simulations, however, they exhibit a bimodal distribution of {Phi}-values, with two maxima at 0.3 and 0.7. This finding supports the idea that I49 and V71 are not key residues, and they can either participate in the folding process or not; hence they are not crucial for the successful establishment of the transition state (32Go,33Go,40Go). To demonstrate this behavior of I49 and V71 in more detail we computed the {Phi}-values of I49 and V71 for each structure in the TSE (2 replica system, Fig. 6 B). Most of the {Phi}I49/{Phi}V71 pairs either have low {Phi}I49 and high {Phi}V71 or vice versa. The structures that correspond to the data in the ellipsoids represent 75% of the whole ensemble. As expected, {Phi}I49 and {Phi}V71 are anticorrelated with a correlation coefficient of –0.75. This result was not observed using single replica simulations and demonstrates the advantages of ensemble averaged molecular dynamics simulations. Importantly, the key residues that are important in the formation of TSEH show a unimodal distribution of {Phi}-values even in the four-replica simulations.

In TSEL several of the {Phi}-values of the residues in the region of strands A and A' have bimodal distributions, at least in the multireplica simulations. This result is in agreement with the finding that the A and A' strands are much less structured than the B, C, D, E, and F strands in TSEL (35Go,41Go). They were found to be in a variety of conformations in TSEL and participate in several nonnative interactions. Most of the bimodal distributions of {Phi}-values for these residues have maxima at low values ({Phi} close to 0) and at high values ({Phi} ≥ 0.6). Thus they are either unstructured in TSEL or are already in their nativelike conformation. This result was observed for nine of the 15 residues in strands A and A' in the multireplica case but only for three in the single-replica simulations. Another region with many bimodal distributions of {Phi}-values is the C-D loop (residues 37–46) with five bimodal distributions. Also in these cases these residues are either unstructured with a maximum close to {Phi} = 0 or they are already significantly structured with {Phi} ≥ 0.6. Most of these residues exhibit native contacts with the large W34 in the hydrophobic nucleus of TSEL. Thus, most residues in the C-D loop are only marginally involved in the folding nucleus and participate to a different extent in each folding event (32Go,33Go,40Go). This situation is comparable to the one observed in the similar simulations of the first transition state for folding of barnase in which an {alpha}-helix is formed or not (42Go). Importantly, residues 41 (C-D loop) and 76 (F-G loop) are very interesting, because their {Phi}-values are highly anticorrelated, with a correlation coefficient of –0.80. Thus, either 41 exhibits a high and 76 a low {Phi}-value or vice versa. Both residues pack into the core and are on opposite sites of it, hence either the C-D loop or the F-G loop pack onto the core to facilitate folding. Interestingly, most of the bimodal distributions of {Phi}-value probabilities in the C-D loop were also sampled by the single-replica system.


    DISCUSSION
 TOP
 ABSTRACT
 INTRODUCTION
 METHODS
 RESULTS
 DISCUSSION
 CONCLUSIONS
 ACKNOWLEDGEMENTS
 REFERENCES
 
Comparison of the two TSEs of the parallel pathways
The behavior of TI I27 is very unusual, as it has been shown experimentally to fold via two different pathways and they have both been characterized by {Phi}-value analysis (7Go). In the following section we will investigate these remarkable differences in detail and will show that TI I27 can fold both via a polarized transition state and via a more common nucleation-condensation mechanism.

Overall properties
The folding transition state of pathway L is very nativelike compared to the transition state for pathway H, as indicated by the comparison of the average experimental {Phi}-values (0.24 and 0.57 for TSH and TSL, respectively) and the ßT (0.74 and 0.95 for TSH and TSL, respectively) (7Go,41Go). Crucially, the TSH is not just an expanded version of TSL, as the patterns of {Phi}-values are very different.

The general properties of the TSEH and TSEL are reported in Table 1. To simplify the comparison between the ensembles we also determined TSEL using ensemble-averaged simulations. Although in TSEL the protein has a C{alpha}-RMSD of 4.2 Å from the native state and the SASA increases by 8%, other changes relative to the native state are only marginal. For example, the radius of gyration decreases slightly, a result that may indicate that the TSEL has a more spherical structure than the native state (35Go). In comparison TSEH is much more unstructured, with an increase in SASA of 41% and an increase in Rg of 12%. In addition, clustering with a 3 Å threshold yielded 430 cluster centers whereas TSEL yielded only seven. The cluster centers of TSEL have very nativelike properties, whereas the TSEH cluster centers are very different in terms of C{alpha}-RMSD, Rg, and SASA and are more heterogeneous, as judged by the significantly higher number of cluster centers (see Fig. 3).

Interactions and secondary structure
The secondary structure content of the TSEL has already been analyzed in detail (35Go). It is very nativelike and almost all strands are fully formed, except for the A, A', and G strands. As well as the B, C, E, and F strands, the D strand, which is not part of the nucleus, is also fully structured. In the TSEH only the D, E, and F strands are significantly structured (see also Fig. 4).

In comparison to TSEH, the interactions of TSEL are very nativelike (see Fig. 5). In TSEL, most of the long-range interactions are defined as clearly as the short-range interactions in the energy map, as expected in a nucleation-condensation mechanism (35Go,43Go).

Compatibility between loosely structured individual conformers and the native topology indicated by the energy maps
According to the analysis of the energy maps (Fig. 5), in the TSEH, native interactions are still dominant over nonnative ones and the overall topology of the Ig-like fold is present. This result is in apparent contrast with a direct examination of individual structures. Although these structures show partial formation of secondary structure, they are relatively heterogeneous and contain many nonnative contacts. Our calculations show that all these observations are indeed consistent. First, individual structures are high-resolution descriptions of the protein whereas energy maps emphasize the general topology of the molecule. Second, whereas individual structures are snapshots of the TSE, energy maps are obtained from an averaging procedure over the whole ensemble and therefore they show the interactions that are most likely to be formed. Lindorff-Larsen et al. (30Go) showed that TS structures could be compared to structures in the SCOP database (44Go) to determine whether nativelike topology is already established in the TSE. Following this procedure, 770 protein domains of a length ranging from 71 to 110 residues were extracted from the SCOP database (44Go) to represent protein domains from a variety of folds. Using the DALI method (45Go) we classified the structures of the TSEH. To determine if individual members of the TSEH can be classified as having Ig-like topology we aligned the 430 cluster centers that represent the TSEH to each of the 770 protein domains from the SCOP database (44Go). Analyzing the best hits from the alignments revealed that 61% of the cluster centers (which represent 55% of the total ensemble) can be classified as having the Ig-like fold (SCOP domain b.1). The quality of a DALI alignment (45Go) can be described using a Z-score; the higher the Z-score, the better the alignment and thus the more significant is the result. It has been suggested that a Z-score between 2 and 6 is the borderline for structural similarity between native-state structures (46Go). For the 61% of the structures for which the Ig-like fold was the best hit a Z-score between 1.0 and 5.0 was obtained, thus the similarity is universally low. In contrast all seven cluster centers and thus the whole TSEL ensemble match the Ig-like fold best, with Z-scores between 5.3 and 8.9. Hence they are also more similar to the Ig-fold than the 55% of TSEH that match best the Ig-like fold. Where a member of the TSEH could not be aligned to the topology of the Ig-like fold the structural similarity to any other "best hit" was very low (Z-score < 1.0); i.e., there was no significant match to any other fold. This means that, as we have seen in the energy maps, the overall topology of the Ig-fold is established to some extent in TSEH. This is consistent with the current view that the native topology is already established in the transition state, e.g., (22Go,23Go,30Go,37Go,47Go–51Go).

Key residue simulations
The folding nucleus of TSEL has recently been analyzed in detail (35Go). F21, I23, W34, H56, L58, V71, and F73 in the B, C, E, and F strands are key residues for folding along pathway L. The large hydrophobic W34 forms the center of the folding nucleus and the remaining six residues pack onto it. Consistent with these results, the topology of the native state can be established by using the {Phi}-values of every subset of four residues that included one residue from each strand as restraints (35Go). W34 is the most important residue in the nucleus of TSEL; key residue simulations with all key residues except W34 are unable to reproduce TSEL accurately. Thus in TSL there is a well-defined folding nucleus.

In this work, four residues, L58, L60, C63, and M67, were shown to play a key role in folding via pathway H. It is important to note that the existence of key residues does not necessarily imply the existence of a folding nucleus. According to Vendruscolo et al. key residues are defined as residues whose interactions alone are sufficient to form the topology of the transition state and to predict all other {Phi}-values (3Go).

The {Phi}-value pattern (Fig. 1 B) is very different from the pattern expected for a classical nucleation-condensation mechanism, as found for example in TSL of TI I27 and another Ig-like protein TNfn3 (Fig. 1, C and D). TSEH is not dominated by long-range interactions but by local interactions, and most of the high {Phi}-values are colocalized. We determined the interactions and the distances between the key residues in the TSE. The average distance between key residue Cß-atoms is 9 Å, and there are only four intrakey residue contacts. Thus we suggest that the protein does not fold via a conventional nucleation-condensation mechanism in pathway H.

We determined the packing density of the system of key residues for both TSEs by calculating the solvent-accessible surface area for each key residue (see Table 2). The solvent-accessible surface area is much lower for all the key residues in the TSEL (the packing density is much larger). Representative structures of both sets of key residues are shown in Fig. 7.


View this table:
[in this window]
[in a new window]
 
TABLE 2  Solvent-accessible surface area of key residues in the transition state

 

Figure 7
View larger version (70K):
[in this window]
[in a new window]
 
FIGURE 7  Comparison of the packing density in the folding cores of TI I27 for both pathways. The packing density is much higher in TSEL than in TSEH, as illustrated by the CPK representations of the folding core of both transition states. Only the key residues are shown and they are colored according to the ß-strand in which they are located. (A) Native state and (B) highest populated cluster center of the TSE of TSL: I23 (yellow), W34 (red), L58 (green), and F73 (blue). For comparison only, one key residue from each strand is shown. (C) Native state and (D) highest populated cluster center of the TSE of TSH: L58, (green), L60 (green), C63 (green), and M67 (brown). The pictures were generated using the program INSIGHTII.

 
Interestingly, mutations that destabilize residues in the E-F loop, which is critical in the formation of structure in pathway H (C62A and M67A) fold only by pathway L under all experimental conditions whereas mutations that significantly destabilize the folding nucleus of pathway L (F21A, I23A, H56A, and F73L) reach pathway H at significantly lower concentrations of denaturant than wild-type. This supports the observations that are made here of the relative importance of these key residues in the different folding pathways.

The structure of the loop regions
We have previously analyzed the behavior of the two loops crossing from one ß-sheet to the other for TSEL in detail (35Go). The E-F loop connects the E strand with the F strand and the B-C loop connects the B strand with the C strand. In TSEL of TI I27 the nucleus lies closer to the B-C loop, therefore this loop is more structured. Thus, the relative position of the nucleus constrains the intersheet loops to different extents (52Go).

We studied the structural variability of the B-C and the E-F loops by isolating them from the rest of the protein and by clustering similar structures together using a 1-Å cutoff. In TSEL the average intraloop C{alpha}-RMSD between two structures is 0.6 and 1.0 Å for the B-C and E-F loop, respectively, thus the B-C loop is more structured (35Go). In TSEH both loops are more unstructured but interestingly the E-F loop is now the more structured of the two. The average intraloop C{alpha}-RMSD between two structures is 1.7 and 1.2 Å for the B-C and E-F loop, respectively. The structures of the loops are also more heterogeneous in TSEH; the distributions are much broader for this unstructured transition state (data not shown).

Comparison of TSEH with the transition state of TNfn3
In the "fold approach" proteins with similar structures that are unrelated in sequence or function are analyzed. We have investigated the folding of a number of proteins with Ig-like topology extensively using {Phi}-value analysis (7Go,41Go,52Go,53Go). The TSE of TNfn3 and TSEL were determined using experimental {Phi}-values as restraints in molecular dynamics simulations and compared in detail (35Go,36Go). These studies revealed that both TNfn3 and TI I27 (TSL) fold by a nucleation-condensation mechanism with structurally equivalent residues forming the folding nucleus. The determination of the second transition state of TI I27 enables us to compare TSH to that of TNfn3.

At a first glance, one might expect TSEH to be very similar to the TSE of TNfn3 in terms of overall structural features. TI I27 and TNfn3 have the same topology, i.e., the Ig-like fold, and about the same number of residues. The average experimental {Phi}-values are very similar (0.24 and 0.28 for TSH and TNfn3, respectively) so are the ßT values (0.74 and 0.7 for TSH and TNfn3, respectively) (7Go,41Go,52Go). The patterns of {Phi}-values are, however, very different, because the {Phi}-value pattern of TNfn3 appears simply to be a weaker version of the TSEL {Phi}-value pattern, indicating a nucleation-condensation mechanism. In contrast TSEH is a polarized transition state.

The general properties of the TSE of TNfn3 as presented in Table 1 are very similar to those of TSEH, consistent with the similar values of <{Phi}> and ßT. The change in SASA and Rg relative to the native state and the C{alpha}-RMSD to the native state are remarkably similar. We also clustered the 4000 structures that we determined to represent the TSE of TNfn3 together using a 3-Å cutoff, obtaining 430 cluster centers. This result is identical to that obtained from clustering in the TSE of TSEH, which also gave 430 clusters. A closer look, however, at the properties of the TSE of TNfn3 and TSEH confirms the suggestion that they are very different. Molecular dynamics simulations of TNfn3 restrained using the {Phi}-values of just the four residues I19, Y35, I58, and V69 suggested that these are the residues that interact in the core to form the folding nucleus (36Go). This nucleation-condensation transition state is stabilized mainly by long-range tertiary interactions. In TSEH the key residues L58, L60, C63, and M67 are localized, as expected in a polarized transition state, and do not interact to form a nucleus. Hence the distribution of structure is very different. Paci et al. showed that the C'-C-F-G sheet is largely ordered in the TSE of TNfn3, but the A-B-E sheet is disordered (36Go). Although the relative position of the B and E strands is established by interactions of residues I49 and I58 with the other nucleus residues, the hydrogen bonding between the two strands is minimal. This situation is very different for the secondary structure in TSEH where only the E-D strands are significantly ordered.

Thus although many of the characteristics of the TSE of TNfn3 and TSEH are similar (for example mean {Phi}-value, ßT, {Delta}SASA, RMSD), the structures in the ensembles are actually remarkably different. These results reflect the different nature of the transition states—a polarized TS in TI I27 and a diffuse TS in TNfn3. The TS of TNfn3 is mainly stabilized by long-range tertiary interactions whereas in the TS of pathway H of TI I27 long-range contacts are significantly weakened. This in turn suggests a different folding mechanism.

Obligate versus critical residues in the folding nucleus
It has been suggested that the folding nucleus can be decomposed into obligate and critical residues (32Go,33Go,40Go). The former are obliged to participate with their interactions so that the folding nucleus can be formed and the topology can be established. These interactions are supported by the critical residues that stabilize the obligate nucleus by packing onto it. In both TSL and TSH we found several residues that participate in folding in only a subset of the TSE. Interestingly, the {Phi}-values of some of these residues are highly anticorrelated, meaning that only one of them supports folding and the other not. In TSH either I49 or V71 pack onto the nucleus to stabilize it and in TSL either 41 or 76. Thus for these residues there are two possible conformations for each transition state; suggesting the existence of "parallel pathways in the parallel pathways". In one of the earliest works describing {Phi}-value analysis Fersht and co-workers suggested that "It is ... possible that a mixture of states, some with structure fully formed and others with the elements of structure completely unfolded is responsible for a fractional value of {Phi}." (54Go). This is what we observe in our simulations.

Diffuse versus polarized transition states
It has been suggested that if there is a high propensity for secondary structure formation, the folding mechanism tends to follow a hierarchical mechanism, such as the diffusion-collision model, as has been shown for En-HD (20Go) and protein G (21Go,55Go). In contrast, if the propensity for formation of secondary structure is low, secondary structure elements will only form when supported by tertiary interactions. The resulting diffuse transition state is characteristic of the nucleation-condensation mechanism, where the formation of a well-defined nucleus is achieved simultaneously with the condensation of the rest of the structure resulting in a nativelike topology (22Go). This type of transition state shows a characteristic pattern of high and low fractional {Phi}-values distributed throughout the molecule (see, for example, Fig. 1, C and D); hence the term "diffuse" transition state. Such a pattern often arises from the presence of a folding nucleus stabilized by long-range tertiary contacts (22Go,23Go,56Go). The majority of proteins that have been studied by {Phi}-value analysis fold via the nucleation-condensation mechanism, e.g., CI2 (23Go,57Go,58Go), the immunity proteins (59Go,60Go), and proteins with the Ig-like fold such as TNfn3 or TI I27 (35Go,36Go,41Go,52Go). In a classical nucleation-condensation mechanism, secondary and tertiary structure are formed concomitantly in the TS in the absence of intermediates, as found for example for hTRF (20Go). Thus, the transition state for folding in a nucleation-condensation pathway is often an expanded version of the native state (6Go,22Go). It is important to note that the term "diffuse" transition state does not necessarily imply a low density packed folding nucleus, but it rather describes a particular type of distribution of contacts. In this work we have shown that the folding nucleus of TSEL in TI I27 is indeed very compact and that it contains predominantly long-range contacts, and thus "high" {Phi}-values are distributed widely.

In contrast the polarized transition state seems to represent a hybrid between nucleation-condensation and diffusion-collision folding mechanisms (20Go,24Go,25Go). High {Phi}-values are not evenly distributed over the molecule but usually clustered at one position, with the rest of the structure exhibiting mainly low {Phi}-values; hence they are also called "localized transition states" (see Fig. 1 B). A polarized TSE is observed in SH3 domains, in which fully established contacts are mostly observed in turns that have to be formed to bring the remaining chain together (26Go–30Go). Several WW domains also seem to fold via a polarized transition state, whereby folding is initiated by loop or hairpin nucleation events (61Go–63Go). A polarized transition state is not necessarily unstructured: Garcia-Mira et al. recently examined the transition state for folding of CspB using {Phi}-value analysis (25Go). CspB folds via a strongly polarized transition state with a ßT of 0.9, thus most regions of the protein are folded in the TS, but the energetic interactions are not yet properly established (25Go). The most polarized {Phi}-value distribution, and thus transition state, has been observed in the circular permutant P13-14 of S6 (6Go). This result is particularly interesting because wild-type S6 usually folds via a nucleation-condensation mechanism (31Go,32Go). In the circular permutant, the folding nucleus is polarized toward the artificially linked C- and N-termini.

Weikl et al. have used a theoretical approach based on the "effective contact order" to compute the rates and routes of folding of two-state proteins (64Go). In CI2, a protein with a typical nucleation-condensation mechanism (diffuse transition state) all major regions are involved in the formation of the rate-limiting cluster. In contrast the src-SH3 domain, which exhibits a polarized transition state, uses only regions with high {Phi}-values to form the rate-limiting cluster. Their results support the discrimination between diffuse and polarized transition states.

We propose that TSEH is a polarized transition state. Polarized transition states usually feature only local formation of secondary structure. The {Phi}-value pattern of TSEH has the typical profile of a polarized transition state; except for I49, all residues outside the region from 58 to 71 are very low. In this transition state, short-range local contacts are dominant. We found secondary structure mainly in the region of high {Phi}-values; most of the remaining regions in the protein are significantly unstructured. The structure distribution in TSEH can be categorized as being anisotropic compared to the isotropic distribution of structure in TSEL.


    CONCLUSIONS
 TOP
 ABSTRACT
 INTRODUCTION
 METHODS
 RESULTS
 DISCUSSION
 CONCLUSIONS
 ACKNOWLEDGEMENTS
 REFERENCES
 
TI I27 can fold via two different pathways. The first is dominant in moderate conditions, such as those found in vivo, and exhibits a nucleation-condensation mechanism as found for most two-state proteins (diffuse TS). In contrast, TI I27 folds via a more polarized transition state at extreme values of denaturant or temperature. Highly destabilizing mutations in the nucleus of pathway L cause the folding mechanism to switch to pathway H. Conversely mutations of the key residues in the E-F loop prevents the switch to pathway H. The ensemble of structures that represent the two transition states for folding of TI I27 enable us to suggest an explanation for the change in mechanism. At near-physiological conditions, nucleation condensation seems to be preferred for proteins with Ig-like fold, despite the higher entropic cost for the protein chain to fold into the highly compact structure of TSEL (41Go,65Go). This is analogous to the observation from simulation that the folded state of a structured peptide is preferentially stabilized over low energy, high entropy alternative, nonnative conformations (66Go). Almost the entire protein chain is involved in the folding process, thus exhibiting a network of long-range tertiary interactions. This highly cooperative folding process ensures that that partly folded species with exposed hydrophobic residues do not form before crossing the top of the free-energy barrier. It has been suggested that this kind of folding mechanism, which involves burial of most hydrophobic groups, reflects the presence of a negative design feature against aggregation (4Go–8Go). The entropic cost of structuring the polypeptide chain is compensated for by burial of the hydrophobic residues. In contrast, at high concentration of denaturant, when the hydrophobic interactions are weakened, to fold efficiently TI I27 can revert to a mechanism with a lower entropic cost and a more heterogeneous transition state, because the propensity to aggregate is lower under these conditions.


    ACKNOWLEDGEMENTS
 TOP
 ABSTRACT
 INTRODUCTION
 METHODS
 RESULTS
 DISCUSSION
 CONCLUSIONS
 ACKNOWLEDGEMENTS
 REFERENCES
 
C.D.G. holds a Wellcome Trust Prize Studentship. E.P. acknowledges financial support from Forschungskredit der Universität Zürich. M.V. is supported by the Royal Society and both M.V. and R.B. by the Leverhulme Trust. J.C. is a Wellcome Trust Senior Research Fellow.


    FOOTNOTES
 
Robert B. Best's current address is Laboratory of Chemical Physics, NIDDK, National Institutes of Health, Bethesda, MD 20892-0520.

Abbreviations used: TS, transition state; TSE, transition state ensemble; TSL, transition state of TI I27 dominant at low concentration of denaturant; TSH, transition state of TI I27 dominant at high concentration of denaturant; TSEL, transition state ensemble of TI I27 dominant at low concentration of denaturant; TSEH, transition state ensemble of TI I27 dominant at high concentration of denaturant; pathway L, folding pathway of TI I27 dominant at low concentration of denaturant; pathway H, folding pathway of TI I27 dominant at high concentration of denaturant; SASA, solvent accessible surface area; Rg, radius of gyration; RMSD, root mean-square deviation; dRMS, distance-based root-mean-square deviation; EEF1, effective energy function; and Ig, immunoglobulin.

Submitted on November 3, 2005; accepted for publication February 14, 2006.


    REFERENCES
 TOP
 ABSTRACT
 INTRODUCTION
 METHODS
 RESULTS
 DISCUSSION
 CONCLUSIONS
 ACKNOWLEDGEMENTS
 REFERENCES
 
1. Fersht, A. R., L. S. Itzhaki, N. Elmasry, J. M. Matthews, and D. E. Otzen. 1994. Single versus parallel pathways of protein-folding and fractional formation of structure in the transition-state. Proc. Natl. Acad. Sci. USA. 91:10426–10429.[Abstract/Free Full Text]

2. Lazaridis, T., and M. Karplus. 1997. "New view" of protein folding reconciled with the old through multiple unfolding simulations. Science. 278:1928–1931.[Abstract/Free Full Text]

3. Vendruscolo, M., E. Paci, C. M. Dobson, and M. Karplus. 2001. Three key residues form a critical contact network in a protein folding transition state. Nature. 409:641–645.[CrossRef][Medline]

4. Davis, R., C. M. Dobson, and M. Vendruscolo. 2002. Determination of the structures of distinct transition state ensembles for a beta-sheet peptide with parallel folding pathways. J. Chem. Phys. 117:9510–9517.[CrossRef]

5. Dobson, C. M. 1999. Protein misfolding, evolution and disease. Trends Biochem. Sci. 24:329–332.[CrossRef][Medline]

6. Lindberg, M., J. Tangrot, and M. Oliveberg. 2002. Complete change of the protein folding transition state upon circular permutation. Nat. Struct. Biol. 9:818–822.[Medline]

7. Wright, C. F., K. Lindorff-Larsen, L. G. Randles, and J. Clarke. 2003. Parallel protein-unfolding pathways revealed and mapped. Nat. Struct. Biol. 10:658–662.[CrossRef][Medline]

8. Wright, C. F., A. Steward, and J. Clarke. 2004. Thermodynamic characterisation of two transition states along parallel protein folding pathways. J. Mol. Biol. 338:445–451.[CrossRef][Medline]

9. Dinner, A. R., A. Sali, L. J. Smith, C. M. Dobson, and M. Karplus. 2000. Understanding protein folding via free-energy surfaces from theory and experiment. Trends Biochem. Sci. 25:331–339.[CrossRef][Medline]

10. Wu, Y., and C. R. Matthews. 2003. Proline replacements and the simplification of the complex, parallel channel folding mechanism for the alpha subunit of Trp synthase, a TIM barrel protein. J. Mol. Biol. 330:1131–1144.[CrossRef][Medline]

11. Wedemeyer, W. J., E. Welker, and H. A. Scheraga. 2002. Proline cis-trans isomerization and protein folding. Biochemistry. 41:14637–14644.[CrossRef][Medline]

12. Wildegger, G., and T. Kiefhaber. 1997. Three-state model for lysozyme folding: triangular folding mechanism with an energetically trapped intermediate. J. Mol. Biol. 270:294–304.[CrossRef][Medline]

13. Zaidi, F. N., U. Nath, and J. B. Udgaonkar. 1997. Multiple intermediates and transition states during protein unfolding. Nat. Struct. Biol. 4:1016–1024.[CrossRef][Medline]

14. Shastry, M. C., and J. B. Udgaonkar. 1995. The folding mechanism of barstar: evidence for multiple pathways and multiple intermediates. J. Mol. Biol. 247:1013–1027.[CrossRef][Medline]

15. Baldwin, R. L. 1997. Com