| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Department of Physics and Astronomy, University of British Columbia, Vancouver, British Columbia, Canada
Correspondence: Address reprint requests to S. S. Plotkin, E-mail: steve{at}physics.ubc.ca.
| ABSTRACT |
|---|
|
|
|---|
| INTRODUCTION |
|---|
|
|
|---|
Native stability may be modified by adjusting temperature T or denaturant concentration c. Many proteins show linearity over the majority of the branches of their Chevron plot, implying a linear dependence of folding and unfolding barriers on denaturant concentration c (Jackson and Fersht, 1991
),
![]() | (1a) |
![]() | (1b) |
> 0 and mU
> 0.
Subtracting Eq. 1b from Eq. 1a, and defining
G
GFU = GU GF and m = mF
+ mU
, we have that
![]() | (2) |
For two-state folders, the kinetically determined m above equals, to good approximation, the thermodynamically determined m-value from relative stabilities.
Applying Kramers rate theory, the log forward folding rate is given by
![]() | (3) |
Eliminating c from Eqs. 2 and 3 gives
![]() | (4) |
ln ko(T). Denaturant concentration does not have a significant effect on the rate at which the system escapes from local traps (at least for those proteins studied). We make this assumption here as well.
Because the prefactor is independent of c, the change in log folding rate with denaturant is directly proportional to the change in barrier with denaturant,
![]() | (5) |
![]() | (6) |
![]() | (7) |
This quantifies the assertion above that log folding rates depend linearly on the relative stability of the products. If we let mU
/m
Q
, Eq. 6 can be rewritten as
![]() | (8) |
Inspection of rate-stability isotherms for several different proteinscytochrome C (cyt-C; Mines et al., 1996
), protein L (Scalley and Baker, 1997
), cspB (Schindler and Schmid, 1996
), N-terminal protein L9 (Kuhlman et al., 1998
), and S6 (Otzen and Oliveberg, 2004
)shows linearity over ranges up to
25 kJ x mol1
10 kBT, indicating large and robust folding barriers, which are substantially larger than the folding barriers seen in many simulations (for example, see Fig. 1).
|
| METHODOLOGY |
|---|
|
|
|---|
The temperature-dependence of the stability is given by the Gibbs-Helmholtz expression (Fersht, 1999
; Jackson and Fersht, 1991
) as
![]() | (9) |
Then at equal stabilities
G(To, co) =
G(T, c),
![]() | (10) |
For two-state folders, the heat-capacity ratio
CPU
/
CP is approximately equal to m-value ratio mU
/m, giving the fractional solvent accessibility of the transition state. We assume this equality here as well, which gives for Eq. 10,
![]() | (11) |
Inserting Eq. 11 into Gibbs-Helmholtz expressions for the barrier heights
GU
at (T, c) and (To, co) gives the change in barrier height at fixed stability,
![]() | (12) |
For changes in temperature of a few degrees, the change in barrier height
'
GU
is only a few percent of the total barrier height, when the rates versus temperature and denaturant are fit to a model to extract thermodynamic parameters, as in Kuhlman et al. (1998)
, Otzen and Oliveberg (2004)
, Scalley and Baker (1997)
, and Schindler and Schmid (1996)
. We used Eq. 12 for the change in barrier height when thermodynamic data were available. For the case of cyt-C we set
'
G = 0.
The rates for pairs of states at the same stability
G are given from Eq. 3 as
![]() | (13a) |
![]() | (13b) |
Random energy model for the temperature-dependent prefactor
At the mean field level for a landscape of uncorrelated states (random energy model or REM), the temperature-dependence of the prefactor in Eq. 3 is super-Arrhenius (Bryngelson and Wolynes, 1989
; Onuchic et al., 1997
). Moreover, the prefactor goes as the reciprocal of the viscous friction coefficient (Hanggi et al., 1990
; Klimov and Thirumalai, 1997
; Socci et al., 1996
), so the log prefactors at (To, co) and (T, c) may be written as
![]() | (14a) |
![]() | (14b) |
To compare rate theories with experimental data we must introduce a fundamental timescale or rate constant koo, which is then modified by barriers representing the ruggedness of the energy landscape. Rates for short loop closure are
2 x 107 s1 (Lapidus et al., 2000
), comparable to helix formation rates of
107 s1, and somewhat faster than rates of hairpin formation
106 s1 (Eaton et al., 2000
). Prefactors obtained from plots of experimental rate versus powers of chain length are of order µs (Li et al., 2004
); however, these implicitly include any effects due to ruggedness. We take 107 s1 as an estimate of the fastest local rate. Since
10100 loops and/or secondary structural elements exist in a protein, we then take koo = 109 s1. This estimate for koo may appear somewhat large; we will see later that smaller estimates for koo give smaller estimates for inferred folding barriers. We use the known temperature dependence of the viscosity in water (CRC, 2003
). The quantity
2 measures the ruggedness of the energy landscape. It may be eliminated from Eqs. 14a and 14b to give an equation relating the prefactors as
![]() | (15) |
Equations 13a, 13b, and 15 constitute a system of three linear equations for three unknowns:
GU
(To, co), ln ko(To), and ln ko(T), which can be solved analytically at any given stability, from linear fits to the log rate-stability data.
| RESULTS |
|---|
|
|
|---|
|
|
30 kJ/mol plus whatever the barrier was at conditions of zero denaturant.
The slope 
GU
/
G
0.8 is also larger than its empirical value of mU
/m
0.4 (Mines et al., 1996
), thus the barriers vanish at weaker stabilities than the wild-type protein. This indicates a breakdown in the validity of the theory at higher stabilities (larger
G).
There are two parameters in the theory for which we have put in approximate values: the value of the attempt frequency koo = 109 s1, and the value of
'
GU
, which we have set to zero for cytochrome C in the absence of an empirically determined value. Increasing koo or decreasing
'
GU
raises barriers, but does not change the slope 
GU
/
G. The value of
G where the barrier vanishes linearly decreases as
'
GU
is decreased below zero, with the barrier vanishing at the stability of the wild-type when
'
GU
is
1.6 kJ/mol. This is not an unreasonable number compared to experimental numbers for other proteins (see below); however, it is somewhat disconcerting that barrier heights are such a strong function of the barrier change
'
GU
. We will see later that this sensitivity is not present when a correlated landscape model is used for the prefactor.
Fig. 2 also shows that at least for the REM approximation it is important to account for changes in the viscosity of the solution with temperature, as the barrier substantially decreases when the viscosity is held constant versus temperature.
Equations 14a or 14b may now be solved for
, giving a number
15 kJ/mol, that only weakly depends on stability
G or barrier change
'
GU
. Estimating the chain conformational entropy as
100 kB (D'Aquino et al., 1996
; Leach et al., 1966
), we can give an estimate for the glass temperature TG for this system,
![]() | (16) |
150 K, giving T/TG
2.0 at 296 K.
|
![]() | (17a) |
![]() | (17b) |
Here S
is the chain entropy at the transition state, and
and ß are parameters measuring the mismatch between entropy and energy giving the typical free energy barrier governing trap escape. The values for a bulk polymer
0.5, ß
1.8 are used below (Plotkin and Onuchic, 2002a
,b
; Wang et al., 1997
). The temperature TG was adjusted to the value that reproduced the experimentally determined slope of barriers versus stability, mU
/m. In Table 1 this number is compared to the value of TG that emerges from the REM analysis. A mismatch of these two values may indicate a breakdown of the REM approximation for states in determining prefactors, i.e., a breakdown in the validity of Eqs. 14a and 14b. For cyt-C the value of TG giving the correct slope is
1.2 kJ/mol, versus 1.0 kJ/mol from the REM analysis.
The entropy S
may be eliminated from Eqs. 17a and 17b, giving an equation that relates the prefactors, which replaces Eq. 15,
![]() | (18) |
Equations 13a, 13b, and 18 again define a system of three linear equations for three unknowns:
GU
(To, co), ln ko(To), and ln ko(T), which may be solved analytically. The results are shown in Fig. 4.
|
22 kBT300K, and prefactors are almost unactivated.
The REM value of TG resulted from approximating a value of 100 kB for the chain entropy So, so it is feasible that this estimate for the REM TG could differ from the TG that gives the correct mU
/m. The parameters
and ß could, in principle, have been adjusted to best match the experimental slope; however, it can be shown that this results in the same solution of Eqs. 13a, 13b, and 18 as that determined by varying TG.
In contrast to the REM approximation, the effects of the temperature dependence of viscosity were not significant here (Fig. 4). Nor were there any significant effects due to barrier height differenceas
'
GU
changed from 2 kJ/mol to 0 kJ/mol, the barrier changed by <2%. The effects due to TG are modest as well: over the range of TG values in Fig. 3 B, the barrier height changed by <15%. Lastly, the prefactors of the correlated landscape model are nearly constant over the range of experimental stabilities (Fig. 4), consistent with empirical observations (see the comments below Eq. 4).
Equation 17a or 17b may now be solved for S
as a check, giving S
40 kB, or
40% of the unfolded chain entropy assumed in finding the REM TG. Alternatively, we can estimate the unfolded entropy So from the value of S
as S
(1 mU
/m)So, then Eq. 16 gives
14 kJ/mol. Since the variances of individual residues add to give
2,
2
N(1 mU
/m)b2, where b is a non-native energy scale per residue, here as
0.7 kBT300.
Fig. 5 shows that the inferred barriers and prefactors increase as the value of the bare reconfiguration rate koo increases. The prefactor ln ko(To) closely follows the bare reconfiguration rate ln koo; i.e., they are approximately equal. The barriers at the transition midpoint
and at the stability of the wild-type protein
increase linearly, as
2kBTo ln koo.
|
1, ß
2, S
So. Values obtained tended to be bracketed by the REM and correlated models.
For NTL9, the solution of the REM gave a TG that monotonically decreased from a value of 0.4 at the stability of the wild-type protein, to zero at a stability of
11 kJ/mol. Similarly, the prefactor monotonically increases from 108 s1 at the stability of the wild-type to 1010 s1 at zero stability. We note that these problems are not present if the stability difference
'
GU
is set to zero, if the prefactor is two or more orders-of-magnitude slower, or if the temperature dependence of the viscosity is neglected. We take this sensitivity as a shortcoming of the procedure of rigorously demanding that the landscape theory fit to a limited subset of the experimental data. In this sense, a best (but not exact) fit to experimental rate surfaces as a function of both T and c as in Kuhlman et al. (1998)
, Otzen and Oliveberg (2004)
, Scalley and Baker (1997)
, and Schindler and Schmid (1996)
, is likely to give more accurate numbers. Likewise in the correlated model for NTL9, the prefactor increased from
108 s1 at the stability of the wild-type to unphysical values at zero stability. A similar situation exists in the REM recipe for protein S6; however, it is resolved in the correlated landscape model for that protein.
CspB showed some difficulties that arose from its unusually late transition state (mU
/m
0.9) (Perl et al., 2002
). The parameter TG in the correlated model could not be adjusted to reproduce the high slope of barrier versus stability, without giving negative barriers. Again this may be an artifact of the exact fitting method mentioned above, i.e., more experimental data may also be needed to obtain more accurate numbers, or it may indicate that a simple mean field prefactor does not fully adequately describe the folding dynamics of this protein. In this case we took the temperature TG = 1.81 kJ/mol that induced the barrier to vanish at the stability of the wild-type protein (while acknowledging that other fits give large barriers (Perl et al., 2002
)). This has a steep barrier-stability curve, with slope mU
/m = 0.8 (as opposed to 0.9 observed empirically), very small barrier (7 kJ/mol at zero stability), and rugged landscape with very slow prefactor (
102 s1). Such small barriers are consistent with estimates taken from simulations using C
-models (Shea and Brooks, 2001
).
The Arrhenius model generally admits no solution
A model often proposed for the prefactor assumes an Arrhenius temperature-dependence with single activation energy EA, so that Eqs. 14a and 14b are replaced by
![]() | (19a) |
![]() | (19b) |
![]() | (20) |
Eliminating
GU
from Eqs. 13a and 13b gives another equation relating the prefactors:
![]() | (21) |
![]() | (22) |
A geometric analog may be helpful in understanding the situation. The solution to three equations in three variables is equivalent to finding the point where three planes intersect. Letting
![]() |
![]() |
![]() |
![]() | (23a) |
![]() | (23b) |
![]() | (23c) |
![]() |
![]() |
![]() |
B in general, Eqs. 23a and 23b depict two parallel planes. Thus there is no point of intersection and the system of equations is ill-posed. For the special case of A = B there is a whole family of solutions consistent with the rate equations, but as mentioned above this scenario can only hold under very special circumstances. | CONCLUSIONS AND DISCUSSION |
|---|
|
|
|---|
The numerical values of the barriers obtained from the above recipes should be taken with a grain of salt; however, it consistently emerged that folding barriers were large (except for CspB): the average barrier at the transition midpoint for the REM analysis is
19 kBT, and the corresponding barrier in the correlated model is
18 kBT. If CspB is omitted, the barriers are 21 kBT and 22 kBT, respectively. Wild-type S6, a protein known to fold very cooperatively (Lindberg et al., 2002
), had the highest barriers.
With the exception of CspB, the prefactors in the correlated model tended to be quite highapproximately the bare reconfiguration rate for the whole protein (109 s1). The folding barrier obtained from the recipe decreases as estimates for the bare reconfiguration rate decrease (Fig. 5). The prefactors from the REM recipe varied considerably.
All of the proteins analyzed here are considered two-state folders, so we would expect a Kramers theory to describe them. In lower temperature regimes the distribution of first passage times may be more relevant to study (Plotkin and Onuchic, 2002b
; Zhou et al., 2003
).
We found that in practice it was quite important to have accurate fits for the empirical rate-stability curves. For example, as temperature increased, the slope of the log rate versus stability curve had to remain approximately constant or tend to increase, to obtain reasonable solutions of the rate equations. Otherwise we found an unphysical situation where barriers did not increase as stability decreased. This sensitivity to the experimental data may favor a less stringent fit to the experimental constraints.
In fact, reflection on the procedure raises a general issue on the rigorous application of experimental constraints to energy landscape theory. For example, if we were to add data at a third temperature T1, two new equations would be introduced according to the recipeone Kramers rate equation and one landscape equation for the prefactor, but only one new variable is introducedthe prefactor ln ko(T1). The system becomes overdetermined. Demanding equality rather than a best fit at several temperatures becomes too stringent a constraint on the theory, as long as the parameters in the theory (e.g.,
2 or EA) are fixed. The more temperatures used, the more variables must be introduced into the theory, or the parameters must themselves become temperature-dependent. Nevertheless, the fact that the Arrhenius activation model fails in general to provide a solution for even two temperatures (two data points) should probably be seen as evidence against its strict applicability.
A perhaps more viable method would be to fit several temperatures with functional forms such as Eqs. 14a, 17a, or 19a to extract parameters such as
2 and EA. The difficulty in previous fits to data has been in the separation of EA and the activation enthalpy
HU
(Scalley and Baker, 1997
). One can ask which temperature dependence (EA/T or
2/T2) gives the best fit to the data, but there is not yet enough accurate data to distinguish between the two scenarios (Kuhlman et al., 1998
; Scalley and Baker, 1997
) by this method. However, the Arrhenius model becomes severely restricted by applying experimental constraints rigorously at two temperatures and denaturant concentrations, at the same stability. Because the activation energy in the prefactor can be absorbed into the enthalpic part of the barrier, and only the entropic part of the barrier is relevant in determining rate differences at fixed stability (by Eq. 12), the activation energy becomes irrelevant, and the difference in rates must then be due to quantities independent of denaturant concentration (entropic part of the barrier, temperature-dependent viscosity, etc.). All rate-stability curves for a given protein must be exactly parallel in the Arrhenius modela situation not observed empirically.
Topological features of the native structure have been neglected in the rate theory. Including polymer physics into the theoretical model (Plotkin and Onuchic, 2000
; Portman et al., 2001
; Shoemaker et al., 1999
) may also eliminate some of the sensitivity of the theoretically derived values in Table 1 on the experimental data.
Other methods have been used to estimate barrier heights. Adding a three-body contribution to a pairwise-interacting energy function to give best agreement with experimental
-values, a barrier height for protein L of
16 kJ/mol was obtained (Ejtehadi et al., 2004
). Other proteins such as FKBP and CI2 had larger barriers of 25 kJ/mol and 42 kJ/mol, respectively (Ejtehadi et al., 2004
). The large barriers observed here also suggest that many-body interactions may be playing a significant role in the energy function. A variational theory for the free energy surface of
-repressor gave a barrier of
12 kJ/mol (Portman et al., 2001
). All-atom simulations of a three-helix bundle fragment of protein A in explicit water gave barrier heights
17 kJ/mol at the transition midpoint (Garcia and Onuchic, 2003
). Applying Kramers theory with an experimentally determined estimate for the prefactor gave an estimate for the free energy barrier of
18 kJ/mol for the cold shock protein CspTm (Schuler et al., 2002
). An analysis which took prefactors from experimental data, along with a thermodynamic analysis to extract enthalpic and entropic contributions to the barrier, gave typical barrier heights of
30 kJ/mol for the proteins analyzed (Akmal and Munoz, 2004
). However, these last two methods found barrier heights under conditions of zero denaturantthe barrier heights at zero stability would likely be significantly higher. For example, the average
(mU
/m)
G
for the proteins in Table 1 is
17 kJ/mol, to be added to the barrier height at conditions of zero denaturant.
Applying this method to a simulation model, where one knows the answers in advance, provides a good control for the study and is a topic for future work.
| ACKNOWLEDGEMENTS |
|---|
|
|
|---|
S.S.P. acknowledges support from the Natural Sciences and Engineering Research Council and the Canada Research Chairs program.
Submitted on September 9, 2004; accepted for publication March 2, 2005.
| REFERENCES |
|---|
|
|
|---|
Bryngelson, J. D., J. N. Onuchic, N. D. Socci, and P. G. Wolynes. 1995. Funnels, pathways and the energy landscape of protein folding. Proteins. 21:167195.[CrossRef][Medline]
Bryngelson, J. D., and P. G. Wolynes. 1989. Intermediates and barrier crossing in a random energy model (with applications to protein folding). J. Phys. Chem. 93:69026915.[CrossRef]
CRC. 2003. CRC Handbook of Chemistry and Physics, 84th Ed. D.R. Lide, editor. CRC Press, New York.
D'Aquino, J. A., J. Gomez, V. J. Hilser, K. H. Lee, L. M. Amzel, and E. Freire. 1996. The magnitude of the backbone conformational entropy change in protein folding. Proteins Struct. Funct. Gen. 25:143156.
Eaton, W. A., V. Munoz, S. J. Hagen, G. S. Jas, L. J. Lapidus, E. R. Henry, and J. Hofrichter. 2000. Fast kinetics and mechanisms in protein folding. Annu. Rev. Biophys. Biomol. Struct. 29:327359.[CrossRef][Medline]
Ejtehadi, M. R., S. P. Avall, and S. S. Plotkin. 2004. Three-body interactions improve the prediction of rate and mechanism in protein folding models. Proc. Natl. Acad. Sci. USA. 101:1508815093.
Fersht, A. R. 1999. Structure and Mechanism in Protein Science, 1st Ed. W.H. Freeman, New York.
Garcia, A. E., and J. N. Onuchic. 2003. Folding a protein in a computer: an atomic description of the folding/unfolding of protein A. Proc. Natl. Acad. Sci. USA. 100:1389813903.
Hanggi, P., P. Talkner, and M. Borkevec. 1990. Reaction-rate theory: fifty years after Kramers. Rev. Mod. Phys. 62:251341.[CrossRef]
Jackson, S. E., and A. R. Fersht. 1991. Folding of chymotrypsin inhibitor 2. 1. Evidence for a two state transition. Biochemistry. 30:1042810435.[CrossRef][Medline]
Klimov, D. K., and D. Thirumalai. 1997. Viscosity dependence of the folding rates of proteins. Phys. Rev. Lett. 79:317320.[CrossRef]
Kuhlman, B., D. L. Luisi, P. A. Evans, and D. P. Raleigh. 1998. Global analysis of the effects of temperature and denaturant on the folding and unfolding kinetics of the N-terminal domain of the protein l9. J. Mol. Biol. 284:16611670.[CrossRef][Medline]
Lapidus, L. J., W. A. Eaton, and J. Hofrichter. 2000. Measuring the rate of intramolecular contact formation in polypeptides. Proc. Natl. Acad. Sci. USA. 97:72207225.
Leach, S. J., G. Nemethy, and H. A. Scheraga. 1966. Computation of sterically allowed conformations of peptides. Biopolymers. 4:369407.[CrossRef][Medline]
Li, M. S., D. K. Klimov, and D. Thirumalai. 2004. Thermal denaturation and folding rates of single domain proteins: size matters. Polym. 45:573579.[CrossRef]
Lindberg, M., J. Tangrot, and M. Oliveberg. 2002. Complete change of the protein folding transition state upon circular permutation. Nat. Struct. Biol. 9:818822.[Medline]
Mines, G. A., T. Pascher, S. C. L. J. R. Winkler, and H. B. Gray. 1996. Cytochrome-c folding triggered by electron transfer. Chem. Biol. 3:491497.[CrossRef][Medline]
Onuchic, J. N., Z. Luthey-Schulten, and P. G. Wolynes. 1997. Theory of protein folding: the energy landscape perspective. Annu. Rev. Phys. Chem. 48:545600.[CrossRef][Medline]
Otzen, D. E., and M. Oliveberg. 2004. Correspondence between anomalous m- and
cp-values in protein folding. Protein Sci. 13:32533263.
Perl, D., M. Jacob, M. Bano, M. Stupak, M. Antalik, and F. X. Schmid. 2002. Thermodynamics of a diffusional protein folding reaction. Biophys. Chem. 96:173190.[CrossRef][Medline]
Plotkin, S. S., and J. N. Onuchic. 2000. Investigation of routes and funnels in protein folding by free energy functional methods. Proc. Natl. Acad. Sci. USA. 97:65096514.
Plotkin, S. S., and J. N. Onuchic. 2002a. Understanding protein folding with energy landscape theory. I. Basic concepts. Q. Rev. Biophys. 35:111167.[CrossRef][Medline]
Plotkin, S. S., and J. N. Onuchic. 2002b. Understanding protein folding with energy landscape theory. II. Quantitative aspects. Q. Rev. Biophys. 35:205286.[CrossRef][Medline]
Portman, J. J., S. Takada, and P. G. Wolynes. 2001. Microscopic theory of protein folding rates. II. Local reaction coordinates and chain dynamics. J. Chem. Phys. 114:50825096.[CrossRef]
Scalley, M., and D. Baker. 1997. Protein folding kinetics exhibit an Arrhenius temperature dependence when corrected for the temperature dependence of protein stability. Proc. Natl. Acad. Sci. USA. 94:1063610640.
Schindler, T., and F. X. Schmid. 1996. Thermodynamic properties of an extremely rapid protein folding reaction. Biochemistry. 35:1683316842.[CrossRef][Medline]
Schuler, B., E. A. Lipman, and W. A. Eaton. 2002. Probing the free-energy surface for protein folding with single-molecule fluorescence spectroscopy. Nature. 419:743747.[CrossRef][Medline]
Shea, J. E., and C. L. Brooks III. 2001. From folding theories to folding proteins: a review and assessment of simulation studies of protein folding and unfolding. Annu. Rev. Phys. Chem. 52:499535.[CrossRef][Medline]
Shoemaker, B. A., J. Wang, and P. G. Wolynes. 1999. Exploring structures in protein folding funnels with free energy functionals: the transition state ensemble. J. Mol. Biol. 287:675694.[CrossRef][Medline]
Socci, N. D., J. N. Onuchic, and P. G. Wolynes. 1996. Diffusive dynamics of the reaction coordinate for protein folding funnels. J. Chem. Phys. 104:58605868.[CrossRef]
Wang, J., S. S. Plotkin, and P. G. Wolynes. 1997. Configurational diffusion on a locally connected correlated energy landscape; application to finite, random heteropolymers. J. Phys. I (Fr.). 7:395421.[CrossRef]
Zhou, Y., C. Zhang, G. Stell, and J. Wang. 2003. Temperature dependence of the distribution of the first passage time: results from discontinuous molecular dynamics simulations of an all-atom model of the second ß-hairpin fragment of protein-G. J. Am. Chem. Soc. 125:63006305.[CrossRef][Medline]
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |