| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
Institut für Biochemie, Eidgenössische Technische Hochschule, ETH-Hönggerberg, HPM, 8093 Zurich, Switzerland
Correspondence: Address reprint requests to Ernesto E. Di Iorio, Institut für Biochemie, Eidgenössische Technische Hochschule, ETH-Hönggerberg, HPM, 8093 Zurich, Switzerland. Tel.: +41-1-6323137; Fax: +41-1-6321298; E-mail: diiorio{at}bc.biol.ethz.ch.
| ABSTRACT |
|---|
|
|
|---|
| INTRODUCTION |
|---|
|
|
|---|
Frustration, be it energetic or geometric (Sadoc and Mosseri, 1999
), is a physical feature that appears to mingle folding and structural flexibility. Energetic frustration implies the existence of competing nonbonded interactions, whereas geometric frustration comes about when local order cannot be freely propagated throughout the space. It has been suggested that the folding process is driven by a search for minimal frustration (Bryngelson et al., 1995
; Wolynes and Eaton, 1999
; Wolynes et al., 1996
), as there is evidence for a direct correlation between frustration and structural flexibility of a fold (Tavernelli and Di Iorio, 2001
).
In this context, a relevant question is whether structural flexibility and frustration phenomena are involved in the in vivo control of protein folding. A particularly interesting case is reglucosylation of the glycan moiety of glycoproteins. As briefly explained below, this is a key step in the control of the folding state of glycoproteins secreted by the endoplasmic reticulum (ER). We have, therefore, decided to compare the dynamic properties of folds that do or do not undergo reglucosylation. In particular, we have investigated bovine RNase-A and its derivative, referred to as S-protein, resulting from the removal of the first 20 residuesthe S-peptideafter cleavage with subtilisin (Fig. 1). The glycosylated counterparts of these two proteins, which contain an N-linked glycan moiety at Asn-34 and are known as RNase-B and BS-protein, have been used as model system to investigate experimentally how folding is controlled in the ER (Ritter, 2002
). Thanks to this quality control, only correctly folded and assembled glycoproteins can exit the ER. The process, which involves among other enzymes several molecular chaperones, has been reviewed by the Helenius group (Ellgaard et al., 1999
), who named it the calnexin/calreticulin cycle. Here we outline its essential features. After its translocation from the cytosol, the nascent glycoprotein chain is cotranslationally glycosylated by the addition of an N-linked glycan to an Asn that lies in a sequence Asn-X-Ser/Thr. The glycan has a branched structure formed by two N-acetylglycosamine residues, nine mannose units, and three glucoses. Glucosidase I and II rapidly trim two of the three glucoses, leading to the monoglucosylated derivative. Only as such the glycoprotein can bind either calreticulin or calnexin, two lectins acting as molecular chaperones in the ER. The interaction with one of these chaperones and other enzymes helps the glycoprotein in attaining its fully folded structure. Upon dissociation from the macromolecular complex, the third glucose is also removed from the glycan moiety and the molecule undergoes a control by UDP-glucose:glycoprotein glucosyltransferase (UGGT), which checks whether the glycoprotein is correctly folded. If this is the case, the newly assembled molecule is allowed to enter the secretory pathway, otherwise a single glucose is added back to its glycan moiety. This is interpreted as a signal for the rebinding to calnexin or calreticulin and therefore the protein is given another chance to fold properly. After a few unsuccessful attempts, the glycoprotein is degraded. Clearly, UGGT plays a central role in this quality-control cycle acting as folding sensor.
|
We report here the results of a comparative computational investigation on bovine RNase-A and its counterpart AS-protein. They help in rationalizing the experimental data just discussed and provide new insights on the dynamic properties of folding states that share the overall three-dimensional organization of the native structure, but are characterized by a reduced compactness.
| METHODS |
|---|
|
|
|---|
atom. A single protein molecule was centered in a truncated octahedron box, of edge 6.88 nm, leaving a minimum distance of 1.25 nm between any solute atom and the box walls. SPC/E water was added in blocks of previous equilibrated 216 molecules, resulting in a fully hydrated system containing 4854 water molecules. Bond lengths were constrained using the SHAKE algorithm (Ryckaert et al., 1977
The starting structure of the S-protein was obtained from that of RNase-A. First, we divided the protein into two submolecules (RNase-S and S-peptide). Subsequently the atoms belonging to the S-peptide (residues 1-20) were annihilated by the free-energy perturbation method (van Gunsteren et al., 1996
) within 500 ps of molecular dynamics. The resulting structure was used as starting point for a simulation under the same conditions described for RNase-A and as reference configuration during the analysis of the trajectories. In this case, 4777 water molecules surrounded the protein in a truncated octahedral box with an initial edge of 6.86 nm. Other conditions were as previously described (Stella et al., 1999
; Tavernelli and Di Iorio, 2001
). The total simulation time for each of the two systems has been of 10 ns. Temperature, pressure, total potential energy, radius of gyration (RoG), root mean square deviations (RMSD) from the starting structure, and nonbonded interactions between solute and solvent, as well as between various segments of the protein, were continuously monitored.
To allow a direct comparison between the two proteins, the analyses carried out on the simulated RNase atomic trajectories were performed on the S-protein atoms set, unless stated otherwise.
We followed the diffusion of the shape and size of the polypeptide chain by determining the distance between its configuration at time t and that at time zero. In addition to the Euclidean distance between atoms, this was done using the more informative D4 distance (Pliska and Marinari, 1993
), as described in the preceding paper (Tavernelli et al., 2003
).
Radius of gyration, RMSD, fluctuations along the first two principal components (Amadei et al., 1996
) and D4 distance were used to obtain a two-dimensional representation of the free-energy landscape visited during the simulation, as discussed in detail in the accompanying paper (Tavernelli et al., 2003
).
To monitor the spatial organization within the core of the protein, we have computed the time series of the so-called Bond Orientation Order Parameter (BOOP) and their time correlation function, employing a previously reported procedure (Tavernelli and Di Iorio, 2001
) with the following modifications. For the definition of the core, we have selected all the atoms with exposed solvent surface equal to zero and belonging to hydrophobic residues present in both simulated proteins. The bond orientation order parameter was computed considering all the pairs formed by each core atom and its neighbor particles at a distance Rbond linked to it by a bonded or nonbonded interaction.
The solvent exposed surface was calculated using a probe radius of 0.14 nm. The time series of the Voronoi volume were calculated using the software package developed by Gerstein (Gerstein et al., 1995
). The Voronoi method consists in the determination of a Voronoi polyhedron around an atom. To construct it, one defines as many lines as needed to connect the central atom to all of its neighbors within a certain cut-off distance. Subsequently, one defines planes perpendicular to such lines, positioning them at the midpoint between the connected particles. The smallest polyhedron formed by the intersection of such planes is unique and is called the Voronoi polyhedron. If one or more neighbors around an atom cannot be defined, the constructed polyhedron will remain open. This is what happens with atoms at the protein surface. Hence, the Voronoi volume is calculated only for atoms not exposed to the solvent.
We derived from the atomic trajectories the time series of the Voronoi volumes averaged over all core atoms common to both proteins.
| RESULTS AND DISCUSSION |
|---|
|
|
|---|
1.34 nm. Instead, for the S-protein, the same parameter displays large oscillations, whose distributions present two peaks centered at
1.47 nm and 1.49 nm (Fig. 2, insert). For a globular protein, the radius of gyration provides quantitative information on its compactness, but not for a kidney-like shaped molecule like RNase. Nevertheless, comparing the RoG time evolution of RNase-A with that of the S-protein can pinpoint qualitative differences in the overall packing of the two molecules. The data of Fig. 2 indicate that during our simulations the S-protein tends to relax toward a looser configuration whereas RNase-A keeps its original compactness. To better describe the diffusion of shape and size of the system under investigation we used the distance D4 (Pliska and Marinari, 1993
-atoms in a configuration at time t with the corresponding distances in the starting structure. One expects D4 to diffuse toward a maximum value and thereafter to remain practically steady if the system, after having explored the accessible conformational space, finds a minimum (global or local) in the landscape. As shown in Fig. 3, this kind of behavior applies quite well to RNase-A, but not to the S-protein. In the latter system, the D4 distance displays first a sharp rise, immediately followed by a decline that leads to a relatively unstable profile, characterized by values consistently higher than those of RNase-A and possibly by a positive slope. This confirms that the S-protein loses compactness, as revealed by the time evolution of the RoG (Fig. 2). In addition, the data of Fig. 3 show that the S-protein has access to a larger local conformational space compared to RNase-A, most likely not completely visited during the simulation.
|
|
-RMSD from the starting structure for RNase-A and for the S-protein (top), along with the difference between the two profiles (bottom). RMSDs are notoriously correlated with the flexibility of a molecule. As expected for a system that loses compactness, the S-protein displays larger RMSD than RNase-A, thus implying a greater structural flexibility. The profile of the RMSD difference does not correlate with the secondary structural organization of RNase-A, the largest differences being observed in the region around the glycosylation site (Asn-34 in bovine RNase-B) and for the two segments Ala-64Gln-74 and Cys-84Ala-96. For this reason, we have analyzed the time series of the number of residues belonging to a particular secondary structure (data not shown). As expected, RNase-A keeps its secondary structural organization throughout the simulation. The S-protein undergoes an initial adaptation to the absence of the S-peptide, which causes a temporary loss of
-helical structure, partially counterbalanced by an increase in the number of residues involved in 3/10-helix segments. However, within the first 2.5 ns of simulation, the number of residues belonging to
-helical segments returns to the starting value, hence implying that the increased structural mobility of the S-protein is not due to loss of secondary structural organization. This is confirmed by the model representation of the S-protein at the end of the simulation depicted in the bottom panel of Fig.1. The secondary structure is essentially maintained, the major difference between the starting and final structures being a relative rotation of the two lobes of the kidney-shaped protein due to a longitudinal twisting of the molecule.
|
Nonbonded interactions
To better define the picture just depicted, one needs a detailed analysis of the energetic changes arising from the removal of the S-peptide from RNase-A. Adding all the nonbonded interaction terms provides a good estimate of the interaction energy within a protein. For the two systems under investigation, the distributions of the total nonbonded interaction energy are all Gaussian in shape and characterized by the parameters listed in Table 1. Clearly, the S-peptide has an overall stabilizing effect quantifiable in roughly 1100 kJ/mol. RNase-A without the contributions from the S-peptide and the S-protein display comparable total nonbonded interaction energies, as expected for two systems that have the same overall three-dimensional arrangement. The moderate, but significant shift toward less negative interaction energies for the S-protein is in agreement with its reduced overall compactness discussed above.
|
In Fig. 5 we report this type of map for RNase-A (lower right triangular half) and for the S-protein (upper right triangular half). The plot shows that there are not many strong attractive pairs (black or blue dots). Furthermore, several strong interactions are located near the diagonal region, thus representing local contacts between nearby residues along the sequence. Interactions that stabilize the fold are between residues that are well separated along the sequence and therefore away from the diagonal of the plot. Our systems display few such interactions that in addition are, to a large extent, common to both proteins. One important exception is the pair Asp-83Lys-104, which is very strongly attractive in the S-protein (black dot) and only moderately so in RNase-A (green dot). However, this interaction has standard deviations of 98 kJ/mol and 31 kJ/mol respectively for the S-protein and for the native molecule, thus indicating that the removal of the S-peptide not only increases its energy, but also its frustration. This is yet another sign reflecting the looser structure of the S-protein. The map of Fig. 5 also shows the existence of two very strong attractive interactions in the native protein not present in the S-protein. These interactions involve the S-peptide and certainly contribute in stabilizing the fold. The central position occupied by the
-helical segment of the S-peptide within the kidney-shaped molecule certainly locks the system such that the longitudinal twisting observed during the simulation on the S-protein cannot take place (Fig. 1).
|
Topological analysis of the hydrophobic core
An important element for the characterization of the dynamic properties of the hydrophobic core of a protein is its geometrical organization (Tavernelli and Di Iorio, 2001
). We have analyzed how the average Voronoi volume of core atoms evolves during the simulation and computed the corresponding histograms from the entire trajectory (data not shown). The S-protein displays a broader and more asymmetric distribution compared to RNase-A, thus indicating a drop in the geometrical order of the core when the S-peptide is removed. To quantify this phenomenon we analyzed the autocorrelation of the BOOP computed as reported previously (Tavernelli and Di Iorio, 2001
) and considering all first neighbors of each core atom. The results are reported in Fig. 6. For RNase-A, after an initial decay, the curve approaches a level with a persistent correlation greater than zero. Instead, the trace relative to the S-protein displays an additional decay that rapidly reduces its correlation. This additional process reveals a loss of order within the core and the tendency of its atoms to organize themselves in subclusters with relatively independent dynamics. In other words, the interactions among particles in each subcluster tend to dominate over global interactions involving the whole core of the protein. This behavior indicates that the S-protein is diffusing toward a looser state, where the system gains in ergodicity, similarly to what is observed in amorphous systems and in spin glasses (Fischer and Hertz, 1991
; Mezard et al., 1987
). Ergodicity breaking means that time and ensemble averages do not coincide and is typical of systems with multivalley energy landscapes below a critical temperature, when trapping within local energetic minima takes place. Native proteins display this behavior, but, if the system starts diffusing toward a looser state, long-time correlations decrease rapidly while the volume of the sampled phase space increases (Tavernelli and Di Iorio, 2001
). Eventually, this could lead to complete unfolding, which is the most ergodic condition for a protein.
|
Fig. 7 depicts the free-energy contour plots computed from the total nonbonded energy of the solute and the conformational distance D4. Instead, in Fig. 8 we report the projection of the trajectories on the same graphs. RNase-A (top panel of Fig. 7) displays three well-defined minima. The system remains in the leftmost minimum for the first 0.85 ns (red trace in the top graph of Fig. 8), thereafter it moves to the middle well where it stays for most of the following 3.15 ns (yellow), and finally jumps to the rightmost basin where it remains during the last 6 ns of the simulation (magenta).
|
|
Although different in the details and less informative, the same overall picture emerges from contour plots computed using global parameters other than the D4 distance like RMSD, radius of gyration, or fluctuations along principal components (data not shown). The larger conformational volume visited by the S-protein implies that entropic contributions are higher in this protein than in RNase-A, in agreement with its enhanced flexibility.
Concluding remarks
Our computational investigation shows that the removal of the S-peptide from RNase-A triggers the transition to a state that, conserving its three-dimensional arrangement, displays: i), an increased structural flexibility; ii), a looser packing; iii), a lower coherence in the dynamics of the hydrophobic core; and iv), higher frustration and ergodicity. This is in full agreement with the finding that the rate of proton exchange strongly decreases when the S-protein forms a complex with the S-peptide (Rosa and Richards, 1981
). Therefore, it is reasonable to conclude that UGGT, the molecular system responsible for the folding quality control of glycoproteins in the ER, senses the structural flexibility of its substrates. We envisage the following picture. When UGGT binds a glycoprotein, it attempts to pull the glycan moiety inside its active site, where the glucosylation reaction should be catalyzed. Only if the glycoprotein is loosely folded the glycan will have the necessary mobility for this process to be successfully completed. The finding that UGGT recognizes misfolded RNase, no matter where the sugar moiety is located along the sequence (Ritter, 2002
), is in agreement with this interpretation.
Besides providing a plausible explanation for the mechanism underlying the control of protein folding in the ER, this investigation confirms the power of comparative analyses of the time autocorrelation of the BOOP (Tavernelli and Di Iorio, 2001
) and of the energy landscape (Tavernelli et al., 2003
). Even from short simulations relative to the timescale of protein denaturation, these approaches pinpoint features indicative of a transition to folding intermediates.
| ACKNOWLEDGEMENTS |
|---|
|
|
|---|
This work was supported by the Eidgenössische Technische Hochschule, ETH-Zurich (grants 41-2517.5 and 0-50503-00).
| FOOTNOTES |
|---|
Simona Cotesta's present address is Pharmacia Italia, Computational Sciences, Department of Chemistry, Viale Pasteur 10, 20014 Nerviano, Italy.
Ivano Tavernelli's present address is Ecole Polytechnique Fédérale de Lausanne, Institut de Chimie Moléculaire et Biologique, BCHLCBC, 1015 Lausanne, Switzerland.
Submitted on December 17, 2002; accepted for publication June 26, 2003.
| REFERENCES |
|---|
|
|
|---|
Bryngelson, J. D., J. N. Onuchic, N. D. Socci, and P. G. Wolynes. 1995. Funnels, pathways, and the energy landscape of protein folding: a synthesis. Proteins. 21:167195.[Medline]
Chakshusmathi, G., S. Ratnaparkhi-Girish, P. K. Madhu, and R. Varadarajan. 1999. Native-state hydrogen-exchange studies of a fragment complex can provide structural information about the isolated fragments. Proc. Natl. Acad. Sci. USA. 96:78997904.
Di Iorio, E. E., I. Tavernelli, and W. Yu. 1997. Dynamic properties of the monomeric insect erythrocruorin-III from Chironomus thummi-thummi. Relationships between structural flexibility and functional complexity. Biophys. J. 73:27422751.
Ellgaard, L., M. Molinari, and A. Helenius. 1999. Setting the standards: quality control in the secretory pathway. Science. 286:18821888.
Fischer, K. H., and J. A. Hertz. 1991. Spin Glasses. Cambridge University Press, Cambridge UK.
Gerstein, M., J. Tsai, and M. Levitt. 1995. The volume of atoms on the protein surface calculated from simulation, using Voronoi polyhedra. J. Mol. Biol. 249:955966.[Medline]
Koradi, R., M. Billeter, and K. Wüthrich. 1996. MOLMOL: a program for display and analysis of macromolecular structures. J. Mol. Graph. 14:5155.[Medline]
Mezard, M., G. Parisi, and M. A. Virasoro. 1987. Spin Glass Theory and Beyond. Word Scientific, Singapore.
Pliska, P., and E. Marinari. 1993. On heteroplymer shape dynamics. Europhysics Lett. 22:167173.
Ritter, C. 2002. Characterization of the substrate properties recognized by the ER folding sensor UDP-glucose:glycoprotein glucosyltransferase. In Biology. Swiss Federal Institute of Technology-ETH, Zurich. 111.
Ritter, C., and A. Helenius. 2000. Recognition of local glycoprotein misfolding by the ER folding sensor UDP-glucose: glycoprotein glucosyltrasferase. Nat. Struct. Biol. 7:278280.[Medline]
Rosa, J. J., and F. M. Richards. 1981. Hydrogen exchange from identified regions of the S-protein component of ribonuclease as a function of temperature, pH, and the binding of S-peptide. J. Mol. Biol. 145:835851.[Medline]
Ryckaert, J.-P., G. Ciccotti, and H. J. C. Berendsen. 1977. Numerical integration of the cartesian equations of motion of a system with constraints: molecular dynamics of n-alkanes. J. Comput. Phys. 23:327341.
Sadoc, J.-F., and R. Mosseri. 1999. Geometrical Frustration. Cambridge University Press, Cambridge, UK. 307
Stella, L., M. Nicotra, G. Ricci, N. Rosato, and E. E. Di Iorio. 1999. Molecular dynamics simulations of human glutathione transferase P 11: analysis of the induced-fit mechanism by GST binding. Proteins. 37:19.[Medline]
Tavernelli, I., S. Cotesta, and E. E. Di Iorio. 2003. Protein dynamics, thermal stability, and free energy landscapes: a molecular dynamics investigation. Biophys. J. 85:26412649.
Tavernelli, I., and E. E. Di Iorio. 2001. The interplay between protein dynamics and frustration of non-bonded interactions as revealed by molecular dynamics simulations. Chem. Phys. Lett. 345:287294.
Tironi, I. G., R. Sperb, P. E. Smith, and W. F. van Gunsteren. 1995. A generalized reaction field method for molecular dynamics simulations. J. Chem. Phys. 102:54515459.
van Gunsteren, W. F., S. R. Billeter, A. A. Eising, P. H. Hünenberger, P. Krüger, A. E. Mark, A. E. Scott, and I. G. Tironi. 1996. Biomolecular Simulation: The GROMOS96 Manual and User Guide. VDF Hochschulverlag AG an der ETH Zürich, Zürich.
Wolynes, P. G., and W. A. Eaton. 1999. The physics of protein folding. Physics World. 12:3944.
Wolynes, P. G., Z. Luthey-Schulten, and J. N. Onuchic. 1996. Fast folding experiments and the topography of protein folding energy landscapes. Chemistry & Biology. 3:425432.[Medline]
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |