| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Department of Biochemistry and Molecular Biophysics, Washington University School of Medicine, St. Louis, Missouri 63110
Correspondence: Address reprint requests to Kathleen B. Hall, Tel.: 314-362-4196; Fax: 314-362-7183; E-mail: hall{at}bionmr3.wustl.edu.
| ABSTRACT |
|---|
|
|
|---|
| INTRODUCTION |
|---|
|
|
|---|
12 amino acid C-terminal extension (totaling the N-terminal 101 residues), this domain is sufficient for high affinity binding to stem/loop II (SL2) of the U1 small nuclear RNA (3
A), loop 3 (connecting ß2 and ß3), and the linker following ß4 (T89D90S91; Fig. 1 a).
|
The RNA-binding surface of RBD1 has been studied extensively by site directed mutagenesis (10
14
), as has the RNA loop (9
,15
). In vitro RNA-binding experiments have revealed that most of the RNA-binding surface of the domain is thermodynamically coupled (13
,14
); i.e., residues on the protein's binding surface do not function independently of each other. In a striking example of communication across the ß-sheet, mutants with disrupted thermodynamic coupling were shown to have significantly altered loop 3 flexibility, even if the site of mutation was not proximal to the loop (14
). The RNA is part of the network of thermodynamically coupled sites that determine and maintain the protein/RNA complex (13
,14
).
Most RBDs contain a glycine at the junction of ß3 and loop 3, suggesting that it has a special function in these proteins. This glycine (G53) was replaced in U1A RBD1, and indeed mutants had a weaker affinity for SL2; specificity of RNA recognition was also changed (16
). Molecular dynamics trajectories calculated for the wild-type and G53 variant domains (17
) were analyzed using the reorientational eigenmode dynamics (RED) formalism (18
). This analysis showed that there are correlated motions spanning unbound RBD1-WT involving loop 1, loop 3, and the C-terminal tail; these correlation networks are disrupted by mutation of the conserved glycine (17
). It appears that correlated intramolecular motions of the protein are required for the conformational changes that must occur upon RNA binding. We propose that rapid correlated motions are the physical manifestation of the thermodynamic coupling across the RNA-binding surface of RBD1.
The structure and dynamics of the complex (19
23
) and also the free RNA (21
,22
) have been studied through molecular dynamics simulations. Simulations of free SL2 predict a highly flexible (disordered) loop with the bases turned inward and shielded from solvent, consistent with the observations of Hall (9
). Simulations of the RBD1:SL2 complex confirm the prediction that the RNA becomes locked into a well-ordered conformation and that the rigidity of the buried interface is similar to that expected for residues buried in the core of a protein (20
). In each simulation of the RBD1:SL2 complex, the well-conserved GCAC sequence (Fig. 1 b) shows minimal conformational fluctuations and participates in several high occupancy hydrogen bonding interactions with the protein. Unlike the bases that are recognized specifically, the solvent exposed UCC sequence 3' to the conserved GCAC is highly flexible in all simulations of the complex. In most simulations (20
,22
), the base of U8 (numbering relative to Fig. 1 b), which makes only van der Waals contacts with RBD1, is no more flexible than the GCAC sequence after it; but in one case it displays high flexibility, much like the mobile UCC sequence (21
). MM/PBSA analysis of simulations calculated for the free protein, free RNA, and the complex support the hypothesis that a free energy penalty must be paid upon binding in return for the extensive ordering of both the protein and the RNA relative to their free forms (22
).
Although these simulations do present an intriguing picture of the RBD1:SL2 complex, consistent with binding thermodynamics, they were not analyzed with a method (such as RED; (18
)) designed to investigate the network of correlated motions in the complex. Extensive correlations between distal sites on the RNA-binding surface of unbound RBD1 have been predicted from RED analysis of MD trajectories (17
); similar analysis of an RBD1:SL2 trajectory would reveal whether these correlations are retained in the complex and, more interestingly, if they are expanded to include the RNA. Here we present the analysis of an MD trajectory of the RBD1:SL2 complex using the RED formalism to characterize the amplitude and timescale of motion sampled by RBD1 and SL2 and the extent of dynamic correlation in the motions sampled. Through comparison of the calculated order parameters with previously published experimental and computational results for the unbound state, this analysis confirms that much of RBD1 and the RNA loop are stiffened by complex formation. Most importantly, the analysis also shows dramatic changes in correlated motion throughout RBD1 and reveals correlations spanning the protein/RNA interface.
| MATERIALS AND METHODS |
|---|
|
|
|---|
RED analysis
The RED formalism has been described in detail elsewhere (18
). All calculations were performed using in-house programs written in either Perl or C. The implementation of RED was consistent with our previous work (17
), except that matrix diagonalization was performed using a new in-house C program. After superposition to remove translational and rotational diffusion, the polar angles representing the orientation of a unit vector colinear with the amide bond of each of the 91 nonproline, non-N-terminal protein residues was calculated every 0.5 ps along the final 4 ns of the 5 ns trajectory. Likewise, the polar angles representing the unit vector colinear with each backbone C1'-H1' bond, the purine C8H8 bonds, adenine C2H2 bonds, and the pyrimidine C5H5 bonds of the RNA were computed. The saved coordinates were used to generate the 136 x 136 reorientational covariance matrix M, which has elements (18
)
![]() | (1) |
Y2M = Y2M(
)
Y2M are the normalized spherical harmonics of rank 2, and the bar indicates ensemble averaging over the coordinates from the simulation. One important property of the matrix M in this representation is that its diagonal elements are proportional to the generalized order parameter S2 of Lipari and Szabo (18
![]() | (2) |
2 is the variance of spherical harmonic Y2M. The covariance matrix can be diagonalized by solving the eigenvalue problem M|k > =
k|k> (k = 1, ..., n; n = 136). The resulting normalized reorientational eigenvectors |k> contain information about which spin interactions reorient in concert under the influence of each motional mode, and the eigenvalues
k represent the amplitude of the observed motion.
Although the eigenvectors of M contain detailed information about the dynamic correlations between sites, they contain no information about the timescale on which the observed motion occurs. The characteristic timescale of motion for each eigenvector was determined by constructing the correlation function (18
)
![]() | (3) |
m associated each motional mode can be established (30
Eigenvector collectivity
The collectivity of an eigenvector is defined by the parameter
, which is roughly proportional to the percentage of sites significantly reoriented by the motion represented by that vector (32
):
![]() | (4) |
ranges from 1/N to 1. | RESULTS |
|---|
|
|
|---|
|
|
Order parameter predictions from RED analysis
The RED formalism has been applied to the RBD1:SL2 trajectory to model the spin relaxation active dynamics. Snapshots collected every 0.5 ps over the final 4 ns of the trajectory were superposed and used to construct the RED covariance matrix (Materials and Methods, Eq. 1), for a total of 136 spin pairs: the 91 non-N-terminal, nonproline RBD1 backbone amides; the 20 backbone C1'-H1', 8 purine C8-H8, 5 adenine C2-H2, and 12 pyrimidine C5-H5. Order parameters for each of these sites were computed using Eq. 2 (Materials and Methods). It was anticipated that the result would show a loss of RBD1 backbone flexibility due to the bound RNA, especially in loop 3 and the TDS linker, which contact the RNA. This was indeed observed in NMR relaxation experiments of a complex of RBD1 and an internal bulge RNA (33
). Analogously, most sites in SL2 that contact RBD1 should be well ordered, yielding high order parameters.
RBD1 backbone dynamics
Computed RBD1 backbone amide order parameters are shown in Fig. 3 a and are uniformly high throughout most of the domain. The average order parameter of 0.85 ± 0.06 for all sites is consistent with most of the molecule having low conformational freedom in the complex. Table 2 contains average order parameter statistics for RBD1 broken down by position in the protein, as well as comparison with experimental NMR (16
) and RED (17
) results for the unbound domain. Previously published RED results for unbound RBD1 showed an average order parameter of 0.78 ± 0.17 (17
). As previously noted, the simulations overestimate the mobility of loops on this timescale. Although the unbound average is similar (within error) to the bound average, the unbound values were considerably more variable, as demonstrated by the much larger standard deviation.
|
|
SL2 ribose and base dynamics
The computed ribose C1' order parameters are shown in Fig. 3 b. Much like the backbone of RBD1, the majority of the riboses are uniformly rigid in the complex, as evidenced by high order parameters, with the notable exception of the UCC sequence at the 3' end of the loop. Also more mobile is the terminal basepair, which displays some fraying behavior, especially late in the simulation. From Table 2, it is clear that the seven loop nucleotides associated with RBD1 in the complex have ribose order parameters indistinguishable from those of the nonfraying positions in the stem. In contrast, the UCC sequence appears to be the most mobile site in the entire complex, consistent with its exposure to solvent and lack of contact with RBD1.
Fig. 3 c displays a trend for the base order parameters that is very consistent with that reported by the ribose sites for the backbone. Again, the UCC sequence displays extensive flexibility, whereas the rest of the loop bases are no more mobile than those in the stem. The one notable difference between the ribose and base order parameters is that the base of U8 appears to be more mobile than its neighbors (S2 = 0.81; compared to the average of 0.90 ± 0.04 for all loop bases that contact RBD1, Table 2). Clearly, even though U8 is somewhat mobile, it is much more restricted than the bases of the UCC sequence (average S2 = 0.36 ± 0.07), consistent with its retention of a single conformation during the trajectory (Fig. 2 a).
Eigenmodes of the RED covariance matrix
The information encoded in the RED covariance matrix is much richer than the order parameters, in that it also describes the correlations between the reorientations of each site (18
). In the case of the RBD1:SL2 complex, this offers the potential to reveal networks of correlated motions that span the protein/RNA interface, in addition to enumerating motion localized to the protein or RNA alone. These correlations are most readily visualized through calculation of the reorientational eigenvectors (|k>) and associated eigenvalues (
k) of the covariance matrix by solving the eigenvalue problem M|k > =
k|k> (k = 1, ..., n; n = 136 for the RBD1:SL2 complex).
Collectivity of the eigenmodes
The eigenvalue (
k) associated with each eigenvector |k> of the RED covariance matrix M represents the amplitude of motion along the reorientational mode (18
). The collectivity (
, Materials and Methods, Eq. 4) of a given eigenvector reflects the percentage of spin interactions significantly reorienting under the influence of the eigenmode (32
), and so indicates the degree of correlation between all sites in the molecule imposed by the mode. A plot of
versus
k (Fig. 4) calculated from the trajectory shows a distribution similar in shape to those previously reported for Ubiquitin (18
) and unbound RBD1 (17
) where a few modes have high amplitude (
k > 0.1), but are local motions (
< 0.1), whereas the majority of modes are lower amplitude but more collective.
|
versus
k distribution displayed with black diamonds in Fig. 5. Compared with the distribution from RED analysis of unbound RBD1 (Fig. 5, red circles; (17
max = 0.45) is higher than the maximum in the absence of RNA (
max = 0.37), indicating that at least some of the protein modes are more collective in the complex.
|
The distribution of calculated lifetimes (
k) ordered by decreasing eigenvalue of the corresponding eigenvector is shown in Fig. 6. Given that snapshots were saved every 0.5 ps for 4 ns,
k values ranging from
5 to 400 ps are expected to be meaningful. This range is comparable to the timescale of internal motions most reliably reported by model-free relaxation parameters, and many of the 20 eigenvectors with the largest associated eigenvalues have
k falling in this range. However, over the whole distribution, surprisingly few modes displayed lifetimes significantly above zero; none of the eigenvectors beyond the first 20 had associated correlation functions with initial values significantly above their plateau values, indicating that they decay very rapidly compared to the timescale of motion reliably reported (
k >> 0.5 ps). These trends are consistent with unbound RBD1, although the unbound molecule did display slightly longer
k, on average, for those modes with significantly nonzero lifetimes (17
).
|
for each spin pair. In this representation, the coefficient at site j is proportional to the eigenvector's contribution to the order parameter of site
Fig. 7 contains representations of e1, e16, and e54 in this form.
|
The eigenvector with the largest
(Materials and Methods, Eq. 4) represents the most highly collective motion sampled by the trajectory. This eigenvector (e54,
= 0.45) is displayed in Fig. 7 c and clearly contains correlations spanning the protein/RNA interface. Although each of the most mobile regions in RBD1 is reoriented by this mode, the UCC sequence in the SL2 loop is conspicuously absent. This pattern of correlations spanning the protein/RNA interface, but excluding the UCC sequence, is found in the majority of the high collectivity eigenvectors.
The 136 eigenvectors that describe the networks of correlated motions contain many examples of how these correlations are distributed in the complex. Some are restricted to the most mobile regions (UCC in the RNA loop, loop 5, or the N-terminal tail of RBD1) and so have high amplitude eigenvalues (
k) but low collectivity (
), since they consist of contributions from only a few sites. Others contain contributions from virtually the entire complex, such as e54 with its high collectivity and low amplitude. A discussion of each eigenvector is not practical or informative, but a more detailed examination of several does illustrate some important features of the complex.
Four eigenvectors were selected for more discussion on the basis of their contributions from the RNA. The four are given in Fig. 8, where the predominant spin interactions are identified. The sites that contribute to each are mapped onto the structure of the cocrystal (7
) to provide a different view of the patterns (Fig. 9). In those structures, only a backbone ribbon represents the protein backbone amides, whereas those nucleotides that contribute to the eigenmode are explicitly depicted.
|
|
k) of its eigenvector is longer than that of most eigenvectors (Fig. 6), but its correlation function decays to a stable plateau, indicating convergence over the course of the 5 ns simulation (data not shown). Parameters of e10 show that its amplitude (
k = 0.145) is still appreciable; its
= 0.239.
Eigenvector e29 includes contributions from the body of RBD1, including the RNA/protein interface (ß1, ß3, and loop3) and residues Lys88 and Thr89 from the C-terminal tail. The RNA stem is also included in this eigenvector, but the most prominent contributions come from A2 and U8 (Fig. 8 b). In the cocrystal, the side chain of Lys20 could interact with the phosphate backbone of the stem, close to A2, and perhaps explains the presence of the amide of Lys20 in this eigenmode (Fig. 9 b). The U8 base has been previously noted in simulations to show unusual mobility in the complex, and here it also is the only base among those at the interface that contributes to this eigenvector. The 5 ps correlation time for e29 indicates that these motions are rapid, and its correlation function is converged. The amplitude of e29 (
k = 0.079) is low, but its collectivity,
= 0.38, shows that there are many contributions to this eigenmode.
The predominant motions in e31 come from uniformly small amplitude contributions across the RNA/protein interface. These include loop nucleotides A6U7...C10A11C12 (U8 and G9 do not appear). From the protein, amides of Tyr13, Phe56, Lys88, and Thr89 form the major part of the binding surface; together these two areas of the molecules illustrate how the complex is now linked through correlated motions (Fig. 9 c). The relatively large contributions from Arg7 could reflect the network of interactions that link the N- and C-termini of the protein; the contribution from Ser71 is not easily explained. Eigenvector e31 has a lifetime of 5 ps, an amplitude of 0.070, and a collectivity of 0.289.
Finally, e40 is predominantly a protein eigenmode that excludes the binding surface. RNA contributions are relatively minor and come from A1 and the ribose of C10 (Figs. 8 d and 9 d). Their presence in e40 illustrates how selective coupling occurs in the complex and how unpredictable it appears. This eigenmode, with a lifetime of 5 ps, is converged, indicating its modes should be adequately sampled in the simulation. Its amplitude is 0.061, while its collectivity is 0.278.
There are many other examples of the intricate connections between RNA and protein in the other eigenvectors. Some correlated motions can be readily explained by proximity or covalent connectivity, whereas others do not appear to have a simple physical rationale. An obviously missing component in the analysis is any contribution of water molecules that could serve as connectors between sites. Future directions of this work include investigation of the role of solvent in these correlations.
Correlations on the RBD1-binding face
Many eigenvectors of the RED covariance matrix calculated for unbound RBD1 displayed large amplitude correlated motions coupling loop 1, loop 3, and the TDS sequence, three important elements of the RNA-binding surface (17
). For the RBD1:SL2 complex, these correlations are not as easy to see as the amplitude of motion experienced by these regions has been significantly reduced. It is predicted that what little motion these sites experience should still display a high degree of correlation, however, and given that they all interact with SL2, the RNA too should contribute to the eigenvectors that reorient them. When many sites contribute to an eigenvector (high
; for example e54, Fig. 7), it can be difficult to observe patterns in the correlation between sites of interest. In this case, it is convenient to consider the vector |ksub> spanning the subspace in the orthonormalized eigenvector |k> that represents a particular region of the molecule, such as a loop or a secondary structure element. The projection of |k> onto this subspace (||ksub||2) for each |k> can help reveal which eigenvectors reorient the region of interest.
For example, projecting each eigenvector onto loop 1 reveals the modes that contribute to its mobility; comparison with the same projection calculated for loop 3 shows which modes affect both sites simultaneously (Fig. 10 a). In this plot, the projection of each normalized eigenvector onto the loop 1 subspace (black) and the loop 3 subspace (blue) is displayed as a bar, with height proportional to the magnitude of the projection. Because the eigenvectors |k> are normalized, this projection can never exceed 1; large ||ksub||2 indicates that the eigenvector is dominated by motions of the subspace of interest. As can be seen in Fig. 10 a, most modes which reorient loop 1 also reorient loop 3, but much of the motion experienced by loop 3 is independent of loop 1. Because the eigenvectors are ordered by descending eigenvalue (motional amplitude), the tendency of loop 3 modes to appear to the right of the plot confirms that loop 3 is less mobile than loop 1. This is consistent with the intrusion of loop 3 into the RNA loop, whereas loop 1 contacts the stem through mostly side chain interactions, leaving the backbone in this region less conformationally constrained.
|
Correlations in the SL2 loop
Mutational analysis of SL2 has shown that several sites in the loop are strongly coupled (9
,15
). For example, mutation of G9 (the beginning of the highly conserved GCAC sequence) results in an
104-fold loss of binding affinity. This number is far more than should be expected from loss of hydrogen bonding interactions from a single base, leading to the hypothesis that mutation at this site (and others that displayed similar trends) not only affects base readout locally, but also at distal sites, due to a reorganization of the RNA/protein interface.
The first seven bases of the loop sequence are all recognized by RBD1, with the conserved GCAC sequence displaying higher sequence specificity than the 5' AUU (15
). Fig. 11 a displays the projection of the normalized eigenvectors onto the AUU and GCAC sequences. The few modes that project strongly onto either of these sequences tend to have significant projections onto both. This suggests that the intramolecular motions of the entire seven nucleotide sequence are correlated. These correlations appear to extend through the C-G loop closing basepair, the only other site in SL2 that is recognized with sequence specificity (not shown). Although the mode amplitude at any single site in the first seven nucleotides of the loop is generally small (too small for clear graphical visualization), correlation between them is almost always seen in the eigenmodes which reorient the loop. This is consistent with the larger than expected changes in binding-free energy upon single site mutation, although the disproportionate selectivity in the GCAC sequence must come from some other source.
|
| DISCUSSION |
|---|
|
|
|---|
Dynamic properties of the complex
As the cocrystal showed, there are two distinct regions of the RNA loop as it is held in the complex: the C:G loop closing basepair and the AUUGCAC sequence form a tight interface with the protein, whereas the UCC sequence is solvent exposed (7). The loop closing C-G basepair of the RNA, as well as the first seven nucleotides of the loop (Fig. 1 b), are recognized with varying levels of sequence specificity, with the conserved GCAC sequence playing an especially important role (15
). RBD1 does not read the base composition of the final three loop positions; these nucleotides appear to serve only as spacers that maintain the total loop length necessary for proper recognition of the first seven bases (34
). The RNA-binding surface of RBD1 includes ß1 and ß3, which are covered by RNA, loop 1, which makes electrostatic interactions with the stem of the RNA, and loop 3, which moves into the RNA stem/loop junction to contact specific bases. The structure of the complex leads to the prediction that the interface will be highly structured, but the extent of its dynamic motion cannot be anticipated.
Structural rearrangements of RBD1 are confined to its loops and the T89D90S91 sequence (noting that the C-terminal tail must extend away from the body of the protein to accommodate the RNA). Three loops of the domain (loops 1, 3, and 5) were shown by 15N NMR relaxation measurements to have lower order parameters than the body of the protein (14
,16
,35
). Those properties were correctly described by RED analysis of trajectories of unbound RBD1 (17
), leading to the expectation that RED analysis of the complex will be similarly accurate and thus allow an evaluation of how the dynamics of the protein have been altered by complex formation.
The structures of the RNA alone in solution and bound to protein are very different. The most drastic conformational change that accompanies formation of the complex is that of SL2 RNA. In solution, the loop structure changes from stacked, A-form-like structure to a flexible and floppy structure as a function of temperature (NMR data, not shown). In the complex, the bases of SL2 are splayed outward to make sequence-specific contacts, forming a constrained structure on the surface of the protein. Unfortunately, there are no experimental data to describe the timescales of motion in the free RNA, the bound RNA, or of the conformational change.
Some motions that involve both RNA and protein will be rapid (ps-ns) and thus captured by the 5 ns molecular dynamics simulations. These motions would also be reported by order parameters obtained from NMR relaxation experiments, analyzed by the Lipari-Szabo formalism (30
,31
). In the unfortunate absence of experimental NMR data that describe the dynamics of this complex, we rely here on RED analysis and its predictions.
Given the highly flexible nature of unbound SL2 and of the RBD1 loops in the binding surface, it is surprising just how inflexible the amino acids and nucleotides forming the protein/RNA interface are in this complex. The order parameters calculated from RED analysis of the MD trajectory predict that the backbone of loop 1, loop 3, and the TDS linker are just as rigid as the backbone of the various secondary structure elements. In contrast, loop 5 retains the low order parameters indicative of conformational freedom that it displays in the absence of RNA. Structurally, loop 5 packs against loop 1, which in turn packs against loop 3 in unbound RBD1; and yet, RED analysis of the unbound domain did not reveal strong correlation between loop 5 and either loop 1 or loop 3 (17
). These results, combined with this analysis of the RBD1:SL2 complex, suggest that loop 5 is mostly independent of loop 1 and loop 3, despite their spatial proximity. It is intriguing that the correlation network that is hypothesized to so strongly couple the RNA-binding interface does not extend to the adjacent loop that does not contact RNA.
The UCC sequence is independent
Although the first seven nucleotides of the loop display restricted motion, order parameters for the UCC sequence are very low, indicating that both bases and riboses are highly flexible. The effect on binding affinity of making single RNA point mutations in the AUUGCAC sequence is nonadditive, meaning that the sum of the changes in binding affinity from each mutation is drastically higher than the overall affinity (9
,15
). This implies a highly cooperative structure for the bases of the loop, with mutation at one site preventing proper sequence-specific recognition of some or all of the others. This effect did not extend into the UCC sequence that makes no contact with RBD1 in the cocrystal structure (7
) (Fig. 1 b) and can be replaced by polyethylene glycol without affecting the binding affinity (34
). This suggests that at least some of the observed coupling between the first seven bases of the loop originates from the bound form of the RNA, even though they are primarily displayed on the protein surface with an outward orientation such that they do not stack with each other. The loop closing C:G basepair is also involved in the cooperative structure element defined by the first seven loop nucleotides in the complex, meaning that the UCC sequence truly is isolated from its covalent neighbors.
The eigenvectors of the RED covariance matrix confirm the virtual excision of the UCC sequence from the rest of the loop structure, as the majority of the eigenvectors that reorient it do not reorient any other loop nucleotides, or the C:G close, to any appreciable extent. The majority of the eigenvectors that reorient the UCC sequence have appearances similar to e1 (Fig. 7 a), with the relative weight of the fraying terminal basepair and the individual elements of the UCC sequence varying between the vectors. Lastly, the UCC sequence is conspicuously absent from most of the interface spanning eigenmodes (such as e54, Fig. 7 c), which confirms that it does not interact with any of the portions of RBD1 that define the binding interface. It is striking that the UCC sequence can serve as such an independent element without disrupting the sequence-specific interactions to either its 5' or 3' side.
Tight binding interface
The RNA-binding surface of RBD1 is large, spanning an entire face of the folded domain. MM/PBSA studies of the interaction between RBD1 and SL2 predict that the free energy of ordering the RBD1-binding surface is large (on the order of 10 kcal/mol; (22
)). If the distal elements of this binding surface were not in some way preorganized, the free energy required to order the surface upon binding would be extremely costly. For example, two mutants of G53 (RBD1-G53A and RBD1-G53V) have been shown by RED analysis to decouple distal elements of the binding surface (17
). RBD1-G53A binds with an affinity 2.9 kcal/mol less favorable than wild type, whereas RBD1-G53V loses 5.7 kcal/mol of binding free energy, more than one third of the total wild-type binding free energy (
Go = 13.9 kcal/mol for the conditions used in the comparison; (16
)). The RED results for unbound and bound RBD1 suggest that, although the unbound molecule is flexible, a significant amount of the ordering necessary for RNA binding is already "prepaid" in the pattern and extent of motional correlations present in the unbound domain.
Loop 3 is flexible on the ps-ns timescale in unbound RBD1, but in the context of the G53 mutants a significant portion of this mobility is lost. This suggests that the G53 mutants decouple the binding surface by locking loop 3 down, thus preventing the correlated motions required for coupling. In contrast, NMR spin relaxation studies of constructs with mutations at conserved sites on the ß-sheet surface (Y13, Q54, F56) show that these mutations disrupt thermodynamic coupling while increasing the ps-ns mobility of loop 3 (14
). We predict that RED analysis of trajectories bearing one or more of these mutations would also display decoupled reorientational motion. Combination of the loop 3 and ß-sheet mutagenesis results suggests that the extent of flexibility in loop 3 has been heavily optimized by evolution and that any change toward greater or lesser flexibility is disruptive.
Unlike the unbound state of RBD1, the bound form of the domain is almost uniformly inflexible. It is notable that, although the interface itself is quite rigid, the majority of the eigenmodes of the RED matrix span the interface, correlating the intramolecular reorientations of the two molecules in the complex. It has previously been observed that large amplitude reorientational modes tend to be more local in nature than the largest amplitude modes from standard quasiharmonic analysis (36
), making it unsurprising that the largest amplitude RED modes are dominated by local motion (of the UCC element in e1, for example). Interface spanning modes are seen in normal mode analysis of protein dimers (such as insulin; (37
)) and are predicted to contribute to dimer stability. Modes which correlate RBD1 and SL2 sites begin to dominate the eigenvectors of the RED matrix around e10, which, with
k = 0.145 and
k = 93 ps, is still in the range of eigenmodes with significant impact on the observed dynamics, and likely the stability, of the complex. The RED analysis produces results akin to those for tight protein-protein interactions, suggesting that this behavior may be a more general feature of macromolecular interactions regardless of the molecules involved (protein, nucleic acid, or both). In other words, neither RBD1 nor SL2 behaves like a ligand for the other, but rather the complex should be thought of as a macromolecular assembly.
| CONCLUSIONS |
|---|
|
|
|---|
| ACKNOWLEDGEMENTS |
|---|
|
|
|---|
Submitted on December 17, 2004; accepted for publication May 26, 2005.
| REFERENCES |
|---|
|
|
|---|
2. Burd, C. G., and G. Dreyfuss. 1994. Conserved structures and diversity of functions of RNA-binding proteins. Science. 265:615621.
3. Scherly, D., W. Boelens, W. J. van Venrooij, N. A. Dathan, J. Hamm, and I. W. Mattaj. 1989. Identification of the RNA binding segment of human U1A protein and definition of its binding site on U1 snRNA. EMBO J. 8:41634170.[Medline]
4. Hall, K. B., and W. T. Stump. 1992. Interaction of N-terminal domain of U1A protein with an RNA stem/loop. Nucleic Acids Res. 20:42834290.
5. Nagai, K., C. Oubridge, T. H. Jessen, J. Li, and P. R. Evans. 1990. Crystal structure of the RNA-binding domain of the U1 small nuclear ribonucleoprotein A. Nature. 348:515520.[CrossRef][Medline]
6. Avis, J. M., F. H. T. Allain, P. W. A. Howe, G. Varani, K. Nagai, and D. Neuhaus. 1996. Solution structure of the N-terminal RNP domain of U1A protein: the role of C-terminal residues in structure stability and RNA binding. J. Mol. Biol. 257:398411.[CrossRef][Medline]
7. Oubridge, C., N. Ito, P. R. Evans, C.-H. Teo, and K. Nagai. 1994. Crystal structure at 1.92 Å resolution of the RNA-binding domain of the U1A spliceosomal protein complexed with an RNA hairpin. Nature. 372:432438.[CrossRef][Medline]
8. Williamson, J. R. 2000. Induced fit in RNA-protein recognition. Nat. Struct. Biol. 7:834837.[CrossRef][Medline]
9. Hall, K. B. 1994. Interaction of RNA hairpins with the human U1A N-terminal RNA binding domain. Biochemistry. 33:1007610088.[CrossRef][Medline]
10. Jessen, T. H., C. Oubridge, C. H. Teo, C. Pritchard, and K. Nagai. 1991. Identification of molecular contacts between the U1A small nuclear ribonucleoprotein and U1 RNA. EMBO J. 10:34473456.[Medline]
11. Kranz, J. K., J. Lu, and K. B. Hall. 1996. Contribution of the tyrosines to the structure and function of the human U1A N-terminal RNA binding domain. Protein Sci. 5:15671583.[Abstract]
12. Zeng, Q., and K. B. Hall. 1997. Contribution of the C-terminal tail of U1A RBD1 to RNA recognition and protein stability. RNA. 3:303314.[Abstract]
13. Kranz, J. K., and K. B. Hall. 1998. RNA binding mediates the local cooperativity between the ß-sheet and the C-terminal tail of the human U1A RBD1 protein. J. Mol. Biol. 275:465481.[CrossRef][Medline]
14. Kranz, J. K., and K. B. Hall. 1999. RNA recognition by the human U1A protein is mediated by a network of local cooperative interactions that create the optimal binding surface. J. Mol. Biol. 285:215231.[CrossRef][Medline]
15. Stump, W. T., and K. B. Hall. 1995. Crosslinking of an iodo-uridine-RNA hairpin to a single site on the human U1A N-terminal RNA binding domain. RNA. 1:5563.[Abstract]
16. Showalter, S. A., and K. B. Hall. 2004. Altering the RNA-binding mode of the U1A RBD1 protein. J. Mol. Biol. 335:465480.[CrossRef][Medline]
17. Showalter, S. A., and K. B. Hall. 2002. A functional role for correlated motion in the N-terminal RNA-binding domain of human U1A protein. J. Mol. Biol. 322:533542.[CrossRef][Medline]
18. Prompers, J. J., and R. Brüschweiler. 2001. Reorientational eigenmode dynamics: a combined MD/NMR relaxation analysis method for flexible parts in globular proteins. J. Am. Chem. Soc. 123:73057313.[CrossRef][Medline]
19. Hermann, T., and E. Westhof. 1999. Simulations of the dynamics at an RNA-protein interface. Nat. Struct. Biol. 6:540544.[CrossRef][Medline]
20. Reyes, C. M., and P. A. Kollman. 1999. Molecular dynamics studies of U1A-RNA complexes. RNA. 5:235244.[Abstract]
21. Tang, Y., and L. Nilsson. 1999. Molecular dynamics simulations of the complex between human U1A protein and hairpin II of U1 small nuclear RNA and of free RNA in solution. Biophys. J. 77:12841305.
22. Reyes, C. M., and P. A. Kollman. 2000. Structure and thermodynamics of RNA-protein binding: using molecular dynamics and free energy analyses to calculate the free energies of binding and conformational change. J. Mol. Biol. 297:11451158.[CrossRef][Medline]
23. Pitici, F., D. L. Beveridge, and A. M. Baranger. 2002. Molecular dynamics simulation studies of induced fit and conformational capture in U1A-RNA binding: do molecular substrates code for specificity? Biopolymers. 65:424435.[CrossRef][Medline]
24. Case, D. A., D. A. Pearlman, J. W. Caldwell, T. E. Cheatham III, W. S. Ross, C. Simmerling, T. Darden, K. M. Merz, R. V. Stanton, A. Cheng, J. J. Vincent, M. Crowley, V. Tsui, R. Radmer, Y. Duan, J. Pitera, I. Massova, G. L. Seibel, U. C. Singh, P. Weiner, and P. A. Kollman. 1999. AMBER 6. University of California, San Francisco.
25. Cornell, W. D., P. Cieplak, C. I. Bayly, I. R. Gould, K. M. Merz, D. M. Ferguson, D. C. Spellmeyer, T. Fox, J. W. Caldwell, and P. A. Kollman. 1995. A second generatio