| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
pa
ková 
tefl *
a *
í
poner 
* National Center for Biomolecular Research, Kotlá
ská 2, 611 37 Brno, Czech Republic;
Institute of Biophysics, Academy of Sciences of the Czech Republic and National Center for Biomolecular Research, Královopolská 135, 612 65, Brno, Czech Republic; and
Chemistry Department and Center for Biomolecular Sciences, Bowling Green State University, Bowling Green, OH 43403 USA
Correspondence: Address reprint requests to Ji
í
poner, Institute of Biophysics, Academy of Sciences of the Czech Republic and National Center for Biomolecular Research, Královopolská 135, 612 65, Brno, Czech Republic. E-mail: sponer{at}ibp.cz or Neocles B. Leontis, Chemistry Department, Bowling Green State University, Bowling Green, OH 43403. E-mail: leontis{at}bgnet.bgsu.edu.
| ABSTRACT |
|---|
|
|
|---|
| INTRODUCTION |
|---|
|
|
|---|
|
RNA-RNA and RNA-protein interactions are largely mediated by RNA motifs, defined as ordered, stacked arrays of non-Watson-Crick basepairs (Leontis and Westhof, 1998b
, 2001
). To properly understand the role of non-Watson Crick regions of RNA, it is crucial to study the relation between the base sequence and the RNA structure and the way it is mediated by non-Watson Crick interactions.
Besides the experimental approaches, molecular interactions in RNA molecules can be studied by a variety of advanced molecular modeling tools (Auffinger and Westhof, 1998
, 2001
; Beveridge and McConnell, 2000
; Brandl et al., 2000
; Cheatham and Kollman, 2000
; Cheatham and Young, 2000
; Chin et al., 1999
; Guo et al., 2000
; Hermann et al., 1997
; Hermann and Westhof, 1998
; Lahiri and Nilsson, 2000
; Nagan et al., 1999
, 2000
; Sarzynska et al., 2000
; Schneider and Suhnel, 2000
; Schneider et al., 2001
; Sponer et al., 2001
; Williams and Hall, 1999
; Zacharias, 2000
). This reflects the enormous progress of computer hardware and software in the last decade, during which the actual performance of computers increased by at least three orders of magnitude. Modern computational techniques for simulating macromolecular dynamics complement the information provided by atomic-resolution experiments and bring valuable insights into the relation between molecular structures and energies. Molecular dynamics (MD) simulations with improved treatment of long-range electrostatic forces were introduced several years ago (Cheatham and Kollman, 2000
; Cheatham and Young, 2000
; Essmann et al., 1995
; McConnell and Beveridge, 2000
) and provide more realistic, dynamic views of RNA molecules than previously possible. In this paper we present extended MD investigations carried out with explicit solvent molecules and counterions. Contemporary simulations are still affected by numerous approximations inherent in the empirical forcefields in practical use and the limited nanosecond timescale of the simulations (Cheatham and Kollman, 2000
; Cheatham and Young, 2000
; Sponer et al., 2000b
, 2001
). Nevertheless, modern computer simulations are capable of bringing unique insights into the structure and dynamics of RNA molecules that cannot be obtained by other techniques. Because the presently available molecular modeling forcefields provide a well-balanced description of stacking and H-bonding of nucleic acid bases (Hobza et al., 1997
; Sponer et al., 2001
), MD simulations represent a promising tool to reveal the behavior (stability, dynamics, solvent and ion-binding properties) of non-Watson-Crick interactions in RNA molecules (Csaszar et al., 2001
; Spackova et al., 2000
). The Loop E is a salient example of a uniquely structured non-Watson-Crick motif, as it contains seven consecutive non-Watson-Crick basepairs. Thus, the structure, dynamics, hydration, and cation binding of non-Watson-Crick basepairs comprise the main focus of this investigation. Further, we utilize MD simulations to refine the structure of the spinach chloroplast Loop E, for which there is no atomic resolution structure yet available and which is sufficiently different from bacterial Loop E motifs in sequence but evolutionarily related to it. The initial model for the simulation of the Loop E of chloroplast RNA was proposed based on homology modeling (Leontis and Westhof, 1998a
; 1998b
). Thus, another aim of this study is to test the usefulness of the MD technique in evaluating the dynamics and energetics of molecular models of RNA motifs constructed by phylogenetic analysis and isostericity principles.
| METHODS |
|---|
|
|
|---|
To enhance sampling, some simulations were carried out at an elevated temperature (400 K). The system was gradually heated from 300 K to 400 K during the first 100 ps using NPT conditions (constant pressure ensemble). The production run was then continued at 400 K using either NPT or NVT (constant volume ensemble). There are no clear guidelines whether NVT or NPT simulations should be preferred, and, in fact, both approaches have drawbacks (Zhou et al., 2001
). The pair additive forcefield is not parameterized for elevated temperatures and therefore leads, in the case of the NPT simulation, to reduced water density, increased diffusion, and then regions with low solvent density (bubbles) that might disrupt the structure. The NVT simulations imply high pressure that in certain cases may stabilize the structure (Zhou et al., 2001
). The outcome of the elevated temperature simulations is also limited by the fact that at elevated temperature the system is not represented by a Boltzmann distribution equivalent to prolongation of a room temperature simulation, i.e., the sampling is not equivalent. Nonetheless, elevated temperature simulations provide insights into labile parts of the simulated structures.
As another method to test the stability of the system, we have carried out "no-salt" simulations, in which cations were omitted from the simulation box. The missing counterions were substituted by a net-neutralizing plasma representing a uniform neutralizing charge distribution over the box. This feature is implemented in the AMBER program package for its use with the PME method and guarantees the neutrality of the system (Darden et al., 1998
).
Trajectories were analyzed using the CARNAL module of AMBER, and structures were visualized using VMD (molecular visualization program, http://www.ks.uiuc.edu/Research/vmd/) (Humphrey et al., 1996
). Figures were prepared using VMD, the Chem3D and ChemDraw Pro programs (CambridgeSoft, Cambridge, MA). Hydration and distribution of ions were calculated with the ptraj module of AMBER-6.0 and graphics using UCSF MidasPlus (University of California, San Francisco, CA) (Ferrin et al., 1988
). Systematic monitoring of the solute-to-water distances was carried out using the CARNAL module of AMBER. All direct solute-solvent contacts were detected for the whole trajectory and then analyzed in detail. All cation-binding sites with occupancies higher then 5% were analyzed in detail. Molecular electrostatic potentials were calculated using the program DELPHI (Gilson et al., 1998
), solving the nonlinear Poisson-Boltzmann equation with implicitly represented solvent and ionic strength. Visualization of the potential maps was carried out using the program Insight II (Biosym/MSI, San Diego, CA).
| RESULTS |
|---|
|
|
|---|
|
The crystal structure reveals ordered water molecules in the A-stack regions and a kink of the backbone at the adenine residue of the trans Watson-Crick/Hoogsteen U/A basepairs of each submotif. There are five hexacoordinated Mg2+ cations bound to the major (deep) groove of the Loop E crystal structure, interacting with bases and anionic phosphate oxygens. Three of these cations directly interact with the RNA atoms (inner-shell binding), and the remaining two interact with the solute molecule via water molecules (outer-shell binding). Two of the Mg2+ ions in the crystal are closely spaced (2.7 Å) and may represent a single cation with two alternating binding sites (Hermann and Patel, 1999
). Nevertheless, when carrying out simulations, we initially included all ions according to the crystal data, with the expectation that the simulation would redistribute them and release excessive ions (Hermann et al., 1998
). We initially carried out three simulations of this molecule (Table 1): Simulation EbacMg (9.7 ns, all five "crystal" Mg2+ cations present together with 10 additional Na+ to achieve charge neutralization), simulation EbacNa1 (8.8 ns), and simulation EbacNa2 (8.8 ns). In simulations EbacNa1 and EbacNa2, the Mg2+ cations were removed and the molecules were neutralized solely by Na+ ions. For the EbacNa1 and EbacMg simulations the xleap module of AMBER was used to place the Na+ cations around the solute molecule at positions of lowest electrostatic potential. For simulation EbacNa2, the Na+ cations were initially shifted deep into the solvent, at least 5 Å away any solute atom, to produce an entirely different initial Na+ distribution compared with EbacNa1.
Spinach chloroplast Loop E
The other molecule studied, Echl, differs from Ebac at three basepairs (Fig. 1, right panel). The sequence is that of the Loop E region of spinach chloroplast 5S rRNA. It contains five consecutive adenines in one of its strands and thus appears very different, at least superficially, from the Loop E of evolutionarily related bacteria. No atomic resolution experimental structure is available for this molecule. However, on the basis of careful considerations of isostericity, we proposed that the structure of Loop E of spinach chloroplast should be similar to that of bacterial Loop E. Thus, the corresponding bases should form the same type of non-Watson-Crick basepairs. Therefore we paired C74 and A102 in the chloroplast sequence in a manner that is isosteric to bifurcated U74/G102 in the crystal structure (URL064) and likewise A76 with A100 in the same bifurcated geometry observed for G76/G100 in the crystal structure (Leontis and Westhof, 1998a
). A78 and A98 were paired to be isosteric to trans Hoogsteen/Sugar-edge A78/G98 (Figs. 1 and 2). trans Hoogsteen/Sugar-edge A/A isosteric to A/G is known from several crystal structures (Ban et al., 2000
; Carter et al., 2000
; Cate et al., 1996
; Harms et al., 2001
; Ramakrishnan and Moore, 2000
; Schluenzen et al., 2000
; Wimberly et al., 2000
). This procedure defined the initial geometry of the altered basepairs of chloroplast Loop E, which are shown next to the corresponding E. coli basepairs in Fig. 2. Two simulations, designated EchlMg (10 ns) and EchlNa (7.4 ns), were carried out. In the first simulation, five Mg2+ cations were included and in the second only neutralizing Na+ ions were utilized, initially positioned at the most negative sites around the molecule. For chloroplast Loop E simulations, the starting positions of Mg2+ were taken from the bacterial Loop E crystal structure.
|
5 ns, as will be described in detail below.
|
4 Å (groove width is defined using four P-P distances; see Table 2).
|
4.5 ns, the deep groove, measured from P75 to P98, widens very substantially, after a considerable Mg2+ relocation event. The P75P98 distance temporarily increased to 16 Å, and this widening was only partly repaired by the end of the simulation. We will comment on this development in detail below. During simulation EchlNa, carried out in the absence of Mg2+ cations, the deep groove narrowed significantly relative to the crystal structure (Table 2), in contrast to developments in the two EbacNa simulations. Moreover, the groove width developed an asymmetry that is related to the H-bonding of some of the mutated basepairs (see below). This narrowing is accompanied by the tight binding of three adjacent Na+ cations in the deep groove (see below).
To provide additional structural information, the Supplementary Material section of this article presents PDB files of averaged simulated structures for key simulations and selected helical parameters (Fig. S4) calculated via the 3DNA code (Lu et al., 2000
). Definition of structural parameters for non-Watson-Crick segments is, however, less straightforward compared with regular Watson-Crick segments, and there is no consensus yet on how to treat the parameters of steps with noncanonical pairs consistently. Nevertheless, the structural data clearly illustrate substantial variability of the helical twist along the Loop E, especially in the A-stack regions, and a clear structural similarity between the bacterial and chloroplast Loop E molecules.
To highlight the dynamics of the molecules, we carried out a principal component analysis (PCA) of the simulated trajectories. PCA identifies the main collective deformation modes of the simulated molecules (Sherer et al., 1999
; Wlodek et al., 1997
).
The leading PCA modes are depicted and commented on in the Supplementary Material. PCA analysis highlights substantial oscillations of the interphosphate distances across the deep groove as described in the main text. In addition, PCA indicates that these deep groove width changes are correlated with bending motions between the central non-Watson-Crick and flanking canonical segments (Figs. S1S3). The bending is primarily localized to the outermost A104/G72 and G98/A78 sheared basepairs of the non-Watson-Crick segment, i.e.; the A-stacks. This motion in fact resembles the dynamics of DNA zipper motifs (Spackova et al., 2000
) where dynamical bending occurs at the junction between the central zipper and flanking Watson-Crick segments and is localized exclusively to the sheared G/A basepairs. The sheared G/A basepair is intrinsically very flexible (Spackova et al., 2000
; Sponer et al., 1996
), and the high degree of its stacking with adjacent basepairs (Chou and Chin, 2001
) likely helps to mediate the intersegment bending motion. Nevertheless, the eigenvalues of the leading PCA components of the present Loop E simulations are quite small (around 0.2) indicating that there is no single dominant mode of the motion. In fact, when visually inspecting the trajectories (see Supplementary Material), we have clearly seen the groove width oscillations. However, the coupled bending motions suggested by the filtered PCA trajectories were almost unnoticeable in real trajectories.
Basepairing, water bridges, and ordered hydration in bacterial Loop E
The standard basepairs were remarkably stable in all simulations, except that the terminal basepairs showed somewhat increased fluctuations, as is usual in simulations. Thus standard basepairs are not discussed in the following.
The Ebac crystal structure contains the following seven non-Watson-crick basepairs (Fig. 1), forming two submotifs (see above): A104/G72 and A78/G98 form trans Hoogsteen/Sugar-edge (sheared) basepairs with A(N7)-G(N2) and A(N6)-G(N3) H-bonds. U103/A73 and U77/A99 form trans Watson-Crick/Hoogsteen basepairs with U(O2)-A(N6) and U(N3)-A(N7) H-bonds. G102/U74 differs from the normal wobble arrangement by forming a cis bifurcated interaction that is isosteric to G76/G100. U74(O4) H-bonds with N1 and N2 of G102, and in the crystal structure, water molecules appear to bridge between G(O6) and U(C5) and between G(N2) and U(N3). In G76/G100, G100 is equivalent to U74. G100(O6) forms bifurcated H-bonds to G76(N1) and G76(N2) and water molecules bridge between G76(O6) and G100(N7). Another water bridge occurs between G100(N2) and G100(N1) in the crystal structure. The G75/A101 basepair is a water inserted cis Watson-Crick pair involving a single base-to-base H-bond, A(N6)-G(O6), and a water bridging between A(N1) and G(N1) (Fig. 2).
In the 9.7-ns magnesium-containing simulation, EbacMg, the non-Watson-Crick basepairs were remarkably stable. In agreement with the crystal data, water bridges in the G75/A101 and G76/G100 basepairs (Fig. 4, a and b) are stable during the simulation. These hydration sites show close to 100% occupancies and prolonged residency times of individual water molecules ranging between 0.5 and 0.8 ns compared with residency times of 0.05 to 0.4 ns for ordinary hydration sites (Auffinger and Westhof, 2001
; Csaszar et al., 2001
; Feig and Pettitt, 1998
, 1999
; Nagan et al., 1999
; Schneider et al., 2001
; Spackova et al., 2000
).
|
0.05 ns. In fact, the water molecule seen in the crystal cannot be oriented correctly to H-bond simultaneously with either of the hydrogen atoms of the G102(N2) amino group and U74(N1). However, a complex five-center hydration pocket centered around G102(O6) is identified in the deep (major) groove. We found a number of long-residing water molecules in this region of the RNA with residency times up to 4 ns (Fig. 4 c). Typically, two long-residing water molecules are bound to the primary G102(O6) hydration position simultaneously. The hydration of G102(O6) was sometimes interrupted for periods of approximately 0.050.1 ns, during which individual water molecules moved as far as 6 Å away from G102(O6) before returning again. While a water molecule remained bound to G102(O6), it typically alternated among four secondary binding sites, namely A73(O5'), A73(O2P), U103(O4), and U74(C5), with movements occurring on a timescale of 0.5 to 0.8 ns. Additional water molecules were often immobilized in the area up to 3 to 6 Å away from G102(O6), as shown in Fig. 4 c. Electron density corresponding to this hydration site is visible in the crystal, but it was not assumed to be of primary importance (Correll et al., 1997Major hydration sites were seen in the upper A-stack (Fig. 5 a). The first one, as in the crystal structure, is between G72(N2) and A104(O2P) with residency times of 0.50.8 ns. At the other edge of G72, atoms G72(O6), G102(O2P), and U103(O2P) form a three-center hydration pocket with occupancy close to 100% and residency times up to 2 ns. We did not identify a primary binding site in this case and, in fact, two long residing water molecules typically occupy the pocket simultaneously. For example, we observed a single water molecule bound for 2 ns between G72(O6) and G102(O2P) with a short 0.3 ns switch to U103(O2P). During the same period U103(O2P) was hydrated by another water molecule with a partial binding to G72(O6). In the other A-stack, a water molecule was localized between N1 and N2 of G98, with residency times of 0.50.8 ns (Fig. 5 c). In this case, Mg2+ binding prevented hydration of G98(O6).
|
70%). Furthermore, this hydration site does not substitute for a conventional H-bond between U77(O2) and A99(N6). In fact, A99(N6) does not have properly oriented hydrogens for optimal water binding. Thus it is not a water-mediated basepair. A similar site (out of the baseplanes) developed in the symmetry-related A73/U103 basepair.
All basepairs remained essentially planar in the Ebac simulations, with the exception of the trans Watson-Crick/Hoogsteen pairs, U103/A73 and U77/A99. These two basepairs showed a moderate tendency to buckle with the basepair centers pointed toward the adjacent (sheared) G/A basepairs, which is likely due to an adjustment of base stacking. Some nonplanarity is seen also in the crystal structure, especially for the lower A-stack. Averaged structures of both G/A sheared basepairs, however, are planar. Note that sheared G/A basepairs are intrinsically nonplanar (in complete isolation), though with two symmetry-related minima that may result in planarity of the time-averaged structures (Spackova et al., 2000
; Sponer and Hobza, 1994b
; Sponer et al., 1996
). Thus, the marked nonplanarity of this basepair observed in DNA structures (Chou and Chin 2001
; Shepard et al., 1998
; Spackova et al., 2000
) was not seen either in the Loop E crystal structures or our simulations, which reproduced the crystal structures.
In the Supplementary Material section, we present tables (Table S2, a and b) with a complete set of averaged basepair H-bond lengths and their fluctuations calculated for the primary simulations. It is to be noted that except for the basepair opening events discussed in the text, all other basepairs showed only modest fluctuations, essentially similar to those reported, for example, in our preceding study of the beet western yellow virus (BWYV) frameshifting pseudoknot (Csaszar et al., 2001
).
Bacterial Loop E in the presence of Na+
Removal of the Mg2+ cations (EbacNa1 and EbacNa2 simulations) did not affect the water bridges in the G75/A101 and G76/G100 basepairs, which remained intact. Conversely, no water bridge developed between G102(N2) and U74(N3), and only weak hydration remained between U77(O2)-A99(N6). The major hydration pocket around G102(O6) was substantially modified, however, due to a marked widening of the major (deep) groove (Table 2). For example, the A73(O2P)-G102(O6) distance increased from
6.0 Å to
8.0 Å, thus opening the hydration pocket. The region around G102(O6) remained hydrated in the absence of magnesium ions, but residency times of individual water molecules decreased to 0.10.4 ns, i.e., by an order of magnitude. In the upper A-stack, the 72(N2)-104(O2P) distance increased from 5.05 ± 0.47 Å to 5.47 ± 0.5 Å upon removal of magnesium (simulations EbacNa1 and EbacNa2, compared to simulation EbacMg). Then the water molecule originally bridging 72(N2) and 104(O2P) shifted closer to 104(O2P), providing space for an additional water molecule between 72(N1) and 72(N2) (very similar to that shown in Fig. 5 b). The residency times for both hydration sites are 0.50.8 ns. The tight binding 72(O6)-102(O2P)-103(O2P) hydration pocket disappeared.
Removal of the Mg2+ cations resulted in a minor increase in basepair H-bond fluctuations. The U77/A99 basepair exhibited two major opening events (at 2.64.3 ns and 4.87.7 ns) in the EbacNa1 simulation (Fig. 3 c). During these opening events, the U(N3)-A(N7) H-bond was disrupted while the N6-O2 H-bond remained intact. A water bridge formed between U77(N3)-A99(N7) which evidently stabilized the open state of this pair (Banavali and MacKerell, 2002
; Cubero et al., 2000
; Varnai and Lavery, 2002
). Such opening events were not seen for the symmetry-related A73-U103 basepair or in the EbacNa2 and EbacMg simulations and are reminiscent of transient basepairing opening events we observed in nanosecond simulations of a frameshifting pseudoknot (Csaszar et al., 2001
).
Noncanonical basepairing, water bridges, and ordered hydration in spinach Loop E
To set up the spinach chloroplast simulations, we replaced three non-Watson-Crick basepairs in the crystal structure of bacterial Loop E with isosteric equivalents suggested by molecular modeling (Leontis and Westhof, 1998b
). As described above, the cis bifurcated C74/A102 basepair was modeled isosteric to U74/G102 and involves a contact between C74(N4) and A102(N1). Both the EchlMg and EchlNa simulations refined this geometry in such a way that the A102(N1)-C74(N4) distance stabilized at 3.1 Å, with an almost linear hydrogen bond (Fig. 6 a). The simulated basepair further shows a repulsive contact between the C74(C5-H5) and A102(H6-N6) groups with a CN distance of 4.1 to 4.2 Å.
|
3.4 Å and 2.60 Å, respectively. The basepair was firmly established around 2 ns into the simulation. The basepair was further stabilized by shortening of the A98(N6) to U77(O2P) distance from 5.6 Å at the beginning of the simulation to 2.9 Å to form an H-bond with N-H...O angle of (135150)° and H ... O distance of
1.9 Å. This altered A-stack was further stabilized by ordered hydration. A major single-water hydration site developed between G79(N7) and A78(O2P) in the EchlMg simulation, partly involving also A78(N7) (not shown). A related hydration site developed in the EchlNa simulation between A98(N1) and U77(O5') (Fig. 5 d). Both sites exhibited water residency times
2 ns and were established once the basepairs stopped fluctuating after
1.52 ns.
Stabilization of the cis bifurcated A76/A100 basepair (corresponding to G76/G100 in the bacterial sequence) was also expected to occur via a bifurcated A100(N6)-A76(N1) contact (Fig. 2). However, the A100-A76 basepair failed to form a stabilizing H-bond in the presence of Mg2+ (Fig. 6 c). During the first 5 ns of the simulation, the A100(N6)-A76(N1) distance was
3.4 Å with the H62-N1 distance
2.6 Å. The A100(N7)-A76(N6) distance was
5.2 Å, and no water bridge was observed stabilizing the basepair. The basepair fluctuated and showed one major opening event correlated with a release of one of the bound Mg2+ cations into the solvent at 4.3 ns (see below). Shortly after release of the Mg2+ cation the part of the structure around ill-paired A76/A100 considerably widened. The deep groove locally reached a width of 16 Å with marked perturbation of the structure and then partly relaxed (see above) (Fig. 3 d). The A100(N6)-A76(N1) and A100(N7)-A76(N6) distances increased to
4.1 Å and 6.6 Å, respectively, after these changes. The A76/A100 basepairs adopted a considerably better geometry in the simulation EchlNa (Fig. 6 d), with the formation of an H-bond between A76(N1) and A100(N6). The A76(N6) and A100(N7) atoms also approach each other but stop short of completing a full H-bond with N ... N distance of 3.6 Å. Rather surprisingly, no water mediation was seen. Forming the A100/A76 basepairs leads to a marked compression of the deep groove in this region of the molecule (Table 2).
To demonstrate that the A76/A100 basepair in the EchlMg simulation is not bound, we calculated the base-base interaction energy term for the geometry seen in the first part of the trajectory using the high quality quantum-chemical MP2/aug-cc-pVTZ method (Sponer and Hobza, 2000
). The calculated base-base interaction energy was only -3.7 kcal/mol. Note that typical base-base H-bonding interaction energies range from -10 to -28 kcal/mol (Sponer et al., 2001
). For comparison, complete optimization of this A-A basepair, obtained by removing all intermolecular constraints, leads to an interbase attraction of -11.8 kcal/mol and two full H-bonds, N1-N6 and N6-N7, corresponding to the trans Watson-Crick/Hoogsteen basepairing geometry. In fact, for the averaged A100/A76 basepair obtained from the EchlNa simulation we calculated an interbase interaction energy of -10.4 kcal/mol, not far from the optimal basepairing value.
The remaining basepairs of chloroplast Loop E are identical to the bacterial version and showed very similar behavior during the simulations. We observed the expected true water bridge between A101(N1) and G75(N1) and fluctuating water contacting U77(O2) and A99(N6). The EchlMg simulation revealed two hydration sites with full occupancy at G72: the G72(N2)-A104(O2P) bridge with water residency times of 1 to 2 ns, and the G72(O6)-A102(O2P)-U103(O2P) three-center pocket with water residency times of 0.4 to 3.5 ns. The hydration structure is fairly similar to that seen for the EbacMg simulation. The G72(O6)-A102(O2P) distance increased from
4.2 Å to 5.2 Å in the EchlNa simulation, accompanied by a weak (
50% occupancy) G72(N2)-A104(O2P) hydration and fully occupied site G72(N2)-G72(N1) with residency times 0.10.4 ns.
All basepairs in the Echl simulations were essentially planar except for A73/U103 and U77/A99, which buckled exactly as in the Ebac simulations. In addition, buckling developed in the mutated trans Hoogsteen/Sugar-edge (sheared) A78/A98 basepair, in the EchlNa but not in the EchlMg simulation. The A78/A98 basepair buckled in the opposite direction to U77/A99. Note that the local variation of the A98 base position in the two simulations was also reflected by differences in ordered hydration sites stabilizing this region (see above).
In the initial homology analysis of the chloroplast Loop E molecule, we have also considered the option that some of the substituted bases can lead to formation of protonated basepairs (Bink et al., 2002
; Jang et al., 1998
; Pan et al., 1998
). We found that no protonation is possible of A(N1), A(N3), or C(N3) while keeping the overall basepair arrangements similar or isosteric with the bacterial Loop E. As evident from Figs. 2 and 6, protonation of A76(N1) would impair any isosteric basepairing whereas protonation of A100(N1) would occur outside the pairing region. In the case of the A98/A78 basepair, all A(N1) and A(N3) positions are completely away from the basepairing region necessary for isosteric pairing. Finally, in case of the A102/C74 basepair, protonation of A102 would impair the basepairing while C74(N3) does not participate in the basepairing. In the modeled basepair C74/A102, the non-Watson-Crick geometry is a bifurcated one, and stable H-bonding is established without protonation of A or C. Formation of the protonated basepair would require flipping of the cytosine base around its glycosidic bond to unlikely syn conformation and its additional in-plane rotation, thus completely changing the local geometry. In other words, all protonations would change the shape of the molecule, and this likely is not compatible with maintaining of its function. We have no need to consider any base protonation as we found very plausible isosteric basepairing in all three substituted sites with canonical bases. We ultimately cannot rule out (based on the simulations) that a protonation in chloroplast Loop E occurs. However, we consider this option as very remote, and the shape of the molecule would not be conserved then. That would contradict the basic conclusion from previous phylogenetic analysis suggesting that essentially only isosteric substitutions are viable. In fact, in the C74/A102 basepair simple flipping of the cytosine base around its glycosidic bond would lead to a stable basepairing without the protonation, keeping the C(N4)-A(N1) H-bond and establishing a new one between A(N6) and C(N3). However, syn orientation of cytosine is very unlikely and also would lead to a complete interchange of the pyrimidine exocyclic groups (compared to U74) protruding into the grooves. We would like to specifically note that the wobble C/A(+) protonated basepair (Bink et al., 2002
; Jang et al., 1998
; Pan et al., 1998
) is distinctly different from the bifurcated geometry we modeled as isosteric to the G/U bifurcated basepair of bacterial Loop E, and thus is not suitable for the Loop E segment. Although the isosteric modeling does not suggest adenine protonation, it is entirely possible that some of the adenines may be protonated at lower pH, especially those sites not involved in basepairing. The negative potential around the molecule may increase the pKa values for adenine protonation. Actually, increased pKa values (4.85.8) of eight adenines participating in the 6 bp non-Watson-Crick internal loop of domain B of the hairpin ribozyme have been recently reported. Nevertheless, even in this case no protonated basepairs are seen (Ravindranathan et al., 2000
). As we have shown elsewhere (Leontis and Westhof, 2001
), any two bases can basepair stably in RNA, utilizing a number of different geometries. For C/A or A/A basepairs, a small number of these geometries require protonation, but most do not.
Dynamics of Mg2+ cations
The starting structure in simulation EbacMg contained five Mg2+ cations, those found in the crystal structure, URL064 (Fig. 7). Mg1 has water mediated contacts to G106(O6), G105(O6), and G105(N7) and is entirely stable in the simulation. Mg2 forms an inner-shell contact to A101(O2P) while Mg3 forms an inner-shell contact with G100(O2P). The crystal distance between these two Mg2+ ions is only 2.7 Å. Not surprisingly, these cations separated during the simulation, to a distance of 6.4 Å, though they remained coordinated to the same phosphates (Hermann and Patel, 1999
). This exerted some effect on the phosphate positions, initially producing an asymmetry of the interphosphate distances compared to the crystal, which disappeared, however, by the end of the simulation (see above). Mg4 is initially bound through water molecules to G75(N7), G76(O6), U74(O2P), G75(O2P), and A99(O2P). It temporarily lost contact with G75(N7) and G76(O6) between 8.8 and 9.3 ns and moved a little toward the solvent, but then it returned to its original position. Its binding to A99(O2P), U74(O2P), and G75(O2P) was not affected by this event. The return of the cation could still be due to the nanosecond timescale of the simulations. Nevertheless, the fluctuation could indicate that this cation is relatively less stable compared to the other magnesium binding positions. The cation has enough space to move along and away from the solute. Since we apply periodic boundary conditions, the cation can leave the primary box entirely; in addition, a 10-Å extension of the solvent box beyond the RNA molecule in all directions is sufficient for large-scale movement of the cation within the box. This was the largest fluctuation seen for Mg2+ cations in the 9.7 ns EbacMg simulation. Mg5 is directly bound to G98(O6) and is entirely stable over the course of the simulation.
|
During the simulations, no exchange was observed of the water molecules in the first hydration spheres of any of the magnesium ions. This is entirely consistent with the fact that, for Mg2+, mean residence times of water in the first hydration shell are
1 microsecond (Burgess, 1988
).
Extensive binding of structural Na+ cations
As in our preceding studies (Spackova et al., 2000
; Csaszar et al., 2001
), all occupancies and residency times reported here mean single site (atom) inner-shell Na+ binding events (with binding distances of
2.4 Å) while obviously neglecting subpicosecond fluctuations. It should be noted that some authors use less strict criteria to identify Na+ binding, including multiple-site or outer-shell binding and enhancing substantially their calculated occupancies. The concentration of cations in all our simulations is
0.15 M. We saw few Na+ binding events in the Mg2+ containing simulations, where 5 Mg2+ and 10 Na+ cations were present. In the simulation EchlMg, the best developed position of Na+ binding was detected at N1 of mutated A98 with a surprisingly high occupancy of 48%. Other contacts occurred at various O2P and O6 sites, such as G79(O6) 47% and A78 (O2P) 38%.
More extensive binding of Na+ cations occurred in the absence of Mg2+. The seven most prominent sodium binding sites seen during the first 5 ns of simulations EbacNa1 and EbacNa2 are summarized in Table 3. The EbacNa1 and EbacNa2 simulations resulted in similar distributions of bound Na+ despite the markedly different initial distributions of Na+ ions, as described above. Na+ ions show large mobility, some of them visiting 5 or 6 different solute atoms in simulation (Fig. 8).
|
|
|
= 180° (usually
= 60°) between nucleotides G72/A73 and G98/A99. An
30° difference occurred for the
angle (
= -100° in the A-stack whereas the common value is
= -70°). Both A-stacks cause compression of the major groove progressing in the 3' direction and widening toward the 5' end as measured with respect to the A-stack.
Electrostatic potential calculations
Molecular electrostatic potential calculations were carried out on the RNA by solving the nonlinear Poisson equation in the absence of salt. Inclusion of salt in the calculations reduces the depth of all minima in the potential energy, but their position and shape are essentially unchanged and therefore these results are not shown. For a more detailed analysis of the electrostatic potentials in RNA molecules see Chin et al., 1999
and Hermann and Westhof, 1998
.
The most negative sites around the crystal geometry of Ebac occur in the deep (major) groove, as expected. The first negative peak forms at -21.6 kT with a broad negative region occurring at a depth of -20 kT (see Fig. 7 above). The Loop E deep groove has a substantially more negative electrostatic potential than that of the BWYV frameshifting pseudoknot, where the deepest minimum occurred at -17.5 kT, again using no salt conditions (Csaszar et al., 2001
). The (hydrated) Mg2+ cations are sterically restricted to regions a little outside the most negative region, which is considerably deeper inside the groove. The movements of the Mg2+ cations inside the groove may also be restricted by electrostatic repulsion between proximal cations. The calculated potential was not found to be significantly different for the averaged structure obtained from EbacMg simulation, which exhibits three negative sites at a contour level of -20 kT to -20.5 kT that spread out into a wide negative region at -19 kT. One negative peak occurs at G102(N7), close to the Mg2 binding site. The most negative site is around G75(O6) which is close to the Mg4 site between atoms G75(N7) and G76(N7). These atoms are more accessible for Mg2+ binding than G75(O6). The third deep well occurs close to G100(N7). All of these negative peaks are deeper in the groove compared with the actual positions of the Mg2+ ions. Our results appear to well correlate with a Brownian-dynamics study by Hermann and Westhof (Hermann and Westhof 1998
). In the simulations EbacNa, the most negative peak is shifted to -17.5 kT due to major groove widening.
For the starting structure for the chloroplast Loop E simulation we calculated the electrostatic potential minimum to be -17.3 kT in the major groove. The negative potential peak deepened in the course of the EchlMg simulation to -19.6 kT. There are again three negative sites visible at contour level -19 kT merging into a wide region at -18 kT contour level. The most negative site is around A102(N7), followed by a site near G100(N7). The less negative potentials compared to the EbacMg structure are due to the substitution of guanine by adenine in Echl. The most negative potential deepens to -22.3 kT in the simulation EchlNa. This is due to structural adjustments of the deep groove caused by formation of the Na+ spine (see above), leading to the most negative potential seen in all simulations. The deepest potential well occurs at A76(N7) and coincides with one of the Na+ binding sites. Other Na+ cations are also located in salient negative potential locations.
Simulations carried out at elevated temperatures
For a further assessment of the stabilities of Loop E motifs, we carried out three simulations at elevated temperature (see Table 1). During the 2.5 ns 400 K constant pressure simulation of the Ebac molecule carried out in the presence of Mg2+ ions, one terminal basepair was disrupted by 2 ns, perturbing two adjacent basepairs. All Mg2+ cations remained bound to the solute molecule. Similar perturbations at the other end of the molecule occurred during the 3 ns 400 K constant pressure simulation of Echl carried out in the presence of Mg2+.
During the 400 K constant volume simulation, also carried out in presence of Mg2+, little perturbation of the Echl system was observed during the first 2 ns. However, after 2 ns, a large instability in one canonically paired segment propagated into the non-Watson-Crick basepaired region, and by 5 ns essentially all basepairs were disrupted or perturbed, although the strands remained in contact. As all the instabilities were initiated at the terminal segments we could not determine the weakest basepairs within the non-Watson-Crick motifs.
Unfolding the Loop E in absence of salt
We also carried out a 10-ns simulation of Ebac under no-salt conditions with only explicit solvent molecules screening the phosphate repulsion (no added cations, see Methods). We observed a rapid widening of the major groove to
15 Å within the first nanosecond. The final major groove width of 2022 Å was achieved by approximately 2 ns and was followed by no further changes. A number of basepairs adopted altered and more open geometries. In particular, the trans Watson-Crick/Hoogsteen U/A pairs transform into trans Sugar-edge/Hoogsteen (sheared) U/A pairs in which A(N6) is H-bonded to U(O2) and U(O2') (see Fig. S5 in Supplementary Material). The bifurcated pairs also adopt geometries in the sheared family of geometries. Interestingly, all bases remain paired to their original partner until the end of the simulation. This demonstrates that Loop E motifs exhibit a structural resilience based on flexibility in the basepairing geometries that allows for a wide range of major groove widths. This result is also interesting in regard to possible pathways for unfolding of RNA molecules that contain such motifs.
The deep (major) groove of the Echl molecule also showed immediate widening during the 5 ns no-salt simulation we carried out. However, the basepairing and the A-stacks were largely disrupted/deformed. This may be partly attributed to the fact that the starting geometry for the chloroplast structure was not optimized, but likely also reflects the inherently weaker basepairing of the chloroplast Loop E sequence.
The purpose of carrying out the no-salt simulations was to test the flexibility of the molecule and to increase the rate of sampling of alternative geometries. Although the no-salt simulation might appear at first sight far-fetched or unrealistic, such simulations are justified as equally well as the more common simulations carried out at elevated temperature. Although the no-salt environment is not realistic, the solute molecule remains controlled by its internal forcefield and enveloped by solvent waters that continue to provide significant electrostatic screening and full solvent friction. Thus, the deformations are not uncontrolled distortions, and the molecule moves through a low free energy valley as defined by the forcefield. In addition, although true no-salt situation is only an idealized, limiting case, low-salt conditions are not unrealistic and the no-salt simulation actually should identify (in the initial stages) the path of structural changes upon reducing the electrostatic screening. Actually, in this study, we obtained more insights from the no-salt simulations than from the elevated temperature simulations.
Refolding the molecule by adding salt back
We selected the widened Ebac structure obtained after 1 ns of simulation in the absence of salt as the starting coordinates for a new simulation to test refolding induced by addition of 20 Na+ cations. The ions were added via the Leap module. Neutralization with sodium ions alone, however, did not initiate any refolding of the molecule during 3.5 ns, and the major groove remained as wide as 15 Å. When this was repeated in a subsequent simulation using 10 Na+ and 5 Mg2+ cations to neutralize the RNA, gradual major groove compression was achieved (cf. Fig. 3 a above). By the end of this simulation, the structure displayed the very narrow deep groove architecture previously seen only for Echl, narrower, in fact, than in any preceding simulation with Ebac. Most significantly, all the non-Watson-Crick basepairs returned to their crystallographic geometries. The Mg2+ cations were initially positioned several angstroms from the solute. Nonetheless, during the preequilibration minimization one of the Mg2+ ions established inner-shell binding with G75(O2P). (Such mobility during initial relaxation is never seen for Na+ cations.) This Mg2+ cation remained bound to this position during the entire 4-ns simulation. About 1 ns into the simulation, a hexahydrated Mg2+ bound to A73(O2P) and G100(O2P), pulling the two strands closer to each other. The cation was expelled back into the solvent at 3.5 ns, seemingly because the groove was already too narrow for its stable binding.
We used the last snapshot of the preceding simulation as the starting geometry for a new simulation in which all Mg2+ cations were removed and the solute was neutralized exclusively by Na+. After
2 ns of simulation the groove widened once again and the final outcome of the 3.5-ns simulation was very similar to simulations EbacNa1 and EbacNa2. Thus, the large-scale deformation induced by the no-salt simulation was completely repaired.
| DISCUSSION |
|---|
|
|
|---|
Cation binding and major groove width
The major groove of bacterial Loop E (Ebac) shows a broad and deep minimum of electrostatic potential, which helps to understand the extensive binding of Mg2+ cations seen in the crystal structures (Correll et al., 1997
; Hermann and Westhof, 1998
; Lu and Steitz, 2000
). The Mg2+ cations do not, however, bind directly to the most negative sites, as these are too deeply buried inside the groove. The chloroplast Loop E (Echl) shows, in the proposed Ebac crystal-like geometry, somewhat less negative electrostatic potential due to substitution of several guanines by adenines. Our work shows that the magnitude of the negative electrostatic potential in the major groove of Loop E is clearly regulated by the groove width. Narrowing of the groove substantially strengthens the negative potential, which is further enhanced by local geometry adjustments induced by tight cation binding (e.g., chloroplast Loop E in absence of Mg2+). When comparing structures with different deep groove widths, we did not find any salient geometrical parameter regulating the deep groove width, except for rather modest changes of backbone angles and, in the case of the no-salt simulations, changes in basepairing. Reversible changes of basepairing allow the maintenance of stable pairing in structures with extreme widening of the deep groove, as demonstrated by the no-salt simulations. We did not see larger changes in the shallow (minor) groove width.
Our simulation timescale does not guarantee a complete sampling. We thermally sampled within a range of a few kcal/mol of free energies in 10 ns as an ideal two-state system surmounts
5 kcal/mol each 11.5 ns (unless a highly improbable sequence of events occurs). Nevertheless, some conclusions can be drawn. Full binding of the Mg2+ cations stabilizes the major groove width at the crystal (intermediate) value and rigidifies the bacterial Loop E motif. In the absence of Mg2+, Loop E motifs show unprecedented degrees of inner-shell binding of Na+ cations and can adopt a wide range of major groove widths, depending on the base sequence and detailed balance of the counterion distribution. The Na+ cations are capable of penetrating into the most negative regions inside the major groove, where multiple binding sites display inner-shell occupancies above 50% (Figs. 8 and 9, Table 3). In simulations of Echl, we observed a narrowing of the major groove that essentially collapsed around a string of Na+ cations bound almost permanently on the simulation timescale. We are not aware of any preceding MD study showing such a high Na+ coordination and large structural effect on nucleic acid geometry, except for simulations carried out for guanine quadruplex molecules (Spackova et al., 1999
, 2001
). One simulation suggests that a partial occupancy of the major groove by Mg2+ may support major groove compression in the case of Ebac. Nevertheless, the Ebac simulations carried out in the absence of Mg2+ typically widen the deep groove though there is extensive Na+ binding in the deep groove. The base changes around the lower A-stack of the Echl sequence clearly support (nonsymmetrical) narrowing of the deep groove compared to the Ebac sequence.
When assessing the significance of modeling of cation binding, one has to consider two limitations of current techniques. First, the pair-additive forcefield provides only an approximate description of the cation interactions, neglecting substantial polarization and charge transfer effects (Haider et al., 2002
; Sponer et al., 2000a
, 2000b
). These include strong polarization of the first-shell water molecules around Mg2+ and polarization strengthening of certain basepairs (Sponer et al., 2000b
). The neglected polarization effects should influence the binding strength and especially the balance of interactions between Mg2+, solvent, and various solute groups (Sponer et al., 2000b
). The simulations nevertheless accurately represent the electrostatic effects, which, for the present system and timescale, appear to be the dominating contributions. In the case of the Na+ cations, we suggest that the outcomes of simulations are considerably less affected by forcefield approximations.
The second limitation is the nanosecond timescale of the simulations. The timescale is not sufficient to achieve a thorough sampling of the motions of the magnesium hexahydrates. Therefore, when making such simulations, we are largely restricted to investigate the structural effect of Mg2+ in the experimentally determined sites and coordination patterns. We cannot even study exchange between inner- and outer-shell binding. As noted above, the typical timescale for an exchange of ligands in the first coordination shell of magnesium is around 1 microsecond. Nonetheless, the simulations appear to reproduce the structural effects of the Mg2+ cations bound in the known x-ray geometries of the bacterial Loop E. More complicated is the evaluation of the chloroplast