help button home button Biophys. J.
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS

This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Matsumoto, A.
Right arrow Articles by Olson, W. K.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Matsumoto, A.
Right arrow Articles by Olson, W. K.

Biophys J, July 2002, p. 22-41, Vol. 83, No. 1

Sequence-Dependent Motions of DNA: A Normal Mode Analysis at the Base-Pair Level

Atsushi Matsumoto and Wilma K. Olson

Department of Chemistry, Rutgers, the State University of New Jersey, Wright-Rieman Laboratories, Piscataway, New Jersey 08854-8087 USA


    ABSTRACT
TOP
ABSTRACT
INTRODUCTION
METHODS
RESULTS AND DISCUSSION
CONCLUSIONS
GLOSSARY
REFERENCES

Computer simulation of the dynamic structure of DNA can be carried out at various levels of resolution. Detailed high resolution information about the motions of DNA is typically collected for the atoms in a few turns of double helix. At low resolution, by contrast, the sequence-dependence features of DNA are usually neglected and molecules with thousands of base pairs are treated as ideal elastic rods. The present normal mode analysis of DNA in terms of six base-pair "step" parameters per chain residue addresses the dynamic structure of the double helix at intermediate resolution, i.e., the mesoscopic level of a few hundred base pairs. Sequence-dependent effects are incorporated into the calculations by taking advantage of "knowledge-based" harmonic energy functions deduced from the mean values and dispersion of the base-pair "step" parameters in high-resolution DNA crystal structures. Spatial arrangements sampled along the dominant low frequency modes have end-to-end distances comparable to those of exact polymer models which incorporate all possible chain configurations. The normal mode analysis accounts for the overall bending, i.e., persistence length, of the double helix and shows how known discrepancies in the measured twisting constants of long DNA molecules could originate in the deformability of neighboring base-pair steps. The calculations also reveal how the natural coupling of local conformational variables affects the global motions of DNA. Successful correspondence of the computed stretching modulus with experimental data requires that the DNA base pairs be inclined with respect to the direction of stretching, with chain extension effected by low energy transverse motions that preserve the strong van der Waals' attractions of neighboring base-pair planes. The calculations further show how one can "engineer" the macroscopic properties of DNA in terms of dimer deformability so that polymers which are intrinsically straight in the equilibrium state exhibit the mesoscopic bending anisotropy essential to DNA curvature and loop formation.


    INTRODUCTION
TOP
ABSTRACT
INTRODUCTION
METHODS
RESULTS AND DISCUSSION
CONCLUSIONS
GLOSSARY
REFERENCES

The large-scale fluctuations of DNA are key to understanding kinetically complicated events, such as the ease of the long, threadlike molecule snaking though the pores of a gel or closing into a loop between separately bound regulatory proteins. DNA loop formation is implicated, in turn, in a number of important biological processes, including the regulation of transcription (Bellomy et al., 1988; Schleif, 1992) and the organization of chromatin (Dillon et al., 1997; Bazett-Jones et al., 1999; Ringrose et al., 1999).

The motions of polymeric DNA are generally modeled in terms of a spatially homogeneous, naturally straight, elastic rod which ignores realistic features of chemical structure (reviewed in Olson, 1996). The fluctuations and correlations of structural parameters in crystals of pure DNA and of DNA-protein complexes, however, show that the equilibrium rest states and elastic constants of neighboring base pairs are dependent on sequence and further reveal the presence of strong couplings between modes of deformation---e.g., between bending, twisting, and stretching---that are the direct result of the chiral nature of DNA chemical structure (Olson et al., 1998). These local features become important as the deformations of sequential base-pair steps accumulate with the increase of chain length, introducing notable structural variability at the mesoscopic level, i.e., in chains of a few hundred base pairs.

Electron micrographs and other low resolution images of mesoscopic DNA fragments reveal broad distributions of molecular shapes and end-to-end distances (Muzard et al., 1990; Thresher and Griffith, 1992; Bednar et al., 1995; Lyubchenko et al., 1995; Hansma et al., 1996; Bustamante et al., 1997). The sequential features responsible for these spatial arrangements are often deduced by comparison of the experimentally observed images of a given DNA with the distribution of configurations generated by Metropolis-Monte Carlo sampling of the likely structural fluctuations of the constituent dimers (Hagerman, 1985; Levene and Crothers, 1986; Kahn and Crothers, 1998). Limitations on computer resources make routine comparison of simulations and experiment impractical, because hundreds of thousands of coordinate sets must be generated to characterize an individual polymer molecule. The study of the large-scale, collective motions of long DNA is consequently impeded.

The normal modes of DNA provide an alternative, computationally less demanding way to study the equilibrium properties of DNA. The normal modes are coupled vibrations found by assuming that the molecular potential energy can be approximated by a harmonic function of the configurational variables and by then solving a generalized eigenvector problem to give a closed analytical description of the motion. The eigenvalues give the vibrational time scales (frequencies) and the eigenvectors the details of the corresponding motion. The motion can be described and visualized at each frequency or as a superposition of independent modes. Time-averaged equilibrium and kinetic properties can be calculated accurately and efficiently as weighted sums, e.g., thermodynamic functions can be determined from the normal mode frequencies. The price paid for these benefits is the limited accuracy of the harmonic energy approximation. The method, nevertheless, provides a useful first impression of the flexibility of a molecule and shows how the motions may change when the chemical structure is modified or the chain is perturbed by the binding of proteins and other ligands.

Normal mode or vibrational analysis is a well established and much used technique for the study of both small molecules and proteins: the technique is successful in reproducing the vibrational spectra of small molecules (Wilson et al., 1955) and in analyzing the collective motions involved in the folding of protein fragments and in the binding and release of substrates and products to and from enzymes (Levy and Karplus, 1979; Brooks and Karplus, 1983; Kitao and G<A><AC>o</AC><AC>&cjs1171;</AC></A>, 1999; Berendsen and Hayward, 2000). Analyses of DNA normal modes to date have focused on the collective motions of very short chain fragments ranging from small oligomers (Tidor et al., 1983; Irikura et al., 1985; Garcia and Soumpasis, 1989; Kottalam and Case, 1990; Ha Duong and Zakrzewska, 1997a,b, 1998; Lin et al., 1997) to a few turns of double helix (Matsumoto and G<A><AC>o</AC><AC>&cjs1171;</AC></A>, 1999). The size of the latter systems is limited by the kinds of parameters which have been used to describe three-dimensional structure---typically the Cartesian coordinates of the constituent atoms or the dihedral angles of a nucleic acid fragment with rigid chemical bonds and valence angles. The small, almost imperceptible motions of short DNA oligonucleotides described at this high level of resolution are closely tied to the detailed sequence-dependent fine structure of the double helix and the association of water, drugs, proteins, and other molecules with individual nucleotide residues. The conformational effects which are collectively responsible for the large-scale, sequence-dependent properties of polymeric DNA chains are difficult to discern in such studies. The normal modes of infinitely long DNA, however, have been examined by Prohofsky and associates (Hua and Prohofsky, 1988; Chen and Prohofsky, 1995) by repeating a short fragment of DNA and taking advantage of helical symmetry to reduce the number of independent coordinates. The complexity of chains that can be studied from this perspective is again limited by the length of the repeating unit.

In the present investigation, we examine the motions of long fragments of DNA by taking advantage of "knowledge-based" harmonic energy functions deduced from known high-resolution crystal structures of DNA (Olson et al., 1998). The deformations of individual dimers are described by six independent "step" parameters which specify the spatial arrangements of neighboring base pairs: three angular variables called Tilt, Roll, and Twist and three variables called Shift, Slide, and Rise with dimensions of distance (Dickerson et al., 1989). This rigid-body representation of base pairs significantly reduces the number of independent variables per chain molecule, thereby making it possible to study the normal modes of much longer DNA fragments than could heretofore be treated in either Cartesian or dihedral angle space. In addition, the elastic energy incorporates the sequence-dependent anisotropy of DNA bending as well as the known correlations of base-pair step parameters. The only missing structural information is the detailed conformation of the sugar-phosphate backbone, including the charged phosphate groups. The latter atoms and the surrounding aqueous solvent, such as the water molecules and counterions in the first or second solvation layer around the DNA, are implicitly treated in the energy terms so that their omission introduces no serious errors when duplex deformations are limited to energies of the order of kBT (where kB is the Boltzmann constant and T the temperature in Kelvin). It should be noted, however, that the surrounding solvent does not have a viscosity and that we do not consider the damping effect of solvent on large-scale polymeric properties in this study. This low-resolution model, nevertheless, provides a straightforward way to deduce the effects of sequence on global folding, something that is said (Berendsen and Hayward, 2000) still to elude analyses of the slow conformational changes of proteins. In principle, there is no limit on the length of DNA that can be described in this manner so long as the constituent dimers obey the harmonic energy model, i.e., there are no excursions of the molecule from the classical B-form double helical structure. Longer chains are characterized by a greater number of normal modes, which when appropriately weighted and superimposed, give rise to a broader distribution of global molecular configurations. In practice, chain length is restricted by the form of the kinetic energy, which assumes that changes in atomic coordinates brought about by the fluctuations of individual "step" parameters are small. These upper limits preclude the need to treat the long-range electrostatic self-repulsion that influences the folding of long supercoiled DNA molecules (Fenley et al., 1994; Vologodskii and Cozzarelli, 1995; Westcott et al., 1997).

We focus attention here on the normal modes of regularly repeating, linear polymers free of bound ligands to establish a point of reference for studies to be reported elsewhere of arbitrary DNA sequences and of chains with ends held in place by specific anchoring conditions, such as the looped configurations imposed by protein binding. We compare our simplified energy model with previous all-atom treatments of short oligomers and with ideal elastic rod representations of polymeric DNA. We identify the local base-pair fluctuations responsible for the dominant (lowest frequency) normal modes. The derived polymeric properties account satisfactorily for the overall bending, i.e., persistence length, of the double helix and reveal the critical role of sequence in global twisting. The computations further suggest that the constants impeding the large-scale stretching of single DNA molecules are related to the concerted bending, twisting, and lateral displacements of neighboring base-pair planes as well as to their axial separation. We identify particular sequence contexts under which ideal elastic behavior breaks down, concentrating on conditions that induce the mesoscopic bending anisotropy which underlies DNA loop formation. Finally, by comparing the end-to-end dimensions of the simulated double helices with exact values obtained from standard matrix formulations of polymer configurational statistics (Flory, 1969; Maroun and Olson, 1988; Marky and Olson, 1994), we investigate the chain lengths at which the normal mode analysis of DNA modeled in terms of base-pair steps is valid.


    METHODS
TOP
ABSTRACT
INTRODUCTION
METHODS
RESULTS AND DISCUSSION
CONCLUSIONS
GLOSSARY
REFERENCES

Base-pair representation

The atoms of each base pair lie in the plane of a local, orthogonal coordinate frame (x, y, z), which is defined in accordance with recently established guidelines (Olson et al., 2001). (Note: The x and y axes lie in the plane of the base pair, with x pointing in the direction of the major groove along the pseudodyad axis of the base pair and y running along the long axis of the base pair in the direction of the leading (sequence) strand, parallel to the C1' ··· C1' vector, and displaced so as to pass through the intersection of x with the vector connecting the pyrimidine Y(C6) and purine R(C8) atoms. The z axis is perpendicular to the base-pair plane, pointing in the 5'-3' direction of the leading strand.) The coordinate system of the kth base pair is further characterized by the position of its origin ok and the rotation matrix Rk, which relates the local base-pair frame to a fixed, orthonormal global reference frame. The rotation matrix is given by [gamma xgamma ygamma z], where the gamma nu (nu  = x, y, z) are unit vectors, expressed in columns, along the positive coordinate axes of the base-pair frame.

The relative position and orientation of the (k + 1)th base pair with respect to its predecessor k are given respectively by the difference between coordinate origins, vk,k+1 = (ok+1 - ok) and the product of the rotation matrix Rk+1 and the inverse of Rk, i.e., the dimer transformation matrix Tk,k+1 Rk+1(Rk)-1. The components of the projection of vk,k+1 on the coordinate axes of base pair k, (tau x, tau y, tau z), are used as translational parameters in the present calculation.

The matrix T can also be expressed in terms of the rotation of magnitude phi around the axis u which brings the coordinate frames on successive base pairs into coincidence, where the components of the unit vector u are (ux, uy, uz) and the angle phi equals (phi<UP><SUB><IT>x</IT></SUB><SUP>2</SUP></UP> phi<UP><SUB><IT>y</IT></SUB><SUP>2</SUP></UP> + phi<UP><SUB><IT>z</IT></SUB><SUP>2</SUP></UP>)1/2 (Chasles, 1830). The elements of T, so expressed, are given by (Jeffreys and Jeffreys, 1946):
t<SUB>&ngr;&mgr;</SUB>=(1−<UP>cos</UP>ϕ)u<SUB>&ngr;</SUB>u<SUB>&mgr;</SUB>−<UP>sin</UP>ϕ <LIM><OP>∑</OP><LL>&kgr;</LL></LIM> &egr;<SUB>&ngr;&mgr;&kgr;</SUB>u<SUB>&kgr;</SUB>+<UP>cos</UP>ϕ &dgr;<SUB>&ngr;&mgr;</SUB>, (1)
where delta nu µ is the Kronecker delta, i.e., delta nu µ = 1 when nu  = µ, delta nu µ = 0 when nu  not equal  µ, and epsilon nu µkappa  = ±1 when nu , µ, kappa  is an even or an odd permutation of 1, 2, 3, respectively, and vanishes otherwise. The rotational components (phix, phiy, phiz) are equated to (Tilt, Roll, Twist) after the definition of Babcock et al. (1994).

The translational parameters between base-pair planes are generally expressed in terms of a "middle" frame so that numerical values are independent of the direction from which a DNA structure is analyzed (Lu and Olson, 1998), i.e., either the sequence or the complementary strand. The translational parameters defined in this work in terms of the kth coordinate frame are related to Shift, Slide, Rise values in the literature, e.g., (Olson et al., 1998), through the matrix T1/2 that effects the halfway rotation (phi/2) between neighboring base-pair planes:
<FENCE><AR><R><C>&tgr;<SUB><UP>x</UP></SUB></C></R><R><C>&tgr;<SUB><UP>y</UP></SUB></C></R><R><C>&tgr;<SUB><UP>z</UP></SUB></C></R></AR></FENCE>=<B><UP>T</UP></B><SUP>1/2</SUP><FENCE><AR><R><C><UP>Shift</UP></C></R><R><C><UP>Slide</UP></C></R><R><C><UP>Rise</UP></C></R></AR></FENCE>. (2)
The dependence of the tau nu (nu  = x, y, z) on Tilt, Roll, Twist (through their incorporation in T) as well as on Shift, Slide, Rise precludes description of the local conformational energy in terms of a quadratic expression in the six base-pair step parameters. To overcome this limitation, we express the rotation matrix in Eq. 2 in terms of the equilibrium "rest" values of Tilt, Roll, and Twist, i.e., T1/2 approx  To1/2. The latter approximation is valid so long as the dimer structure does not depart significantly from this minimum energy state.

Normal mode analysis

Our treatment of the normal modes of DNA at the level of "step" parameters builds upon general formulations (Noguti and G<A><AC>o</AC><AC>&cjs1171;</AC></A>, 1983; Braun et al., 1984; Higo et al., 1985; Levitt et al., 1985) previously developed to describe the collective motions of proteins in terms of internal chemical coordinates, i.e., dihedral angles. The conformational potential energy V of a double helix of N base pairs is thus approximated by the multi-dimensional parabola,
V=<FR><NU>1</NU><DE>2</DE></FR> <LIM><OP>∑</OP><LL><IT>ij</IT></LL></LIM> f<SUB><IT>ij</IT></SUB>&Dgr;&thgr;<SUB><IT>i</IT></SUB>&Dgr;&thgr;<SUB><IT>j</IT></SUB>, (3)
where Delta theta i is the instantaneous fluctuation of the ith "step" parameter from its equilibrium value and fij is an element of a 6(N - 1) × 6(N - 1) matrix of elastic force constants, F, i.e., the terms describing the potential deformability of the six parameters in each of the N - 1 base-pair steps.

The kinetic energy K is similarly expressed in quadratic form in terms of <A><AC>&thgr;</AC><AC>˙</AC></A>i, the first derivative of theta i with respect to time, and the weighted "mass" coefficients hij of the 6(N - 1) × 6(N - 1) matrix H (defined below):
K=<FR><NU>1</NU><DE>2</DE></FR> <LIM><OP>∑</OP><LL><IT>ij</IT></LL></LIM> h<SUB><IT>ij</IT></SUB>&Dgr;<A><AC>&thgr;</AC><AC>˙</AC></A><SUB><IT>i</IT></SUB>&Dgr;<A><AC>&thgr;</AC><AC>˙</AC></A><SUB><IT>j</IT></SUB>. (4)
If the Delta theta i and Delta <A><AC>&thgr;</AC><AC>¨</AC></A>i are collected respectively in the vectors Theta  and <A><AC>&THgr;</AC><AC>¨</AC></A>, the equations of motion simplify to:
<B><UP>H</UP></B>&Dgr;<A><AC><B><UP>&THgr;</UP></B></AC><AC>¨</AC></A>+<B><UP>F</UP></B>&Dgr;<B><UP>&THgr;</UP></B>=<B><UP>0</UP></B>, (5)
with a periodic solution of the form:
&Dgr;&thgr;<SUB><IT>i</IT></SUB>=<LIM><OP>∑</OP><LL><IT>n</IT><UP>=1</UP></LL><UL><UP>6</UP>(<IT>N</IT><UP>−1</UP>)</UL></LIM> A<SUB><IT>in</IT></SUB>&agr;<SUB><IT>n</IT></SUB> <UP>cos</UP>(&ohgr;<SUB><IT>n</IT></SUB>t+&dgr;<SUB><IT>n</IT></SUB>). (6)
Here omega n is the frequency, delta n the phase angle, alpha n the amplitude of the nth normal mode, and Ain the fractional contribution to the ith "step" parameter from the nth normal mode. (In practice the components of Delta Theta and the frequencies omega n are obtained by introducing the vector of normal coordinates Q defined by Delta Theta  = AQ, where the elements of Q are alpha n cos(omega nt delta n) and A is a 6(N - 1) × 6(N - 1) transformation matrix. Eq. 5 can then be rewritten as the eigenvalue expression, HALambda  = FA, with ATHA equated to the identity matrix of order 6(N - 1) and the normal mode frequencies and directions found from the diagonalization of F, i.e., Lambda  = ATFA, where AT denotes the transpose of A and Lambda  is a diagonal matrix with elements lambda nn = omega <UP><SUB><IT>n</IT></SUB><SUP>2</SUP></UP>. The phase angles, however, cannot be determined with this procedure.) The fluctuations of each "step" parameter Delta theta i at time t are thereby expressed as a linear combination of harmonic oscillators, the energies of which are proportional to omega <UP><SUB><IT>n</IT></SUB><SUP>2</SUP></UP>. The contribution of each mode to the variation of individual "step" parameters decreases rapidly with increase in omega n, so that relatively few (low frequency) modes are responsible for the large-scale deformations of the molecule.

Kinetic energy

The kinetic energy coefficients in Eq. 4 incorporate the mass ma and the Cartesian coordinates ra of individual atoms in the DNA through the relationship:
h<SUB><IT>ij</IT></SUB>=<LIM><OP>∑</OP><LL><IT>a</IT></LL></LIM> m<SUB><IT>a</IT></SUB><FENCE><FR><NU>∂<B><UP>r</UP></B><SUB><IT>a</IT></SUB></NU><DE>∂&thgr;<SUB> 
<IT>i</IT></SUB></DE></FR></FENCE><FENCE><FR><NU>∂<B><UP>r</UP></B><SUB><IT>a</IT></SUB></NU><DE>∂&thgr;<SUB><IT>j</IT></SUB></DE></FR></FENCE>. (7)
To evaluate the hij, we take advantage of analytical expressions for the (partial ra/partial theta i) developed to treat the normal mode dynamics of a system of two molecules, each of which moves in dihedral angle space (Braun et al., 1984; Higo et al., 1985). The set of rigid body parameters used here to relate neighboring base-pair planes is identical in form to the variables previously used to describe the relative global positions and orientations of different molecules. The internal atomic motions of DNA are effectively separated from the overall rotations and translations of the molecule by expressing the atomic displacements in terms of an embedded coordinate frame, chosen according to the so-called Eckart condition (Eckart, 1935) to minimize the mass-weighted square atomic displacement of DNA (McLachlan, 1979) in its equilibrium rest state and in a state where one of the "step" parameters is altered from its minimum energy value (Noguti and G<A><AC>o</AC><AC>&cjs1171;</AC></A>, 1983).

Because the atomic motions Delta ra brought about by "step" parameter change Delta theta i are assumed to be small in this common frame (Eckart, 1935), the kinetic energy term (Eq. 4) effectively restricts the length of DNA which can be studied by normal mode analysis. This limitation does not apply to the mesoscopic chains lengths considered below, where typical room temperature fluctuations of individual base-pair steps, such as the ±5° perturbations of Roll which raise the potential energy by ~kBT/2, limit atomic movement to 0.02-0.07 persistence lengths. The persistence length of mixed sequence DNA under ambient aqueous salt conditions, by comparison, is the same magnitude as the contour length of a 150 to 200-bp chain (Hagerman, 1988; Smith et al., 1992, 1996; Bustamante et al., 1994; Bednar et al., 1995; Baumann et al., 1997). The limitation on chain length is further discussed below in the analysis of DNA end-to-end dimensions.

Both backbone and base atoms must be included in the evaluation of the hij. The sugar-phosphate backbone is incorporated in the present calculations by the superposition of a canonical B-form 5'-nucleotide helical fragment (Chandrasekaran and Arnott, 1989) in the reference frame of each base. Because complementary base-pair parameters are fixed at ideal planar values, each complementary nucleotide pair is thereby treated as a rigid body and the small variations in backbone conformation that accompany fluctuations in the geometry of neighboring base pairs are ignored.

Force field

The potential energy coefficients in Eq. 3 are taken from knowledge-based elastic functions extracted from the three-dimensional arrangements of DNA base-pair steps in protein-DNA crystal structures (Olson et al., 1998):
V<SUB><UP>XY</UP></SUB>=<FR><NU>1</NU><DE>2</DE></FR> <LIM><OP>∑</OP><LL><IT>i</IT><UP>=1</UP></LL><UL><UP>6</UP></UL></LIM> <LIM><OP>∑</OP><LL><IT>j</IT><UP>=1</UP></LL><UL><UP>6</UP></UL></LIM> g<SUB><IT>ij</IT></SUB>&Dgr;&phgr;<SUB><IT>i</IT></SUB>&Dgr;&phgr;<SUB><IT>j</IT></SUB>. (8)
The gij in this expression are the force constants impeding deformations of the XpY dimer, and the Delta phi i correspond to the instantaneous fluctuations of each of the six "step" parameters of that dimer from the B-DNA rest state, i.e., Delta phi 1 = Delta TiltXY, Delta phi 2 = Delta RollXY, Delta phi 3 = Delta TwistXY, Delta phi 4 = Delta ShiftXY, Delta phi 5 = Delta SlideXY, Delta phi 6 = Delta RiseXY (full details of the potential function are available in Table S-1 in the Supplementary Material). If these elements are collected respectively in the 6 × 6 force constant matrix GXY and the 6 × 1 vector Delta Phi XY, the dimer deformation energy VXY reduces to (1/2)Delta Phi XYTGXYDelta Phi XY. The choice of translational parameters and the aforementioned approximation of T in Eq. 2 by the constant matrix To introduce a linear relationship between the Delta Phi XY and the corresponding elements of the longer 6(N - 1) vector Delta Theta used in Eq. 5. That is, Delta Phi XY = CXYDelta Theta XY, where the molecular deformation vector Delta Theta (Eq. 5) is built up from the (N - 1) sets of constituent Delta Theta XY dimer fluctuations and CXY is a 6 × 6 constant matrix characteristic of the mean orientation of the XY dimer in B-form DNA. The potential energy of the DNA as a whole can be expressed in the form of Eq. 3 by constructing the 6(N - 1) pseudodiagonal matrix of force constants F from the FXY = CXYTGXYCXY corresponding to the DNA base-pair sequence. For example, the fij of the self-complementary (CGTACG)2 hexamer duplex would be collected in the 30 × 30 array,
<B><UP>F</UP></B>=<FENCE><AR><R><C><UP><B>F</B><SUB>CG</SUB></UP></C><C><UP><B>0</B></UP></C><C><UP><B>0</B></UP></C><C><UP><B>0</B></UP></C><C><UP><B>0</B></UP></C></R><R><C><UP><B>0</B></UP></C><C><UP><B>F</B><SUB>GT</SUB></UP></C><C><UP><B>0</B></UP></C><C><UP><B>0</B></UP></C><C><UP><B>0</B></UP></C></R><R><C><UP><B>0</B></UP></C><C><UP><B>0</B></UP></C><C><UP><B>F</B><SUB>TA</SUB></UP></C><C><UP><B>0</B></UP></C><C><UP><B>0</B></UP></C></R><R><C><UP><B>0</B></UP></C><C><UP><B>0</B></UP></C><C><UP><B>0</B></UP></C><C><UP><B>F</B><SUB>AC</SUB></UP></C><C><UP><B>0</B></UP></C></R><R><C><UP><B>0</B></UP></C><C><UP><B>0</B></UP></C><C><UP><B>0</B></UP></C><C><UP><B>0</B></UP></C><C><UP><B>F</B><SUB>CG</SUB></UP></C></R></AR></FENCE>, (9)
where the boldface 0 values in this expression are 6 × 6 null matrices.

Because the lowest energy conformation of a given DNA sequence is self-evident from the knowledge-based force field, i.e., the minimum energy three-dimensional structure is defined by the set of average "step" parameters, the time-consuming energy minimization step normally carried out before normal mode analysis does not need to be performed in the present calculations.

The potential functions introduced below are first approximations of the sequence-dependent structure of DNA based on the observed conformations of selected dimers in high resolution protein-bound and B-DNA crystal structures. Although mean (equilibrium) sequence-dependent parameters have not significantly changed since first reports (Gorin et al., 1995; Olson et al., 1998), the local force constants are expected to change as new structural data accumulate. Moreover, there are not yet sufficient crystallographic data to address DNA deformability beyond the dimer level despite suggestions (Nadeau and Crothers, 1989) of longer-range organization of DNA structure.

End-to-end dimensions

The unperturbed, mean-square end-to-end distance of DNA, < r2> 0, is obtained by three independent approaches. First, exact values of < r2> 0 are computed using standard matrix methods for the mean extension of a DNA of N base pairs, with < r2> 0 equated to the average scalar product of the end-to-end vector, < r · r> 0, and r defined as
<B><UP>r</UP></B>=<B><UP>v</UP></B><SUB>1</SUB>+<B><UP>T</UP></B><SUB>12</SUB><B><UP>v</UP></B><SUB>2</SUB>+<B><UP>T</UP></B><SUB>12</SUB><B><UP>T</UP></B><SUB>23</SUB><B><UP>v</UP></B><SUB>3</SUB>+…+<B><UP>T</UP></B><SUB>12</SUB><B><UP>T</UP></B><SUB>23</SUB> … <B><UP>T</UP></B><SUB><IT>N</IT><UP>−2,</UP><IT>N</IT><UP>−1</UP></SUB><UP><B>v</B></UP><SUB>N−1</SUB>. (10)
Here Tk,k+1 is the transformation matrix that relates coordinate frames in successive base-pair planes (Eq. 1) and vk is the virtual bond vector that connects base-pair centers (Eq. 2). Boltzmann averages of the T and v at each base-pair step are incorporated in dimeric generator matrices, the serial product of which yields all possible configurations of the selected chain sequence. The reader is referred elsewhere for the precise formulation of these expressions (Flory, 1969; Maroun and Olson, 1988; Marky and Olson, 1994). The zero subscript on < r2> 0 and other (unperturbed) averages denotes the omission of chain self-intersection and solvent-polymer interactions in their calculation (Flory, 1969). The DNAs studied below are sufficiently short and stiff so that excluded volume effects are automatically avoided.

Second, we express < r2> 0 in terms of the fluctuations Delta r of the end-to-end vector with respect to the position ro of terminal base pairs in the (minimum energy) rest state,
⟨r<SUP>2</SUP>⟩<SUB>0</SUB>=⟨<B><UP>r</UP></B> · <B><UP>r</UP></B>⟩<SUB>0</SUB>=⟨(<B><UP>r</UP></B><SUP><UP>o</UP></SUP>+&Dgr;<B><UP>r</UP></B>) · (<B><UP>r</UP></B><SUP><UP>o</UP></SUP>+&Dgr;<B><UP>r</UP></B>)⟩<SUB>0</SUB>=<B><UP>r</UP></B><SUP><UP>o</UP></SUP> · <B><UP>r</UP></B><SUP><UP>o</UP></SUP>+2<B><UP>r</UP></B><SUP><UP>o</UP></SUP> · ⟨&Dgr;<B><UP>r</UP></B>⟩<SUB>0</SUB>+⟨&Dgr;<B><UP>r</UP></B> · &Dgr;<B><UP>r</UP></B>⟩<SUB>0</SUB>. (11)
Here the superscript o denotes (constant) rest state values. The value of Delta r, which is determined by the fluctuations of "step" parameters, is written as a Taylor series expansion in the Delta theta i,
&Dgr;<B><UP>r</UP></B>=<LIM><OP>∑</OP><LL><IT>i</IT><UP>=1</UP></LL><UL><UP>6</UP>(<IT>N</IT><UP>−1</UP>)</UL></LIM> <FENCE><FR><NU>∂r</NU><DE>∂&thgr;<SUB><IT>i</IT></SUB></DE></FR></FENCE>&Dgr;&thgr;<SUB><IT>i</IT></SUB>+<FR><NU>1</NU><DE>2</DE></FR> <LIM><OP>∑</OP><LL><IT>i</IT><UP>=1</UP></LL><UL><UP>6</UP>(<IT>N</IT><UP>−1</UP>)</UL></LIM> <LIM><OP>∑</OP><LL><IT>j</IT><UP>=1</UP></LL><UL><UP>6</UP>(<IT>N</IT><UP>−1</UP>)</UL></LIM> <FENCE><FR><NU>∂<SUP>2</SUP><B><UP>r</UP></B></NU><DE><UP>∂&thgr;</UP><SUB><IT>i</IT></SUB>∂&thgr;<SUB><IT>j</IT></SUB></DE></FR></FENCE>&Dgr;&thgr;<SUB><IT>i</IT></SUB>&Dgr;&thgr;<SUB><IT>j</IT></SUB>+…. (12)
If the series is approximated by the first term, the average displacement vector < Delta r> 0 is zero because < Delta theta i>  = 0 and the mean-square end-to-end distance is greater than the separation of chain ends in the rest state (because the self product, < Delta r · Delta r> 0, in Eq. 11 must be positive). Because such a result is inconsistent with the end-to-end displacement of a naturally straight, inextensible DNA, we consider up to second-order terms of the Taylor series (Eq. 12) in the calculation with the result,
⟨&Dgr;<B><UP>r</UP></B>⟩<SUB>0</SUB>=<FR><NU>1</NU><DE>2</DE></FR> <LIM><OP>∑</OP><LL><IT>i</IT><UP>=1</UP></LL><UL><UP>6</UP>(<IT>N</IT><UP>−1</UP>)</UL></LIM> <LIM><OP>∑</OP><LL><IT>j</IT><UP>=1</UP></LL><UL><UP>6</UP>(<IT>N</IT><UP>−1</UP>)</UL></LIM> <FENCE><FR><NU>∂<SUP>2</SUP><B><UP>r</UP></B></NU><DE>∂&thgr;<SUB><IT>i</IT></SUB>∂&thgr;<SUB><IT>j</IT></SUB></DE></FR></FENCE>⟨&Dgr;&thgr;<SUB><IT>i</IT></SUB>&Dgr;&thgr;<SUB><IT>j</IT></SUB>⟩,

⟨&Dgr;<B><UP>r</UP></B> · &Dgr;<B><UP>r</UP></B>⟩<SUB>0</SUB>=<LIM><OP>∑</OP><LL><IT>i</IT><UP>=1</UP></LL><UL><UP>6</UP>(<IT>N</IT><UP>−1</UP>)</UL></LIM> <LIM><OP>∑</OP><LL><IT>j</IT><UP>=1</UP></LL><UL><UP>6</UP>(<IT>N</IT><UP>−1</UP>)</UL></LIM> <FENCE><FR><NU>∂<B><UP>r</UP></B></NU><DE>∂&thgr;<SUB><IT>i</IT></SUB></DE></FR></FENCE> · <FENCE><FR><NU>∂<B><UP>r</UP></B></NU><DE>∂&thgr;<SUB><IT>j</IT></SUB></DE></FR></FENCE>⟨&Dgr;&thgr;<SUB><IT>i</IT></SUB>&Dgr;&thgr;<SUB><IT>j</IT></SUB>⟩. (13)
The first and second derivatives in these expressions are evaluated numerically. The values of < Delta theta iDelta theta j> are taken directly from the experimental data used to derive the knowledge-based force field (Olson et al., 1998). Such averages would have to be determined numerically over all normal modes if an all-atom force field were used. It should also be noted that the first derivatives that appear in Eq. 13, (partial r/partial theta i), differ from similar terms employed in Eq. 7 to compute the kinetic energy. As explained above, the latter derivatives are computed in a carefully chosen coordinate frame that essentially eliminates rotational and translation motions of the molecule. The variation of chain ends required to evaluate Eq. 13, by contrast, is performed in a common global coordinate frame embedded in the plane of the first base pair.

Finally, we compute the root-mean-square end-to-end distance of DNA molecules which are confined to the set of spatial configurations associated with one or more normal modes. That is, we use the set of "step" parameters associated with mode n at time t (Eq. 6) to generate the end-to-end vector r(n, t) and the Boltzmann factor sigma (n, t) of each of the polymeric states that comprise certain global motions and then evaluate < r2> 0 as an energy weighted sum over representative configurations:
⟨r<SUP>2</SUP>⟩<SUB>0</SUB>=<FR><NU><LIM><OP>∑</OP><LL><IT>n t</IT></LL></LIM> <B><UP>r</UP></B>(n, t) · <B><UP>r</UP></B>(n, t)&sfgr;(n, t)</NU><DE><LIM><OP>∑</OP><LL><IT>n t</IT></LL></LIM> &sfgr;(n, t)</DE></FR>. (14)
In practice, we set an upper limit of 7.5 kBT on the potential energy, V(Delta theta i(n, t)) (Eq. 3), thereby excluding configurations where sigma (n, t) = exp[-V/kBT<=  exp[-7.5].


    RESULTS AND DISCUSSION
TOP
ABSTRACT
INTRODUCTION
METHODS
RESULTS AND DISCUSSION
CONCLUSIONS
GLOSSARY
REFERENCES

Elastic features of DNA repeating polymers

We start by determining the normal modes of four simple repeating polymers---the poly dA · poly dT and poly dG · poly dC homopolymers with a monomeric repeating unit (the AA · TT or GG · CC dimer) and the poly d(AT) and poly d(GC) alternating copolymers made up of sequential purine-pyrimidine (AT · AT or GC · GC) and pyrimidine-purine (TA · TA or CG · CG) base-pair steps. We classify the motions, after Matsumoto and G<A><AC>o</AC><AC>&cjs1171;</AC></A> (1999), on the basis of the global structural changes revealed through computer visualization of the eigenvectors associated with each mode (i.e., Eq. 6). The color-coded spectrum of lowest frequency bending, twisting, and stretching modes of a 200 bp chain is presented in Fig. 1 for each of the four polymers and schematic illustrations of some of these motions are given in Fig. 2. Numerical values of the frequencies are tabulated in the Supplementary Material (Table S-2). We focus on the lowest frequency modes because the amplitudes of the normal modes and their consequent importance to the overall motions of DNA diminish at higher frequency.



View larger version (26K):
[in this window]
[in a new window]
 
FIGURE 1   Color-coded spectrum of lowest frequency bending (blue), twisting (green), and stretching (red) modes of 200-bp segments of the poly dA · poly dT and poly dG · poly dC homopolymers and the poly d(AT) and poly d(GC) alternating copolymers. As the frequency increases, the amplitude of each normal mode and its consequent importance to the overall motions of the DNA becomes lower.



View larger version (18K):
[in this window]
[in a new window]
 
FIGURE 2   Schematic illustration of representative low frequency normal modes of an elastic rod. The arrows point in the directions of bending, twisting, and stretching motions in each mode.

It should be noted that these low frequency modes would not be observed in a real system due to viscous effects, which we do not consider here. While our method does not necessarily describe the time-development of DNA chains in a real system, we show below that these low normal mode frequencies are closely related to the global dynamic properties of DNA, such as the bending rigidity, twisting rigidity, and stretching rigidity. Thus, the low normal mode frequencies presented here are useful indicators of the dynamic properties of DNA.

Each of the pure (planar) bending modes (blue lines at the left of Fig. 1) is doubly degenerate, describing mutually perpendicular motions with equivalent (bending) energy. The superposition of these modes leads to the global isotropic bending characteristic of an ideal elastic rod. In contrast to the bending motions, the twisting modes (green lines) are higher in frequency, more widely spaced, and more sensitive to polymer composition. Specifically, the frequencies of the twisting modes of the simulated alternating copolymers and the poly dG · poly dC homopolymer are lower in value than those of the poly dA · poly dT model. The stretching modes (red lines), which are also sensitive to base-pair sequence, tend to be even more widely spaced and to occur at still higher frequencies than the twisting modes.

The rod-like behavior of poly dA · poly dT is further illustrated in Fig. 3 where the chain length dependence of the computed low frequency modes is superimposed on the predicted variation (Strutt, 1894) of the bending, twisting, and stretching frequencies of a homogeneous elastic rod. (The frequency of the nth bending mode of an elastic rod of length L with uniform circular cross-section is given by nu <UP><SUB><IT>n</IT></SUB><SUP><IT>b</IT></SUP></UP> = Cbp<UP><SUB><IT>n</IT></SUB><SUP>2</SUP></UP>, where Cb is a material constant and pn satisfies the condition pnL = (4.730, 7.853, 10.9965, ···) for (n = 1, 2, 3, ...) (Strutt, 1894). The twisting and stretching frequencies follow a similar form, varying directly with n and inversely with L, i.e., nu <UP><SUB><IT>n</IT></SUB><SUP><IT>t</IT></SUP></UP> = (n/2L)Ct and nu <UP><SUB><IT>n</IT></SUB><SUP><IT>s</IT></SUP></UP> = (n/2L)Cs.) The frequencies of the poly dA · poly dT motions (scatter points) are normalized with respect to the lowest frequency bending, twisting, and stretching modes of a 200-bp chain. The normal mode frequencies of the elastic rod (smooth curves) are similarly expressed as multiples of the lowest frequency motions of an ideal rod with the same contour length (199 base-pair steps × 3.27 Å per step = 650 Å). The agreement between computed and theoretical values is remarkably close, with only minor discrepancies in the relative bending frequencies at short chain lengths where the theory is known to break down. Similar agreement between the base-pair level representation of DNA motions and elastic rod behavior is obtained for the low frequencies modes of poly dG · poly dC and the two alternating copolymers (data not shown).



View larger version (22K):
[in this window]
[in a new window]
 
FIGURE 3   Superposition of the computed dependence of representative low frequency bending, twisting, stretching modes of poly dA · poly dT (scatter points) on chain length and the predicted variation (thin solid lines for the lowest frequencies, dashed lines for the second lowest frequencies, and thick solid lines for the third lowest frequencies) for an elastic rod with the same contour length. Frequencies are normalized with respect to the lowest frequency of each type of motion in a 200 bp-chain.

The mechanical constants associated with the overall elastic behavior of the four repeating polymers are collected in Table 1. These values are obtained by substituting the computed normal mode frequencies in the classical expressions for the normal modes of an ideal elastic rod (Strutt, 1894). Specifically, the bending constant A, the twisting constant C, and the stretching constant Y are given by
A=<FR><NU>(2&pgr;v<SUP><IT>b</IT></SUP><SUB><IT>n</IT></SUB>)<SUP>2</SUP>M</NU><DE>p<SUP><UP>4</UP></SUP><SUB><IT>n</IT></SUB>L</DE></FR>, C=<FR><NU>(2Lv<SUP><IT>t</IT></SUP><SUB><IT>n</IT></SUB>)<SUP>2</SUP>I<SUB><IT>M</IT></SUB></NU><DE>n<SUP>2</SUP></DE></FR>, Y=<FR><NU>(2v<SUP><IT>s</IT></SUP><SUB><IT>n</IT></SUB>)<SUP>2</SUP>ML</NU><DE>n<SUP>2</SUP></DE></FR>, (15)
where the nu n are the computed frequencies of the nth bending (b), twisting (t), or stretching (s) mode, M is the total mass of the molecule, L is the length of the DNA helical axis (see Kosikov et al., 1999, and Matsumoto and G<A><AC>o</AC><AC>&cjs1171;</AC></A>, 1999 for computational details) and IM is the moment of inertia per unit length around the twisting axis. As evident from the table, the computed values of the bending constant A and the persistence length a derived from A on the basis of elastic rod theory (a A/kBT with T = 298 K) are approximately independent of sequence and in excellent agreement with values extracted from experimental studies of mixed sequence DNA (Hagerman, 1988; Smith et al., 1992, 1996; Bustamante et al., 1994; Bednar et al., 1995; Baumann et al., 1997). The elastic constants, however, may change as the local force constants are refined.


                              
View this table:
[in this window]
[in a new window]
 
TABLE 1   Mechanical constants describing the simulated elastic properties of four regularly repeating DNA polymers

The sensitivity of the calculated twisting constants to base-pair sequence may account in part for the wide range of experimentally derived values of C (Barkley and Zimm, 1979; Thomas et al., 1980; Hurley et al., 1982; Millar et al., 1982; Horowitz and Wang, 1984). The differences in measured twisting constants, usually attributed to experimental effects, such as the bending strain that accompanies loop closure (Heath et al., 1996), may also reflect the intrinsic deformability of the constituent base pairs. The greater overall twisting stiffness of poly dA · poly dT compared to the other polymers in Table 1 stems in part from the substantially higher cost of distorting AA dimers compared to other base-pair steps in the knowledge-based potential (see Table S-1 in the Supplementary Material and discussion below).

The stretching constants Y extracted here, however, are two- to threefold greater than values deduced from recent single-molecule manipulations of DNA (Smith et al., 1996; Baumann et al., 1997; Wang et al., 1997; Bouchiat and Mezard, 1998). As discussed below, the relative ease of global stretching reflects a number of local factors, in addition to the force constants that govern the axial, i.e., van der Waals', separation of neighboring base-pair planes. In other words, the global stretching of DNA is not simply a function of Rise.

Matsumoto and G<A><AC>o</AC><AC>&cjs1171;</AC></A> (1999) previously derived DNA elastic constants on the basis of a normal mode analysis in the dihedral angle space of short (24-36 bp) double helical molecules, with poorer correspondence between computation and experiment than the present work, i.e., the previously computed bending and twisting constants are respectively three to ten times greater than the corresponding values in Table 1. The improved agreement found here seemingly reflects our use of knowledge-based potential energies rather than an atomic force field. Optimization of DNA conformation on the basis of the detailed interactions of all atoms typically yields a large number of closely spaced minimum energy substates of similar structural character (Poncin et al., 1992). The normal mode analysis of DNA in dihedral angle (and Cartesian) space is performed with respect to one of these minimum energy states and the transitions between closely related, competing minima are not considered. The statistical approach used to generate our elastic force field, by contrast, reflects a large number of known crystal structures, each of which is analogous to one of the minimum energy substates found through all-atom energy minimization. Therefore, any fluctuation on our elastic energy surface is comparable to a transition between different minimum energy points derived with an atomic force field. The correspondence between the mechanical constants in Table 1 and values extracted from experiment suggests that transitions between conformational substates have a significant influence on the global properties of DNA.

A similar argument is made by Song and Schurr (1990) in accounting for the unusually large values of DNA persistence length (~2100 Å) deduced from measurements of transient electric dichroism. They offer three possible reasons for the large discrepancy between their results and the persistence lengths for mixed sequence DNA (~500 Å) obtained with other experimental approaches. One explanation, which they favor, is that the potential energy of DNA bending is not a smooth quadratic function and that the energy exhibits several discrete minima separated by barriers. In the short-time scale of their experiment the DNA is trapped in one of the minima and the persistence length thereby becomes larger.

It should be noted that there is an open controversy over the ionic strength dependence of the persistence length and the lower limit of the persistence length of mixed sequence DNA at ambient salt conditions. We cite a DNA persistence length of 500 Å in Table 1 consistent with a variety of independent measurements. As noted above, larger values of the persistence length are deduced in some work, e.g., (Song and Schurr, 1990). There is similar disagreement among elastic constants obtained for specific DNA sequences (see Table S-3 in the Supplementary Material) which may be related in part to the discussion above, i.e., chains may sample only a small part of the energy surface in some experiments but may explore vast areas of conformation space in others.

Sequence effects on global motion

Figs. 4-6 illustrate the deformations of local "step" parameters, which are collectively responsible for selected low frequency normal modes of the aforementioned repeating polymers. These plots, for DNA chains of 120 bp, capture the sequential fluctuations of each conformational variable at the moment when the potential energy of the molecule is raised by kBT/2; the fluctuations of all parameters are reversed a half cycle later of the mode. As detailed below, the mechanisms used to effect overall chain motions reflect the conformational properties of the constituent dimer units.



View larger version (42K):
[in this window]
[in a new window]
 
FIGURE 4   Deformations of local angular "step" parameters, which are collectively responsible for the lowest frequency in-plane bending motions of a 120-bp poly dA · poly dT duplex: (top, middle) degenerate modes of lowest frequency, nb = nb' = 1; (bottom) one of the two second lowest frequency modes, nb = 2. Plots illustrate the sequential fluctuations of Tilt (thin solid line), Roll (dashed line), and Twist (thick solid line) at the moment when the potential energy of the molecule is raised by kBT/2; fluctuations are reversed a half cycle later of the mode.



View larger version (33K):
[in this window]
[in a new window]
 
FIGURE 5   Fluctuations of local angular "step" parameters, which are collectively responsible for the lowest frequency global twisting modes (nt = 1) of 120-bp fragments of poly dA · poly dT, poly d(AT), poly dG · poly dC, and poly d(GC). See legend to Fig. 4.



View larger version (36K):
[in this window]
[in a new window]
 
FIGURE 6   Fluctuations of local translational "step" parameters associated with the lowest frequency global stretching (ns = 1) of 120-bp fragments of poly dA · poly dT, poly d(AT), poly dG · poly dC, and poly d(GC). Plots illustrate the sequential fluctuations of Shift (thin solid line), Slide (dashed line), and Rise (thick solid line) at the moment when the potential energy of the molecule is raised by kBT/2. See legend to Fig. 4.

Bending

The in-plane bending of poly dA · poly dT, detailed in Fig. 4, takes advantage of the local bending anisotropy of the double helix, i.e., the tendency of DNA to deform through rolling rather than tilting motions and an intrinsic feature of the knowledge-based AA · TT potential. The same patterns of angular distortions are found in the corresponding modes of the other three polymers and thus are not reported here. The regular patterns of Roll deformations in the illustrated examples (dashed lines) resemble early "mini-kinked" models of global DNA bending (Olson, 1979; Yathindra, 1979; Zhurkin et al., 1979) with marked angular distortions of opposite sense at ~5 bp intervals, corresponding to sites on the surface of the double helix where the cumulative effects of local structural change combine constructively. The lesser, out-of-phase variations of Tilt (thin solid lines) contribute in the same fashion to global bending, much like the sequentially shifted sinusoidal changes in Roll and Tilt characteristic of classical models of "smooth" bending (Namoradze et al., 1977; Levitt, 1978; Sussman and Trifonov, 1978). In contrast to the uniformity of the ideal models, the local bending deformations revealed by the normal mode analysis are variable with Roll and Tilt angles of greater or lesser magnitude in different parts of the polymeric fragment, e.g., the distortions responsible for the lowest frequency bending mode are concentrated around the midpoint of the chain. The small accompanying fluctuations of Twist in the present study (thick solid lines) reflect the well known correlation between Roll and Twist built into the energy model.

The global bending isotropy brought about by degenerate bending modes arises at the local level from a simple 2- to 3-bp phase shift of conformational distortions (compare Figs. 4a, b). A comparable shift of "step" parameters also underlies the two normal modes that dominate the global bending of short RNA duplexes (Zacharias and Sklenar, 2000). These subtle changes follow from the ideas behind DNA "mini-kinking," where the sites of largest Roll (and concomitantly zero Tilt) deformation determine the direction of overall curvature (Olson, 1979; Yathindra, 1979; Zhurkin et al., 1979). A 90° change in the direction of global bending comes about by shifting the pattern of local structural perturbations so that Roll is null and Tilt becomes a local maximum or minimum at corresponding sites in the degenerate mode.

The DNA makes use of the same conformational mechanism to change the direction of global bending in the second lowest frequency bending mode. The point of inflection in the variation of "step" parameters versus poly dA · poly dT base-pair position in Fig. 4 c gives rise to a 180° shift of bending directions in the two halves of the molecule (see schematic in Fig. 2). The change in direction stems from the asymmetric pattern of Roll variation in the 120-bp chain, e.g., the distortions are local maxima near base pairs 10, 20, 30, 40, 50 in the first half of the molecule and local minima near base pairs 70, 80, 90, 100, 110 in the second half. Similar representations of the higher frequency bending modes (data not shown) contain additional points of inflection where all three angular variables are undistorted. If the inflection points become too closely spaced, the ~10 bp pattern of conformational fluctuations in phase with the helical repeating unit disappears.

The critical frequency at which the distance between adjacent points of inflection drops to 10 bp can be estimated as follows. Roughly speaking, there are (2n + 1)/4 standing waves in the nth bending mode (Fig. 2). The wavelength of the mode is thus given by lambda  = 4L/(2n + 1), where L is the contour length of the DNA. The upper limit on the number of bending modes n is then obtained by setting the half wavelength corresponding to the distance between two adjacent points of inflection to 10 bp, i.e., lambda /2 = 10 bp. The maximum n for a 120-bp stretch of poly dA · poly dT is thus 12, corresponding to a bending frequency of ~1.4 cm-1 in an elastic rod with equivalent material properties and contour length. In the case of the poly dA · poly dT 120-mer, we find a pair of bending modes with close frequencies below the theoretical frequency limit at 1.07 and 1.08 cm-1 and with 11 points of inflection, separated by ~10 bp, in each of these modes. As anticipated from elastic theory, there are no other such pairs of degenerate bending modes at higher frequency.

The coupling of local "step" parameters in the DNA force field gives rise to changes in neighboring base-pair displacement as the chain bends globally (see Fig. S-1 in the Supplementary Material for plots of the local translational fluctuations associated with the bending of individual polymers). The deformations in Roll which dominate the overall molecular motion are accompanied by changes in Slide, whereas the lesser changes in Tilt introduce concomitant fluctuations of Shift. The extremes of deformability found in pyrimidine-purine (YR) versus purine-pyrimidine (RY) steps, i.e., the enhanced flexibility of YR steps versus the natural stiffness of RY steps (Olson et al., 1998), introduces a zig-zag pattern in the variation of "step" parameters in poly d(AT) and poly d(GC) models compared to the smooth parametric responses of the simulated homopolymers in Fig. 4 (see plots of translational parameters for the alternating copolymers in Fig. S-1 in the Supplementary Material). For example, the increments in Roll that lead to global bending are consistently greater at the YR steps of alternating copolymers (data not shown), and the coupled sequence-dependent variation of base-pair displacement in these chains is even more pronounced, with distortions in Slide restricted almost entirely to the YR steps.

Twisting

As evident from Fig. 5, the lowest frequency global twisting motions of regular DNA polymers are dominated by fluctuations in local Twist (thick solid lines). The dimer deformations build up from the ends of the molecule with the largest deformations of neighboring base-pair steps found in the center of the chain. The alternating copolymers show the zig-zag pattern of local conformational change noted above with greater distortions occurring at YR steps. The degree of local Twist deformation of poly dA · poly dT is smaller than that in the other chains as expected from its higher global twisting constant (Table 1). The increase in Twist angles in Fig. 5 is accompanied by a concomitant decrease in Roll values (dashed lines), the decrease in local bending depending on the strength of the energy coupling term, e.g., small for GG compared to AA and YR steps. The translational parameters show similar sequence-dependent, coupled variations that reflect the knowledge-based energy function. For example, fluctuations of Shift are negligible in the alternating copolymers where the Twist-Shift energy coefficient is by definition zero. (The directional dependence of Shift (Dickerson et al., 1989) leads naturally to null force constants at self-complementary dimer steps. Indeed, the coefficients of all mixed energy terms involving Shift or Tilt (Eq. 8), except for the Tilt-Shift contribution, are zero at such steps.) Similarly, Slide varies in opposing directions at YR versus RY steps in poly d(GC) in accordance with the positive Twist-Slide coefficient at CG steps and the negative value at GC steps. Graphs of the sequential variation of translational parameters in the lowest frequency twisting modes of all four polymers are available in Fig. S-2 in the Supplementary Material.

Stretching

The global stretching of DNA homopolymers and alternating copolymers stems primarily from changes in Rise that grow to a maximum in the middle of the chain (thick solid lines in Fig. 6). The computed, sequence-dependent variation in Rise in the lowest frequency stretching mode reflects the local force constants, with CG steps much more easily deformed than any other dimer. The changes in Rise are coupled, via the elastic energy potential, to fluctuations in Shift in the homopolymers (thin solid lines in Fig. 6) and to Slide in the copolymers (dashed lines in the figure). Indeed, the variation of Shift is comparable to that of Rise at GG steps as is the change in Slide relative to that of Rise at TA steps. Poly d(GC) is unusual in stretching with an alternation in the direction of Slide at YR and RY steps. As discussed below, the coupling of Rise to Shift or Slide contributes to the overall stretching force constants. Concerted movements of base pairs which mimic the conformational behavior of stretched DNA fibers (Chandrasekaran and Arnott, 1989), e.g., same sign variation of Shift and Rise, enhance the global stretching motion. Notably, deformations of angular parameters in the lowest fr