| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
Biophys J, July 2002, p. 22-41, Vol. 83, No. 1
Department of Chemistry, Rutgers, the State University of New Jersey, Wright-Rieman Laboratories, Piscataway, New Jersey 08854-8087 USA
| |
ABSTRACT |
|---|
|
|
|---|
Computer simulation of the dynamic structure of DNA can be carried out at various levels of resolution. Detailed high resolution information about the motions of DNA is typically collected for the atoms in a few turns of double helix. At low resolution, by contrast, the sequence-dependence features of DNA are usually neglected and molecules with thousands of base pairs are treated as ideal elastic rods. The present normal mode analysis of DNA in terms of six base-pair "step" parameters per chain residue addresses the dynamic structure of the double helix at intermediate resolution, i.e., the mesoscopic level of a few hundred base pairs. Sequence-dependent effects are incorporated into the calculations by taking advantage of "knowledge-based" harmonic energy functions deduced from the mean values and dispersion of the base-pair "step" parameters in high-resolution DNA crystal structures. Spatial arrangements sampled along the dominant low frequency modes have end-to-end distances comparable to those of exact polymer models which incorporate all possible chain configurations. The normal mode analysis accounts for the overall bending, i.e., persistence length, of the double helix and shows how known discrepancies in the measured twisting constants of long DNA molecules could originate in the deformability of neighboring base-pair steps. The calculations also reveal how the natural coupling of local conformational variables affects the global motions of DNA. Successful correspondence of the computed stretching modulus with experimental data requires that the DNA base pairs be inclined with respect to the direction of stretching, with chain extension effected by low energy transverse motions that preserve the strong van der Waals' attractions of neighboring base-pair planes. The calculations further show how one can "engineer" the macroscopic properties of DNA in terms of dimer deformability so that polymers which are intrinsically straight in the equilibrium state exhibit the mesoscopic bending anisotropy essential to DNA curvature and loop formation.
| |
INTRODUCTION |
|---|
|
|
|---|
The large-scale fluctuations of DNA are key to
understanding kinetically complicated events, such as the ease of the
long, threadlike molecule snaking though the pores of a gel or closing into a loop between separately bound regulatory proteins. DNA loop
formation is implicated, in turn, in a number of important biological
processes, including the regulation of transcription (Bellomy et al.,
1988
; Schleif, 1992
) and the organization of chromatin (Dillon et al.,
1997
; Bazett-Jones et al., 1999
; Ringrose et al., 1999
).
The motions of polymeric DNA are generally modeled in terms of a
spatially homogeneous, naturally straight, elastic rod which ignores
realistic features of chemical structure (reviewed in Olson, 1996
). The
fluctuations and correlations of structural parameters in crystals of
pure DNA and of DNA-protein complexes, however, show that the
equilibrium rest states and elastic constants of neighboring base pairs
are dependent on sequence and further reveal the presence of strong
couplings between modes of deformation
e.g., between bending,
twisting, and stretching
that are the direct result of the chiral
nature of DNA chemical structure (Olson et al., 1998
). These local
features become important as the deformations of sequential base-pair
steps accumulate with the increase of chain length, introducing notable
structural variability at the mesoscopic level, i.e., in chains of a
few hundred base pairs.
Electron micrographs and other low resolution images of mesoscopic DNA
fragments reveal broad distributions of molecular shapes and end-to-end
distances (Muzard et al., 1990
; Thresher and Griffith, 1992
; Bednar et
al., 1995
; Lyubchenko et al., 1995
; Hansma et al., 1996
; Bustamante et
al., 1997
). The sequential features responsible for these spatial
arrangements are often deduced by comparison of the experimentally
observed images of a given DNA with the distribution of configurations
generated by Metropolis-Monte Carlo sampling of the likely structural
fluctuations of the constituent dimers (Hagerman, 1985
; Levene and
Crothers, 1986
; Kahn and Crothers, 1998
). Limitations on computer
resources make routine comparison of simulations and experiment
impractical, because hundreds of thousands of coordinate sets must be
generated to characterize an individual polymer molecule. The study of
the large-scale, collective motions of long DNA is consequently impeded.
The normal modes of DNA provide an alternative, computationally less demanding way to study the equilibrium properties of DNA. The normal modes are coupled vibrations found by assuming that the molecular potential energy can be approximated by a harmonic function of the configurational variables and by then solving a generalized eigenvector problem to give a closed analytical description of the motion. The eigenvalues give the vibrational time scales (frequencies) and the eigenvectors the details of the corresponding motion. The motion can be described and visualized at each frequency or as a superposition of independent modes. Time-averaged equilibrium and kinetic properties can be calculated accurately and efficiently as weighted sums, e.g., thermodynamic functions can be determined from the normal mode frequencies. The price paid for these benefits is the limited accuracy of the harmonic energy approximation. The method, nevertheless, provides a useful first impression of the flexibility of a molecule and shows how the motions may change when the chemical structure is modified or the chain is perturbed by the binding of proteins and other ligands.
Normal mode or vibrational analysis is a well established and much used
technique for the study of both small molecules and proteins: the
technique is successful in reproducing the vibrational spectra of small
molecules (Wilson et al., 1955
) and in analyzing the collective motions
involved in the folding of protein fragments and in the binding and
release of substrates and products to and from enzymes (Levy and
Karplus, 1979
; Brooks and Karplus, 1983
; Kitao and G
;
Berendsen and Hayward, 2000
). Analyses of DNA normal modes to date have
focused on the collective motions of very short chain fragments ranging
from small oligomers (Tidor et al., 1983
; Irikura et al., 1985
; Garcia
and Soumpasis, 1989
; Kottalam and Case, 1990
; Ha Duong and Zakrzewska,
1997a
,b
, 1998
; Lin et al., 1997
) to a few turns of double helix
(Matsumoto and G
). The size of the latter systems is
limited by the kinds of parameters which have been used to describe
three-dimensional structure
typically the Cartesian coordinates of the
constituent atoms or the dihedral angles of a nucleic acid fragment
with rigid chemical bonds and valence angles. The small, almost
imperceptible motions of short DNA oligonucleotides described at this
high level of resolution are closely tied to the detailed
sequence-dependent fine structure of the double helix and the
association of water, drugs, proteins, and other molecules with
individual nucleotide residues. The conformational effects which are
collectively responsible for the large-scale, sequence-dependent
properties of polymeric DNA chains are difficult to discern in such
studies. The normal modes of infinitely long DNA, however, have been
examined by Prohofsky and associates (Hua and Prohofsky, 1988
; Chen and
Prohofsky, 1995
) by repeating a short fragment of DNA and taking
advantage of helical symmetry to reduce the number of independent
coordinates. The complexity of chains that can be studied from this
perspective is again limited by the length of the repeating unit.
In the present investigation, we examine the motions of long fragments
of DNA by taking advantage of "knowledge-based" harmonic energy
functions deduced from known high-resolution crystal structures of DNA
(Olson et al., 1998
). The deformations of individual dimers are
described by six independent "step" parameters which specify the
spatial arrangements of neighboring base pairs: three angular variables
called Tilt, Roll, and Twist and three variables called Shift, Slide,
and Rise with dimensions of distance (Dickerson et al., 1989
). This
rigid-body representation of base pairs significantly reduces the
number of independent variables per chain molecule, thereby making it
possible to study the normal modes of much longer DNA fragments than
could heretofore be treated in either Cartesian or dihedral angle
space. In addition, the elastic energy incorporates the
sequence-dependent anisotropy of DNA bending as well as the known
correlations of base-pair step parameters. The only missing structural
information is the detailed conformation of the sugar-phosphate backbone, including the charged phosphate groups. The latter atoms and
the surrounding aqueous solvent, such as the water molecules and
counterions in the first or second solvation layer around the DNA, are
implicitly treated in the energy terms so that their omission
introduces no serious errors when duplex deformations are limited to
energies of the order of kBT (where
kB is the Boltzmann constant and T
the temperature in Kelvin). It should be noted, however, that the
surrounding solvent does not have a viscosity and that we do not
consider the damping effect of solvent on large-scale polymeric
properties in this study. This low-resolution model, nevertheless,
provides a straightforward way to deduce the effects of sequence on
global folding, something that is said (Berendsen and Hayward, 2000
)
still to elude analyses of the slow conformational changes of proteins.
In principle, there is no limit on the length of DNA that can be
described in this manner so long as the constituent dimers obey the
harmonic energy model, i.e., there are no excursions of the molecule
from the classical B-form double helical structure. Longer chains are
characterized by a greater number of normal modes, which when
appropriately weighted and superimposed, give rise to a broader
distribution of global molecular configurations. In practice, chain
length is restricted by the form of the kinetic energy, which assumes
that changes in atomic coordinates brought about by the fluctuations of
individual "step" parameters are small. These upper limits preclude
the need to treat the long-range electrostatic self-repulsion that
influences the folding of long supercoiled DNA molecules (Fenley et
al., 1994
; Vologodskii and Cozzarelli, 1995
; Westcott et al., 1997
).
We focus attention here on the normal modes of regularly repeating,
linear polymers free of bound ligands to establish a point of reference
for studies to be reported elsewhere of arbitrary DNA sequences and of
chains with ends held in place by specific anchoring conditions, such
as the looped configurations imposed by protein binding. We compare our
simplified energy model with previous all-atom treatments of short
oligomers and with ideal elastic rod representations of polymeric DNA.
We identify the local base-pair fluctuations responsible for the
dominant (lowest frequency) normal modes. The derived polymeric
properties account satisfactorily for the overall bending, i.e.,
persistence length, of the double helix and reveal the critical role of
sequence in global twisting. The computations further suggest that the
constants impeding the large-scale stretching of single DNA molecules
are related to the concerted bending, twisting, and lateral
displacements of neighboring base-pair planes as well as to their axial
separation. We identify particular sequence contexts under which ideal
elastic behavior breaks down, concentrating on conditions that induce the mesoscopic bending anisotropy which underlies DNA loop formation. Finally, by comparing the end-to-end dimensions of the simulated double
helices with exact values obtained from standard matrix formulations of
polymer configurational statistics (Flory, 1969
; Maroun and Olson,
1988
; Marky and Olson, 1994
), we investigate the chain lengths at which
the normal mode analysis of DNA modeled in terms of base-pair steps is valid.
| |
METHODS |
|---|
|
|
|---|
Base-pair representation
The atoms of each base pair lie in the plane of a local,
orthogonal coordinate frame (x, y, z),
which is defined in accordance with recently established guidelines (Olson et al., 2001
). (Note: The x and y axes lie in the plane of the base pair, with x pointing in the
direction of the major groove along the pseudodyad axis of the base
pair and y running along the long axis of the base pair in
the direction of the leading (sequence) strand, parallel to the
C1' ··· C1' vector, and displaced so as to pass through the
intersection of x with the vector connecting the pyrimidine
Y(C6) and purine R(C8) atoms. The z axis is perpendicular to
the base-pair plane, pointing in the 5'-3' direction of the leading strand.) The coordinate system of the kth base pair is
further characterized by the position of its origin
ok and the rotation matrix
Rk, which relates the local base-pair
frame to a fixed, orthonormal global reference frame. The rotation
matrix is given by
[
x
y
z], where the 
(
= x, y, z)
are unit vectors, expressed in columns, along the positive coordinate
axes of the base-pair frame.
The relative position and orientation of the (k + 1)th
base pair with respect to its predecessor k are given
respectively by the difference between coordinate origins,
vk,k+1 = (ok+1
ok) and the product of the rotation
matrix Rk+1 and the inverse of
Rk, i.e., the dimer transformation matrix
Tk,k+1 = Rk+1(Rk)
1.
The components of the projection of
vk,k+1 on the coordinate
axes of base pair k, (
x,
y,
z), are used as
translational parameters in the present calculation.
The matrix T can also be expressed in terms of the rotation
of magnitude
around the axis u which brings the coordinate frames on successive base pairs into coincidence, where the
components of the unit vector u are
(ux,
uy, uz)
and the angle
equals (





). The
elements of T, so expressed, are given by (Jeffreys and
Jeffreys, 1946
):
|
(1) |

µ is the Kronecker delta, i.e.,

µ = 1 when
= µ,

µ = 0 when
µ, and

µ
= ±1 when
, µ,
is an even or an
odd permutation of 1, 2, 3, respectively, and vanishes otherwise. The
rotational components (
x,
y,
z) are equated
to (Tilt, Roll, Twist) after the definition of Babcock et al. (1994)The translational parameters between base-pair planes are generally
expressed in terms of a "middle" frame so that numerical values are
independent of the direction from which a DNA structure is analyzed (Lu
and Olson, 1998
), i.e., either the sequence or the complementary
strand. The translational parameters defined in this work in terms of
the kth coordinate frame are related to Shift, Slide, Rise
values in the literature, e.g., (Olson et al., 1998
), through the
matrix T1/2 that effects the halfway rotation
(
/2) between neighboring base-pair planes:
|
(2) |

(
= x, y,
z) on Tilt, Roll, Twist (through their incorporation in
T) as well as on Shift, Slide, Rise precludes description of
the local conformational energy in terms of a quadratic expression in
the six base-pair step parameters. To overcome this limitation, we
express the rotation matrix in Eq. 2 in terms of the equilibrium
"rest" values of Tilt, Roll, and Twist, i.e.,
T1/2
To1/2. The latter approximation is
valid so long as the dimer structure does not depart significantly from
this minimum energy state.
Normal mode analysis
Our treatment of the normal modes of DNA at the level of
"step" parameters builds upon general formulations (Noguti and
G
; Braun et al., 1984
; Higo et al., 1985
; Levitt et al.,
1985
) previously developed to describe the collective motions of
proteins in terms of internal chemical coordinates, i.e., dihedral
angles. The conformational potential energy V of a double
helix of N base pairs is thus approximated by the
multi-dimensional parabola,
|
(3) |

i is the instantaneous
fluctuation of the ith "step" parameter from its
equilibrium value and fij is an
element of a 6(N
1) × 6(N
1) matrix of
elastic force constants, F, i.e., the terms describing the
potential deformability of the six parameters in each of the
N
1 base-pair steps.
The kinetic energy K is similarly expressed in quadratic
form in terms of 
i with respect to time,
and the weighted "mass" coefficients
hij of the 6(N
1) × 6(N
1) matrix H (defined below):
|
(4) |

i and


and 
|
(5) |
|
(6) |
n is the frequency,
n the phase angle,
n the amplitude of the nth normal
mode, and Ain the fractional
contribution to the ith "step" parameter from the nth normal mode. (In practice the components of

and the frequencies
n are
obtained by introducing the vector of normal coordinates Q
defined by 
= AQ, where the elements
of Q are
n
cos(
nt +
n) and A is a 6(N
1) × 6(N
1) transformation matrix. Eq. 5 can then
be rewritten as the eigenvalue expression, HA
= FA, with ATHA equated to
the identity matrix of order 6(N
1) and the normal
mode frequencies and directions found from the diagonalization of
F, i.e.,
= ATFA, where AT
denotes the transpose of A and
is a diagonal matrix with elements
nn = 


i at time t are thereby expressed as a linear combination of harmonic
oscillators, the energies of which are proportional to


n, so that relatively
few (low frequency) modes are responsible for the large-scale
deformations of the molecule.
Kinetic energy
The kinetic energy coefficients in Eq. 4 incorporate the mass
ma and the Cartesian coordinates
ra of individual atoms in the DNA
through the relationship:
|
(7) |
ra/
i)
developed to treat the normal mode dynamics of a system of two
molecules, each of which moves in dihedral angle space (Braun et al.,
1984
Because the atomic motions
ra
brought about by "step" parameter change

i are assumed to be small in this common
frame (Eckart, 1935
), the kinetic energy term (Eq. 4)
effectively restricts the length of DNA which can be studied by normal
mode analysis. This limitation does not apply to the mesoscopic chains
lengths considered below, where typical room temperature fluctuations
of individual base-pair steps, such as the ±5° perturbations of Roll
which raise the potential energy by
~kBT/2, limit atomic movement to
0.02-0.07 persistence lengths. The persistence length of mixed
sequence DNA under ambient aqueous salt conditions, by comparison, is
the same magnitude as the contour length of a 150 to 200-bp chain
(Hagerman, 1988
; Smith et al., 1992
, 1996
; Bustamante et al., 1994
;
Bednar et al., 1995
; Baumann et al., 1997
). The limitation on chain
length is further discussed below in the analysis of DNA end-to-end dimensions.
Both backbone and base atoms must be included in the evaluation of the
hij. The sugar-phosphate backbone is
incorporated in the present calculations by the superposition of a
canonical B-form 5'-nucleotide helical fragment (Chandrasekaran and
Arnott, 1989
) in the reference frame of each base. Because
complementary base-pair parameters are fixed at ideal planar values,
each complementary nucleotide pair is thereby treated as a rigid body
and the small variations in backbone conformation that accompany
fluctuations in the geometry of neighboring base pairs are ignored.
Force field
The potential energy coefficients in Eq. 3 are taken from
knowledge-based elastic functions extracted from the three-dimensional arrangements of DNA base-pair steps in protein-DNA crystal structures (Olson et al., 1998
):
|
(8) |

i correspond to the instantaneous
fluctuations of each of the six "step" parameters of that dimer
from the B-DNA rest state, i.e., 
1 =
TiltXY, 
2 =
RollXY, 
3 =
TwistXY, 
4 =
ShiftXY, 
5 =
SlideXY, 
6 =
RiseXY (full details of the potential function are
available in Table S-1 in the Supplementary Material). If these
elements are collected respectively in the 6 × 6 force constant matrix GXY and the 6 × 1 vector

XY, the dimer deformation energy
VXY reduces to
(1/2)
XYTGXY
XY.
The choice of translational parameters and the aforementioned
approximation of T in Eq. 2 by the constant matrix
To introduce a linear relationship between the

XY and the corresponding elements of the
longer 6(N
1) vector 
used in Eq. 5. That is, 
XY = CXY
XY, where the
molecular deformation vector 
(Eq. 5) is built up from
the (N
1) sets of constituent

XY dimer fluctuations and
CXY is a 6 × 6 constant matrix
characteristic of the mean orientation of the XY dimer in B-form DNA.
The potential energy of the DNA as a whole can be expressed in the form
of Eq. 3 by constructing the 6(N
1) pseudodiagonal
matrix of force constants F from the
FXY = CXYTGXYCXY
corresponding to the DNA base-pair sequence. For example, the
fij of the self-complementary
(CGTACG)2 hexamer duplex would be collected in the 30 × 30 array,
|
(9) |
Because the lowest energy conformation of a given DNA sequence is self-evident from the knowledge-based force field, i.e., the minimum energy three-dimensional structure is defined by the set of average "step" parameters, the time-consuming energy minimization step normally carried out before normal mode analysis does not need to be performed in the present calculations.
The potential functions introduced below are first approximations of
the sequence-dependent structure of DNA based on the observed
conformations of selected dimers in high resolution protein-bound and
B-DNA crystal structures. Although mean (equilibrium)
sequence-dependent parameters have not significantly changed since
first reports (Gorin et al., 1995
; Olson et al., 1998
), the local force
constants are expected to change as new structural data accumulate.
Moreover, there are not yet sufficient crystallographic data to address DNA deformability beyond the dimer level despite suggestions (Nadeau and Crothers, 1989
) of longer-range organization of DNA structure.
End-to-end dimensions
The unperturbed, mean-square end-to-end distance of DNA,
r2
0, is obtained by three
independent approaches. First, exact values of
r2
0 are computed using
standard matrix methods for the mean extension of a DNA of N
base pairs, with
r2
0 equated
to the average scalar product of the end-to-end vector,
r
· r
0, and r defined as
|
(10) |
r2
0 and other (unperturbed)
averages denotes the omission of chain self-intersection and
solvent-polymer interactions in their calculation (Flory, 1969Second, we express
r2
0 in
terms of the fluctuations
r of the end-to-end vector with
respect to the position ro of terminal base
pairs in the (minimum energy) rest state,
|
(11) |
r, which is determined by the
fluctuations of "step" parameters, is written as a Taylor series
expansion in the 
i,
|
(12) |

r
0 is zero because


i
= 0 and the mean-square end-to-end
distance is greater than the separation of chain ends in the rest state
(because the self product, 
r ·
r
0, in Eq. 11 must be positive). Because such a result is inconsistent with the end-to-end displacement of a
naturally straight, inextensible DNA, we consider up to second-order terms of the Taylor series (Eq. 12) in the calculation with the result,
|
|
(13) |


i
j
are
taken directly from the experimental data used to derive the knowledge-based force field (Olson et al., 1998
r/
i), differ from similar
terms employed in Eq. 7 to compute the kinetic energy. As explained
above, the latter derivatives are computed in a carefully chosen
coordinate frame that essentially eliminates rotational and translation
motions of the molecule. The variation of chain ends required to
evaluate Eq. 13, by contrast, is performed in a common global
coordinate frame embedded in the plane of the first base pair.
Finally, we compute the root-mean-square end-to-end distance of DNA
molecules which are confined to the set of spatial configurations associated with one or more normal modes. That is, we use the set of
"step" parameters associated with mode n at time
t (Eq. 6) to generate the end-to-end vector r(n,
t) and the Boltzmann factor
(n, t) of each of the
polymeric states that comprise certain global motions and then evaluate
r2
0 as an energy weighted sum
over representative configurations:
|
(14) |

i(n, t)) (Eq. 3), thereby
excluding configurations where
(n, t) = exp[
V/kBT]
exp[
7.5].
| |
RESULTS AND DISCUSSION |
|---|
|
|
|---|
Elastic features of DNA repeating polymers
We start by determining the normal modes of four simple repeating
polymers
the poly dA · poly dT and poly dG · poly dC
homopolymers with a monomeric repeating unit (the AA · TT or
GG · CC dimer) and the poly d(AT) and poly d(GC) alternating
copolymers made up of sequential purine-pyrimidine (AT · AT or
GC · GC) and pyrimidine-purine (TA · TA or CG · CG) base-pair steps. We classify the motions, after Matsumoto and
G
, on the basis of the global structural changes revealed
through computer visualization of the eigenvectors associated with each
mode (i.e., Eq. 6). The color-coded spectrum of lowest frequency
bending, twisting, and stretching modes of a 200 bp chain is presented
in Fig. 1 for each of the four polymers and schematic illustrations of some of these motions are given in Fig.
2. Numerical values of the frequencies
are tabulated in the Supplementary Material (Table S-2). We focus on
the lowest frequency modes because the amplitudes of the normal modes
and their consequent importance to the overall motions of DNA diminish at higher frequency.
|
|
It should be noted that these low frequency modes would not be observed in a real system due to viscous effects, which we do not consider here. While our method does not necessarily describe the time-development of DNA chains in a real system, we show below that these low normal mode frequencies are closely related to the global dynamic properties of DNA, such as the bending rigidity, twisting rigidity, and stretching rigidity. Thus, the low normal mode frequencies presented here are useful indicators of the dynamic properties of DNA.
Each of the pure (planar) bending modes (blue lines at the left of Fig. 1) is doubly degenerate, describing mutually perpendicular motions with equivalent (bending) energy. The superposition of these modes leads to the global isotropic bending characteristic of an ideal elastic rod. In contrast to the bending motions, the twisting modes (green lines) are higher in frequency, more widely spaced, and more sensitive to polymer composition. Specifically, the frequencies of the twisting modes of the simulated alternating copolymers and the poly dG · poly dC homopolymer are lower in value than those of the poly dA · poly dT model. The stretching modes (red lines), which are also sensitive to base-pair sequence, tend to be even more widely spaced and to occur at still higher frequencies than the twisting modes.
The rod-like behavior of poly dA · poly dT is further
illustrated in Fig. 3 where the chain
length dependence of the computed low frequency modes is superimposed
on the predicted variation (Strutt, 1894
) of the bending, twisting, and
stretching frequencies of a homogeneous elastic rod. (The frequency of
the nth bending mode of an elastic rod of length
L with uniform circular cross-section is given by



). The
twisting and stretching frequencies follow a similar form, varying
directly with n and inversely with L, i.e.,




|
The mechanical constants associated with the overall elastic behavior
of the four repeating polymers are collected in Table 1. These values are obtained by
substituting the computed normal mode frequencies in the classical
expressions for the normal modes of an ideal elastic rod (Strutt,
1894
). Specifically, the bending constant A, the twisting
constant C, and the stretching constant Y are
given by
|
(15) |
n are the computed frequencies
of the nth bending (b), twisting (t),
or stretching (s) mode, M is the total mass of
the molecule, L is the length of the DNA helical axis (see
Kosikov et al., 1999
|
The sensitivity of the calculated twisting constants to base-pair
sequence may account in part for the wide range of experimentally derived values of C (Barkley and Zimm, 1979
; Thomas et al.,
1980
; Hurley et al., 1982
; Millar et al., 1982
; Horowitz and Wang,
1984
). The differences in measured twisting constants, usually
attributed to experimental effects, such as the bending strain that
accompanies loop closure (Heath et al., 1996
), may also reflect the
intrinsic deformability of the constituent base pairs. The greater
overall twisting stiffness of poly dA · poly dT compared to
the other polymers in Table 1 stems in part from the substantially
higher cost of distorting AA dimers compared to other base-pair steps in the knowledge-based potential (see Table S-1 in the Supplementary Material and discussion below).
The stretching constants Y extracted here, however, are two-
to threefold greater than values deduced from recent single-molecule manipulations of DNA (Smith et al., 1996
; Baumann et al., 1997
; Wang et
al., 1997
; Bouchiat and Mezard, 1998
). As discussed below, the relative
ease of global stretching reflects a number of local factors, in
addition to the force constants that govern the axial, i.e., van der
Waals', separation of neighboring base-pair planes. In other words,
the global stretching of DNA is not simply a function of Rise.
Matsumoto and G
previously derived DNA elastic constants
on the basis of a normal mode analysis in the dihedral angle space of
short (24-36 bp) double helical molecules, with poorer correspondence
between computation and experiment than the present work, i.e., the
previously computed bending and twisting constants are respectively
three to ten times greater than the corresponding values in Table 1.
The improved agreement found here seemingly reflects our use of
knowledge-based potential energies rather than an atomic force field.
Optimization of DNA conformation on the basis of the detailed
interactions of all atoms typically yields a large number of closely
spaced minimum energy substates of similar structural character (Poncin
et al., 1992
). The normal mode analysis of DNA in dihedral angle (and
Cartesian) space is performed with respect to one of these minimum
energy states and the transitions between closely related, competing
minima are not considered. The statistical approach used to generate
our elastic force field, by contrast, reflects a large number of known crystal structures, each of which is analogous to one of the minimum energy substates found through all-atom energy minimization.
Therefore, any fluctuation on our elastic energy surface is comparable
to a transition between different minimum energy points derived with an
atomic force field. The correspondence between the mechanical constants
in Table 1 and values extracted from experiment suggests that
transitions between conformational substates have a significant influence on the global properties of DNA.
A similar argument is made by Song and Schurr (1990)
in accounting for
the unusually large values of DNA persistence length (~2100 Å)
deduced from measurements of transient electric dichroism. They offer
three possible reasons for the large discrepancy between their results
and the persistence lengths for mixed sequence DNA (~500 Å) obtained
with other experimental approaches. One explanation, which they favor,
is that the potential energy of DNA bending is not a smooth quadratic
function and that the energy exhibits several discrete minima separated
by barriers. In the short-time scale of their experiment the DNA is
trapped in one of the minima and the persistence length thereby becomes larger.
It should be noted that there is an open controversy over the
ionic strength dependence of the persistence length and the lower limit
of the persistence length of mixed sequence DNA at ambient salt
conditions. We cite a DNA persistence length of 500 Å in Table 1
consistent with a variety of independent measurements. As noted above,
larger values of the persistence length are deduced in some work, e.g.,
(Song and Schurr, 1990
). There is similar disagreement among elastic
constants obtained for specific DNA sequences (see Table S-3 in the
Supplementary Material) which may be related in part to the discussion
above, i.e., chains may sample only a small part of the
energy surface in some experiments but may explore vast areas of
conformation space in others.
Sequence effects on global motion
Figs. 4-6 illustrate the deformations of local "step" parameters, which are collectively responsible for selected low frequency normal modes of the aforementioned repeating polymers. These plots, for DNA chains of 120 bp, capture the sequential fluctuations of each conformational variable at the moment when the potential energy of the molecule is raised by kBT/2; the fluctuations of all parameters are reversed a half cycle later of the mode. As detailed below, the mechanisms used to effect overall chain motions reflect the conformational properties of the constituent dimer units.
|
|
|
Bending
The in-plane bending of poly dA · poly dT, detailed in Fig. 4, takes advantage of the local bending anisotropy of the double helix, i.e., the tendency of DNA to deform through rolling rather than tilting motions and an intrinsic feature of the knowledge-based AA · TT potential. The same patterns of angular distortions are found in the corresponding modes of the other three polymers and thus are not reported here. The regular patterns of Roll deformations in the illustrated examples (dashed lines) resemble early "mini-kinked" models of global DNA bending (Olson, 1979
= 4L/(2n + 1), where
L is the contour length of the DNA. The upper limit on the
number of bending modes n is then obtained by setting the
half wavelength corresponding to the distance between two adjacent
points of inflection to 10 bp, i.e.,
/2 = 10 bp. The maximum
n for a 120-bp stretch of poly dA · poly dT is thus 12, corresponding to a bending frequency of ~1.4 cm
1 in
an elastic rod with equivalent material properties and contour length.
In the case of the poly dA · poly dT 120-mer, we find a pair of
bending modes with close frequencies below the theoretical frequency
limit at 1.07 and 1.08 cm
1 and with 11 points of
inflection, separated by ~10 bp, in each of these modes. As
anticipated from elastic theory, there are no other such pairs of
degenerate bending modes at higher frequency.
The coupling of local "step" parameters in the DNA force field
gives rise to changes in neighboring base-pair displacement as the
chain bends globally (see Fig. S-1 in the Supplementary Material for
plots of the local translational fluctuations associated with the
bending of individual polymers). The deformations in Roll which
dominate the overall molecular motion are accompanied by changes in
Slide, whereas the lesser changes in Tilt introduce concomitant
fluctuations of Shift. The extremes of deformability found in
pyrimidine-purine (YR) versus purine-pyrimidine (RY) steps, i.e., the
enhanced flexibility of YR steps versus the natural stiffness of RY
steps (Olson et al., 1998Twisting
As evident from Fig. 5, the lowest frequency global twisting motions of regular DNA polymers are dominated by fluctuations in local Twist (thick solid lines). The dimer deformations build up from the ends of the molecule with the largest deformations of neighboring base-pair steps found in the center of the chain. The alternating copolymers show the zig-zag pattern of local conformational change noted above with greater distortions occurring at YR steps. The degree of local Twist deformation of poly dA · poly dT is smaller than that in the other chains as expected from its higher global twisting constant (Table 1). The increase in Twist angles in Fig. 5 is accompanied by a concomitant decrease in Roll values (dashed lines), the decrease in local bending depending on the strength of the energy coupling term, e.g., small for GG compared to AA and YR steps. The translational parameters show similar sequence-dependent, coupled variations that reflect the knowledge-based energy function. For example, fluctuations of Shift are negligible in the alternating copolymers where the Twist-Shift energy coefficient is by definition zero. (The directional dependence of Shift (Dickerson et al., 1989Stretching
The global stretching of DNA homopolymers and alternating copolymers stems primarily from changes in Rise that grow to a maximum in the middle of the chain (thick solid lines in Fig. 6). The computed, sequence-dependent variation in Rise in the lowest frequency stretching mode reflects the local force constants, with CG steps much more easily deformed than any other dimer. The changes in Rise are coupled, via the elastic energy potential, to fluctuations in Shift in the homopolymers (thin solid lines in Fig. 6) and to Slide in the copolymers (dashed lines in the figure). Indeed, the variation of Shift is comparable to that of Rise at GG steps as is the change in Slide relative to that of Rise at TA steps. Poly d(GC) is unusual in stretching with an alternation in the direction of Slide at YR and RY steps. As discussed below, the coupling of Rise to Shift or Slide contributes to the overall stretching force constants. Concerted movements of base pairs which mimic the conformational behavior of stretched DNA fibers (Chandrasekaran and Arnott, 1989