The submicrosecond bending dynamics of duplex DNA were
measured at a single site, using a site-specific electron paramagnetic resonance active spin probe. The observed dynamics are interpreted in
terms of the mean squared amplitude of bending relative to the
end-to-end vector defined by the weakly bending rod model. The bending
dynamics monitored at the single site varied when the length and
position of a repeated AT sequence, distant from the spin probe, were
changed. As the distance between the probe and the AT sequence was
increased, the mean squared amplitude of bending seen by the probe due
to that sequence decreased. A model for the sequence-dependent internal
flexural motion of duplex DNA, which casts the mean squared bending
amplitudes in terms of sequence-dependent bending parameters, has been
developed. The best fit of the data to the model occurs when the
(AT)n basepairs are assumed to be 20% more flexible than
the average of the basepairs within the control sequence. These
findings provide a quantitative basis for interpreting the kinetics of
biological processes that depend on duplex DNA flexibility, such as
protein recognition and chromatin packaging.
 |
INTRODUCTION |
The flexibility of duplex DNA is important for
its functionality, particularly in the areas of protein-DNA binding
(Hogan and Austin, 1987
), interactions with architectural transcription proteins (Wolffe, 1994
; Grove et al., 1996a
; Nardulli et al., 1996
),
packaging DNA into chromatin (Richmond et al., 1984
; Patikoglou and
Burley, 1997
; Hagerman, 1988
), strand exchange (Thompson et al., 1976
),
and deletion formation (Chedin et al., 1994
). In 1988, Hagerman (1988)
concluded that little convincing evidence existed in favor of the
hypothesis that DNA flexibility is sequence dependent. In 1994, Harrington and Winicov (1994)
extensively reviewed the interactions
between DNA and a number of proteins in which "the relatively new
concept of sequence-directed structural softness or flexibility" is
implicated as a physical basis for DNA-protein recognition.
Until now, there have been no experiments quantitatively evaluating the
sequence-dependent flexibility in duplex DNA. A number of specific
sequences have been suggested as more flexible; in particular, the
dinucleotide CA and TA steps have been suggested as candidates for
regions of increased flexibility based on the results of gel mobility
assays (Harrington and Winicov, 1994
). Harrington suggests that
sequence-dependent flexibility may account for unexplained differences
in the gel mobilities between GGGCCC motifs and AAAAAA tracts in
cyclization assays (Dlakic and Harrington, 1995
). In the formation of
chromatin structures, long runs of homopolymers (dA)-(dT) and (dG)-(dC)
are excluded from the DNA-histone packaging, presumably because of
rigidity (Drew and Travers, 1985
), while sequences containing AAA, AAT,
and TA repeats bind well (Patikoglou and Burley, 1997
). Hogan and
Austin examined bacteriophage 434 repressor binding affinity for DNA
sequences (Hogan and Austin, 1987
) and concluded that protein binding
correlated with the basepair sequence central to the operator binding
sites. It was suggested that binding was proportional to a sequence's
ability to twist or flex and that the observed variation in DNA
stiffness was sequence dependent.
When there is a change in the dynamics due to a particular change in
sequence in duplex DNA, it occurs for one of two reasons. The first
could be a global change in the set of secondary structures induced by
a specific sequence. An example of this was presented by Schurr and
co-workers, who studied the effect of a 16-bp (CG)8 insert
in a linear 1.1-kbp duplex DNA sequence. Their data implied that the
insert induced a large change in secondary structure throughout the
molecule (Kim et al., 1993
). The second reason for a change in dynamics
is that a sequence has a locally different flexibility, which in the
context of a large section of DNA may affect the dynamics of other
bases through the internal collective modes. An example of a local
change in flexibility affecting dynamics or kinetics elsewhere in the
DNA is the process by which inserts are deleted between directing 18-bp
repeats flanking the deletion segment (Chedin et al., 1994
).
In general, if an effect is local but is communicated via the DNA to
other parts, then the effect should be characterized by a fall-off with
distance. Fall-off of an effect with distance is a characteristic of
structural alterations or allosterism. It is also a characteristic of
internal dynamic twisting correlations (Schurr and Fujimoto, manuscript
submitted for publication) and internal dynamic bending correlations,
as will be shown later. One example of such a fall-off that was
interpreted in terms of allosterism is that of the effect of a
(CG)4 insert on the rate of cleavage by restriction enzymes
at specific sites. The state of the (CG)4 insert was
controlled by the presence of
[Co(NH3)6]3+ and, presumably,
varied locally between B- and Z-form DNA. When the insert adopted an
alternative right-handed structure, a severalfold enhancement in the
cutting rate was observed for sites up to 50 bp from the cutting site.
Even in the absence of
[Co(NH3)6]3+, an enhancement in
the cutting rate was observed when the insert was placed up to 50 bp
from the cutting site in linear DNA (Aloyo et al., 1993
; Schurr et al.,
1997a
; Ramsauer et al., 1997
). The above example illustrates how
long-range structural correlations can propagate over a finite domain
and die off with distance.
In this paper we will show that the observed changes in flexibility
seen by an electron paramagnetic resonance (EPR) active probe are
quantitatively well explained by a modified weakly bending rod model.
This model supports the ideas that 1) increased motion may be due to a
local change in the flexibility of the DNA molecule and 2) certain
sequences of DNA may have different propensities for bending. Examples
of the consequences of local changes in flexibility for structure or
function will be discussed.
We have previously investigated DNA dynamics by EPR, using a nitroxide
rigidly fused to a pyrimidine base, Q, paired to
2-aminopurine (2AP) (Fig. 1) (Miller et
al., 1995
; Okonogi et al., 1999
). It was shown that the Q-2AP basepair
reports the dynamics of the attached DNA and that the probe motion
independent of the attached DNA was small relative to the motion
induced by the DNA. In particular, this probe is primarily sensitive to
bending dynamics and thus provides an excellent method for
quantitatively determining the flexibility of DNA. In the following
experiments, the probe is used in a site-specific manner by being
placed in a small region of duplex DNA whose sequence is kept fixed.
The DNA is linear, relaxed, and nonstressed. At well-defined distances from the probe, there is a test region in which the basepair sequence is varied. The effect of the test region on the dynamics of the site-specific probe is interpreted as differences in the coupling of
basepairs to one another. The sources of the dynamics include 1) the
overall tumbling of the DNA, 2) the internal collective modes of
motion, and 3) the motion of the probe (including the basepair)
independent of the macromolecular environment. To test the effects of
the collective modes of motion on the probe, the following experiments
were designed so that 1) an 11-bp region containing the probe and 2)
the overall length of the DNA remain constant throughout the
experiments. These two constraints on the DNA ensure that the overall
tumbling and the length-independent motions of the probe are constant
among the experiments. The internal collective modes provide the only
mechanism for motions in one region of the DNA to affect the probe.
Preliminary results on the sequence-specific bending of duplex DNA,
obtained with a site-specific probe, have previously been reported
(Okonogi et al., 1997
, 1998
, 1999
).

View larger version (12K):
[in this window]
[in a new window]
|
FIGURE 1
Basepairs G-C, 2AP-Q, and A-T. 2-Aminopurine
(2AP)-Q is a modified adenosine-thymidine basepair. The
guanosine-cytosine and adenosine-thymidine basepairs are shown for
comparison.
|
|
The present measurements pertain to the dynamic bending rigidity that
governs the submicrosecond flexural dynamics of DNA. Its corresponding
dynamic persistence length is Pdb = 1500-2000 Å (Okonogi et al., 1999
; Naimushin et al., 2000
), and
because of its considerable stiffness, it partially contributes to the equilibrium mean squared bending between basepairs, to the effective equilibrium bending rigidity of DNA and to Ptot.
As discussed elsewhere (Schurr et al., 1997a
,b
; Okonogi et al., 1999
;
Naimushin et al., 2000
), Ptot apparently
contains contributions not only from dynamic bends and
sequence-dependent permanent bends, but also from slowly relaxing bends
that arise from fluctuations between distinct secondary conformations
with different intrinsic curvatures. Consequently, the bending rigidity
is time-dependent, relaxing from its initially stiff value
characteristic of Pdb = 1500-2000 Å to a
value about half as great at long times. By themselves, the present
experiments provide no information about the variation of this slowly
relaxing contribution to mean squared bending with DNA sequence. Thus
any slowly relaxing contributions to the equilibrium bending rigidity
conceivably might behave differently.
 |
A MODIFIED WEAKLY BENDING ROD THEORY |
To investigate the sequence dependence of DNA flexibility, we
extend the theory developed by Schurr and co-workers for the elastic
motion of the bending of duplex DNA (Wu et al., 1987
; Song and Schurr,
1990
; Schurr et al., 1991
; Okonogi et al., 1999
). The weakly bending
rod theory assumes that the DNA is stiff enough that the twisting modes
of motion are uncoupled from those of bending. The DNA is assumed to
behave like a flexible rod that has mean local cylindrical symmetry.
The uniform rotational modes of motion are 1) rotation about the
cylinder axis (the z direction) and 2) rotation about the
x and y axes. Each basepair is coupled to its
neighbors by a harmonic bending potential, which generates a torque to
each basepair and thereby governs the extent of angular rotations due
to bending. The internal potential is a quadratic function of the
differences between the angular displacements of successive bond
vectors from the end-to-end vector. The force constant between
basepairs,
, is taken to be the same for rotation about x
and y, where
and
are the angles of rotation about x and y, respectively. For the case where the
force constants between basepairs,
, are independent of DNA
composition, the internal potential is
|
(1)
|
This can be recast into a matrix form:
|
(2)
|
The A matrix is a second-order finite-difference
matrix. Explicitly, for the example of 4 bp, A is
The mean squared amplitudes of internal bending can be found from
this potential (Wu et al., 1987
; Song and Schurr, 1990
):
|
(3)
|
where k is Boltzmann's constant, T is the
absolute temperature, d
N = d
1d
2d
3...d
N,
and Q is the transformation matrix that diagonalizes
A. The matrix
Q
1Qt = A
1 is the pseudoinverse of A.
is the diagonal matrix that contains the eigenvalues of A.
The uniform sheer mode eigenvalue of A is zero and is
removed from the inverse, as it is not one of the internal bending
modes. The total mean squared amplitude of bending,

i2(
)
, is the sum of the two bending
contributions plus a length-independent contribution,

o2(
)
, to the basepair at the ith
position:
or more simply,
|
(4)
|
As has been shown elsewhere (Hustedt et al., 1993
; Allison et al.,
1982
),
The mean squared amplitudes increase nearly in proportion to the
length of the DNA. The mean squared amplitude of motion is the quantity
that is obtained from the experiments. This may be compared to a
related quantity: the mean squared difference amplitude. This quantity
is found from the orientation difference 
i =
i+1
i, from which it follows that


i2(
)
= 2(kT/
), by Eqs. 3
and 4. This quantity describes the extent of bending relative to the
neighboring bases. The extent of bending of the difference amplitude is
independent of DNA length and of position in the duplex. This may be
contrasted with the total amount of bending,

i2(
)
, reported by the probe, which
increases in proportion to the length. This is an important
distinction, which will become useful shortly.
We now extend the weakly bending rod theory to consider the possibility
of different force constants between different basepairs along the DNA
molecule. In principle, each nearest-neighbor basepair interaction can
have its own force constant. The bending force constants are still
nearest neighbor and the potential energy has only nearest-neighbor
interactions, but the value of the force constants may depend upon
sequences that may include many basepairs in the immediate vicinity.
Let
represent the mean force constant (
= ((1/N)
j=1N
j)), and let
rj be the ratio of
j/
.
Then Eq. 2 can be written in the same form, as originally developed,
and
can be replaced by
. The A matrix is not as
simple as given in Eq. 3. Now the potential matrix, A, is
written in terms of the ratios of force constants. For the example of 4 bp,
|
(5)
|
At present, we make some simplifying assumptions in the model.
Rather than develop a set of force constants unique to each dinucleotide pair from 5' to 3', we have chosen to assume that
(AT) =
(TA) and that all force constants outside the region of the test sequence are the average value. In future papers, we will
explore more elaborate models. For now the model admits nonunity force
constant ratios between the A-to-T basepairs only in the region of the
test sequence, while the control sequence of basepairs flanking the
variable test sequences is controlled by a mean force constant,
. The matrix in Eq. 5 is used in the following way. The force
constants between basepairs in the sample molecule outside of the test
sequence are set to a mean value,
, or
rj = 1 in this region. A test sequence
insert can have a set of ratios, rj, that depend
on the details of the sequence. For the test sequence inserts presented
here we will assume that there is only a single force constant between
basepairs,
', which is different from the mean. Then r =
'/
is the single ratio that describes the test
sequence as having different forces between basepairs. r
will be less than 1 for sequences that are more flexible than the
average and greater than 1 for sequences more rigid than average. In
general,
|
(6)
|
In nearly all cases, the probe is placed at position i = 6 in the DNA. The experiment monitors the mean squared amplitude of motion, 
62(
)
, due to the collective
modes seen by the probe at position 6. There are N + 1
basepairs, of which L are distinct in the sense that
r
1, and they range from position
N1 to N2. The length of
the distinct test sequence is L = N2
N1 + 1. There is a single distinct force constant
ratio, r, for the L distinct basepairs in the
test sequence. For this simple model, we can vary the length of the
DNA, N + 1; the length of the insert, L; the
starting position, N1; and the force constant
ratio, r. The matrix A is a function of
N + 1, L, N1, and r. As an
example, for the case where L = 3, N1 = 3, and N + 1 = 8,
|
(7)
|
The mean squared amplitudes are explicitly a function of
r, 
i2(
)
(r) = 
i2(
)
. With this simple model, we can
explore the effects of sequence variation at a site distant from the
probe. Sequence variation is modeled by different values of the bending
constant ratio, r. When r = 1, there is no
difference in dynamics between the test and control sequences and no
dependence on the length of the test sequence or its distance from the
probe. The DNA outside of the test sequence insert region will always
have r = 1 and for our purposes here will be referred
to as "average" DNA. When r
1 in the test
sequence region, we measure that change in flexibility as the
difference in flexibility due to the difference between r
and 1:
In Fig. 2, the theoretical
difference bending amplitude
r
62
is shown as a function of
r, the ratio of force constants, for various lengths of test
sequences and lengths of DNA. All test sequences started at position
N1 = 7, and the difference mean squared
bending amplitude was calculated at position 6. The difference mean
squared amplitude is nearly linear with the inverse ratio of the force
constants, r, and is nearly proportional to the length of
the test sequence, L. From Eq. 5, if all bases are of
flexibility r
, then A(r) = rA(1).
In this limit then,
r
ii2
(1/r
1)Aii(1). We find that the
following empirical functional form (Eq. 8) satisfactorily simulated
the computations using Eq. 6 and matrices of the form of Eq. 7. For the
case where i = 6 and N1 = 7,
|
(8)
|
No is the length of the DNA at which
r
62
becomes independent of
N and demonstrates that there is a limit to the amount of
DNA that can be added to enhance the difference effect due to an insert
of length, L, and ratio, r. Fig. 2 compares this analytic form (Eq. 8) with the exact results. The difference mean squared amplitude,
r
i2
, depends
approximately exponentially on the ratio of the length of the DNA to
No.

View larger version (30K):
[in this window]
[in a new window]
|
FIGURE 2
Mean squared bending amplitudes,
r 62 (rad2), and rms
angular displacements, rms (deg), are shown as a
function of the force constant ratio, r, for DNAs of length
N + 1 = 50, and test sequences of lengths
L = 5 ( ), 10 ( ), 15 ( ), and 20 ( ), and for
DNAs with test sequence lengths L = 20 and N + 1 = 50 ( ), 100 ( ), 200 ( ), and 400 ( )
basepairs. All sequences begin at position N1 = 7, and the mean squared bending amplitude is measured at position
6. kBT/ = 0.00272 rad2, a typical value (Okonogi et al., 1999 ).
N0 = 70 bp. Estimates of
r 62 values from Eq. 8 are
plotted as lines and compared with the exact calculations (shown as
icons).
|
|
Fig. 3 demonstrates theoretically the
effect that an insert with force constant ratio r can have
on the internal amplitudes of bending seen by a nearby probe.
r
i2
is plotted as a function of
the total length of the DNA for a fixed insert and a fixed distance
between the insert and the probe. Initially there is a 20-mer DNA with
the probe, Q, at position 6, N1 = 11 and N2 = 20, and the test insert
has force constant ratio, r. L = 10 bp is the length of
the test sequence insert. The extend left curve (triangles
pointing left) extends the initial 20-mer to the left of the
initial DNA with a region of average DNA (r = 1 in this
region). Notice that the difference drops off to nearly zero at ~200
bp. The extend right curve (triangles pointing right)
extends the original 20-mer to the right side of the test sequence, so
that the test sequence is between the added average DNA and the probe.
The difference increases with increasing length of DNA and becomes
independent of DNA length at ~400 bp. For the case where the DNA is
extended in both directions, the curve (diamonds)
demonstrates that the effect is an average of the effects due to the
extension to the left and to the right. Empirical functional forms,
which satisfactorily model the computations, are given in the legend
and are shown as solid lines in Fig. 2.

View larger version (21K):
[in this window]
[in a new window]
|
FIGURE 3
Plots of the difference in the mean squared amplitudes
of bending as a function of Q (probe) and L (test
sequence) placement in three uniquely growing chains of DNA for
r = 1 and r = 0.815. Every curve begins
with a 20-mer DNA with Q at position i = 6, N1 = 11, and N2 = 20,
where L = 10 bp in length. The extend right curve ( )
extends the original 20-mer towards the 3' end away from Q
and L. The extend left curve ( ) extends the initial
20-mer to the left, or 5' of Q. The extend both curve ( )
extends the initial 20-mer on both ends of the DNA simultaneously and
appears as a linear combination of the right and left, as seen in the
fitting formulas:
The equation for extending in both directions simultaneously is an
average of the left and right equations, using 30% of the extend right
curve and 70% of the extend left curve. All three curves depend
exponentially on the length of the DNA, with a characteristic
relaxation length of No = 70 bp and
N = (N 2)/2.
|
|
These calculations provide insight into how the experiments should be
designed to best detect the effect of an insert with a different
inherent flexibility. The test insert should be located between the
spin probe and a long region of average DNA. The region of average DNA
enhances the difference in bending amplitudes observed by the probe up
to about No = 70 bp; beyond that the
increase in the difference is negligible. Notice that the probe's mean squared amplitude of motion increases without bound, but the difference in amplitudes due to the test sequence becomes a constant value as the
length of DNA goes to infinity. It is interesting to note that, when
the test region is between the probe and the average DNA, the presence
of average DNA is beneficial in amplifying, and never overshadows, the
motional effect of the test sequence on the probe. In contrast, when
the probe is between the average DNA and the test insert, the average
region can overshadow the effect of the test sequence.
We now consider how a sequence will affect the internal motion, when
that sequence is moved further away from the probe, which is part of
the base where the bending amplitudes are being monitored. In Fig.
4, the mean squared amplitude is shown as
a function of the test sequence's starting position,
N1, for representative lengths, L,
and ratios, r, when N is held constant. According to the theory, the initial amplitudes,
Ao(r; N, L), are dependent upon
N, r, and L. The mean squared amplitude, seen at
position 6, evolves toward the average value of the N + 1-mer (where all r = 1, as given, for example, by
the NT sequence) as N1 increases to the length
of the molecule (N1 = N + 1). The rate
at which the effect of the test sequence falls off, however, is
independent of r and L. This remarkable result
was not obvious from the model and simplifies our interpretation of the
effect of an insert on the motion seen at the probe. To demonstrate the
effect of increased N1 at constant N,
overlaid on the mean squared amplitudes in Fig. 4 is an exponential
function of the form
|
(9)
|
where Ao(r; N, L) and
Bo(r; N, L) are the least-squares
optimized fitting parameters.
N
is the correlation
length in basepairs and represents the extent to which the mean squared
amplitude is affected by that insert. The overlay of a single
exponential, in Fig. 4, shows that this model function is a
satisfactory approximation to the dependence of the mean squared
amplitude on N1.

View larger version (19K):
[in this window]
[in a new window]
|
FIGURE 4
Plots of the mean squared amplitude of bending and rms
angular displacements as a function of the starting position
N1 for L = 5 and 20 bp and
r = 0.8 and 1.35 for DNAs of length N + 1 = 50 bp for L = 20, r = 0.8 ( );
L = 5, r = 0.8 ( ); L = 5, r = 1.35 ( ); and L = 20, r = 1.35 ( ). Data
are fit to single-exponential functions of N1
(Eq. 9).
|
|
Fig. 5 plots the correlation length,
N
, for different L, r, and N + 1. The astoundingly simple result is that the correlation length
is always approximately half the total number of basepairs:
N
(N
2)/2.
N
is independent of
L, r, and
. Lengths of DNA up to N + 1 = 200 were tested for
N
(not shown in Fig.
5). This effect can be qualitatively understood in the following way. We note first that the alteration of the bending potential in one
region of the molecule has no effect whatsoever on the equilibrium distribution of bending angles between successive bond vectors, 

i2(
)
, or on the mean squared curvature,
at any other point in the molecule. It is important to note that the
mean squared angular displacements under discussion, namely

i2(
)
, are those of a given bond vector with
respect to the end-to-end vector of the entire filament. When viewed
from the local frame of the probe, the effect of increasing the
flexibility of some distal region is to increase the mean squared
angular displacement of the end-to-end vector away from the bond vector
of the probe. The end-to-end vector is a normalized sum of all of the
bond vectors between neighboring bases in the DNA molecule. In the
frame of the probe, all bond vectors from the end containing the probe up to the site of increased flexibility remain unchanged, and beyond
the flexible region all bond vectors are rotated upon flexing. By
placing the region of flexibility as close to the probe as possible,
the maximum number of bond vectors are rotated, and the largest
displacement of the end-to-end vector relative to the probe bond vector
occurs. As the flexible region is moved away from the probe, flexing
will rotate fewer bond vectors, and the effect falls away.
 |
CONCLUSIONS FROM THEORY |
The overall impact of a test sequence on the motion of the probe
falls off exponentially with the increasing length of an intervening
average sequence, when the total length of the DNA is kept fixed. The
correlation length over which the dynamical communication is maintained
depends only on the total length of the duplex DNA and is
1/2(N + 1). Differences in the bending dynamics seen by the probe due to a test sequence are enhanced by adding more
average DNA. The difference in the internal bending,
r
i2
, for an average sequence
(for instance, NT) and for one that is more flexible (like that of an
(AT)n sequence) increases exponentially with increasing
length of the molecule up to ~70 bp, and this length is different
from 1/2(N + 1).
The result,
N
(N
2)/2, is valid within the
context of the weakly bending rod model. However, there must be a point
at which the correlation length,
N
, is cut off by the
total persistence length of the DNA, beyond which angular correlations
are lost in any case. That is, when the contour length exceeds the
persistence length, then the correlation length must be limited by the
persistence length. One can express this idea mathematically by
assuming a form similar to that suggested for the addition of
persistence lengths (Schellman and Harvey, 1995
):
|
(10)
|
 |
EXPERIMENTAL METHODS |
A series of duplex DNAs were constructed, all of which were 50 bp in length. The spin probe base, Q (Fig. 1), is always at
position 6 in the initial 11-bp sequence, and the test sequences never
begin before position 12. The control sequence (NT) is basepairs 1087 to 1136 from Drosophila melanogaster TATA-box binding
protein TFIID gene:
The constructs that were tested contained the dinucleotide
repeat (AT)n. (AT)n inserts were chosen because
they lack retarded polyacrylamide gel electrophoretic mobilities,
indicating that these sequences do not contain permanent bends
(Hagerman, 1990
); because (AT)n sequences are thought to be
very flexible (Harrington and Winicov, 1994
); and because
(AT)n sequences have not been implicated in inducing
allosteric transitions. The test sequence names describing the insert
composition, test sequence length L, test sequence base
starting position N1, and stopping position N2, all with Q at position 6, are
listed in Table 1. For example, the
sequence listed AT4 is
where the unbolded sequence is the test sequence.
View this table:
[in this window]
[in a new window]
|
TABLE 1
The names of the sequences made with the probe, Q, at
position 6, and the length of the test sequence, L, and the
start, N1, and stop,
N2, positions of the test sequence
|
|
In addition to the poly-(AT)n basepair test sequences, we
inserted a dipropylene linker moiety, DPL
(5'O-(CH2)3O-PO2
-O-(CH2)3O-3'),
into the NT control sequence. DPL is close in size to 2 bp, but is
missing the sugar and base. This linker is placed in both strands of
the duplex DNA in a complementary fashion but does not replace any of
the 50 basepairs. Such a linker was designed with the intention of
showing the extent to which EPR spectra are sensitive to very weak
coupling between the two segments, which are the 11-mer sequence
containing the dynamics probe and the remaining 39-mer sequence in the
molecule. Furthermore, the DPL sample demonstrates the extent to which
the dynamics of the two segments of the DNA can be fully decoupled.
Q was prepared as previously described (Miller et al.,
1995
), using both the naturally abundant
[14N,H12] and the isotopically substituted
[15N,D12] nitroxides. Q was
site-specifically attached at the sixth position of every 50-mer duplex
DNA sampled. The duplexes were prepared using the Klenow polymerase
filling technique (Maniatis et al., 1989
). All oligomers were
synthesized on an ABS 6800 or 392 DNA synthesizer and purified using
reverse-phase (trityl-on purification) high-performance liquid
chromatography on a Dynamax 300-Å column. 0.4 OD of the 11-mer primer
strand containing Q at the sixth position (DNA 5'-d(CCT
CGQ ATC GT)) was annealed with 2.0 OD of the appropriate
template 50-mer and combined with 10 mM each of dATP, dCTP, dGTP, and
dTTP, and 50 units of Klenow fragment or 20 units of Vent polymerase in
1× primer extension buffer or ThermoPol buffer. After incubation, the
reaction mixture was purified by nondenaturing polyacrylamide gel
electrophoresis. DNA was extracted from the gel, using the
crush-and-soak procedure (Maniatis et al., 1989
) in duplex elution
buffer. Extracted DNA was ethanol precipitated, dried, and resuspended
in 10 µl PNE buffer (10 mM phosphate (pH 7.0), 0.1 mM EDTA, and 100 mM NaCl).
CW-EPR spectra were digitally recorded on a spectrometer with a loop
gap resonator cavity (Mailer et al., 1991
) and a commercial Bruker EMX
spectrometer with a TE102 cavity. Parameters employed for CW-EPR
measurements include a 10-kHz modulation frequency, 1.0-G modulation
amplitude, 0.1 mW power, 1024 points, and 30°C regulated to
±0.2°C, well below Tm for these molecules
(Okonogi et al., 1999
). Line samples of 1-2 OD (100-200 µM) duplex
DNA were placed in either a 0.6 × 0.84 mm or 0.8 × 1.0 mm
quartz capillary and stored at 4°C between EPR measurements.
Known rigid limit A and g tensors were obtained
elsewhere (Reese, 1996
), and the uniform mode correlation times were
used to simulate the spectra of the B-form 50-mers in PNE solution at
30°C. The change in spectral width arises from a rapid dynamics
averaging of the A tensor elements. The observed spectral
width is proportional to the dynamically averaged tensor element,
Azz
, which is directly related to the
order parameter, S6, and to the mean squared
internal oscillation amplitude, 
62(
)
, by
the relation
|
(11)
|
where
=
trA and is independent
of probe motion (Hustedt et al., 1993
).

62(
)
was found by best fitting the outer
components of the spectra. The correlation coefficient, R,
defined previously (Hustedt et al., 1993
), exceeded 0.97 for all
spectra reported.
 |
RESULTS |
The EPR spectra of NT and DPL are shown in Fig.
6. The distance between the outer
spectral features is proportional to the quantity
Azz
. The decrease in the distance between
the outer spectral feature in the DPL spectrum with respect to the NT
spectrum is indicative of a more flexible sequence undergoing larger
mean squared amplitudes of bending (see Eq. 11). These spectra
illustrate the range of spectral responses expected when different test
sequences are inserted into DNA with a fixed contour length of ~170
Å. If the DPL connection between basepairs 11 and 12 were to act as a
universal swivel joint, or in the context of the model r = 0 (Eq. 6), then we could predict the dynamics of the 11-mer. A
freely jointed hinge uncouples the internal collective modes of the
11-bp segment (containing the probe) from the rest of the DNA. The
11-bp segment then freely rotates as an independent object about the end connected to the remaining 39-bp segment, rather than about the
middle, which is basepair 25, of the full DNA. Rotation about the end
of the 11-mer is nearly equivalent to a 22-mer rotating freely about
its center. Therefore, Fig. 6 compares the EPR spectrum of DPL with
that of a 22-bp-long duplex DNA (NT-22), also labeled at position 6. The high degree of overlap between these two spectra supports the
hypothesis that the DPL linker acts nearly as a universal hinge. Others
have observed that freely jointed segments increase flexibility in DNA.
A single-stranded region of six bases will produce a result nearly
equivalent to that of the DPL linker (Reese, 1996
). In cyclization
assays, Crothers and co-workers detected a hinge in duplex DNA composed
of three unpaired bases in a row in 182-bp duplex DNA. The r.m.s.
libration amplitude due to the hinge was ~66° (corresponding to a
local effective persistence length of ~35 Å), as compared to the
value of a basepair in B-form duplex DNA of ~7° (this corresponds
to a persistence length of 500 Å) (Kahn et al., 1994
).

View larger version (20K):
[in this window]
[in a new window]
|
FIGURE 6
The CW-EPR spectra of the 50-mer duplex DNA sequences
NT (control) and DPL (NT containing a dipropylene linker between 5'
basepairs 11 and 12) and a 22-mer duplex sequence (NT-22, the first 22 basepairs of the NT sequence). All three sequences are spin labeled
with Q at position 6 from the 5' end. Tensors used to
simulate the EPR spectra of the [14N,H12]
isotopic spin label are A = 6.58, 4.98, 34.23 Gauss,
and g = 2.0084, 2.0068, 2.0034 (simulations not shown).
The A tensors for [15N,D12] label
are those for [14N,H12] divided by 0.733.
|
|

62
for the different DNAs listed in Table 1
have been measured. The experimental 
62
values
are plotted against their sequence start positions,
N1, in Fig. 7. The
theory (calculated from Eq. 6 for 
62
) is
overlaid for the values of L, corresponding to the different lengths of the (AT)n inserts. Note that all of the curves
decay with a correlation length of ~25 bp, as shown in Figs. 4 and 5. A single r of 0.81 ± 0.02 was chosen by a best-fit
criterion to simulate the 
62
data for all of
the (AT)n inserts. A previously determined value of
kBT/
was used (Okonogi et
al., 1999
), but the ratio is insensitive to
. The r
parameter was calculated using dynamic bending persistence lengths
ranging from 1200 to 1500 Å and did not vary. A single length-independent 
o2
was added to the
theoretical predictions of Eq. 6. 
o2
and
r are the only adjustable parameters. The value of

o2
, which was found by a least-squares
criterion, was the same, within experimental error, as that measured
previously (Okonogi et al., 1999
).

View larger version (20K):
[in this window]
[in a new window]
|
FIGURE 7
 62 and rms
values for the sequences given in Table 1. Overlaid is the theory of
Eqs. 6 and 7, for r = 0.81 ± 0.02, kBT/ = 0.00272 rad2, which corresponds to a persistence length of 1250 Å at 30°C, and the length independent contribution to
 62 of  o2 = 0.0224 ± 0.0006 rad2. Experimental errors are all
less than or equal to ±0.0015 rad2.
|
|
 |
DISCUSSION |
The agreement of the theory with the experiments supports the idea
that (AT)n sequences have a bending force constant that is
different from the average basepair and that the effect is local. This
is supported by the observation that the 
62
differences between the control (NT) sequence and the
(AT)7A inserts become smaller as the (AT)7A
inserts are moved farther from the observing spin probe. The effect is
not independent of position of the insert relative to the probe. The
agreement between the measured values of the amplitude,

62
, for the inserts,
(AT)7Asn, and those computed from the theory (Eqs. 6 and 7) is remarkable in that the fall-off of the effect of the
inserts with distance from the probe is not an adjustable parameter of
the model. The experiments (shown in Fig. 7) are designed to test both
the dependence on L for fixed N1 and
on N1 for fixed L. There is good
agreement, using a single ratio parameter, r, for both sets
of experiments. Given the simplicity of the model, which contains only
two adjustable parameters, r and the length-independent
amplitude, 
o2
, the agreement of the
calculations with the eight different types of test sequences is
excellent. The conclusion that (AT)n sequences are more
flexible than average sequences seems justified.
The correlation length of the effect depends only on the total length
of the DNA, regardless of test sequence length or flexibility. The
amplitudes of bending are a function of L, N, and
r and depend on the position of the test sequence with
respect to the probe and the remaining average DNA; but the correlation
length is always approximately half the length of the DNA duplex. Small
permanent (or static) bends in the DNA do not appreciably reduce the
effects of the coupling as seen throughout the molecule (Okonogi et
al., 1997
). In a much longer filament, this effect must be cut off by
the equilibrium total persistence length of the DNA,
Ptot = 500 Å.
The fact that not all of the experimental 
62
values (including their errors) in Fig. 7 lie on the calculated decay
curves is due in part to the simplicity of the model. The experiments presented herein do not allow a distinction between (AT) and (TA) steps, and the flexibility, parameterized by r, must be
considered a weighted average of the (AT) and (TA) step flexibilities.
Should (AT) and (TA) steps have different flexibilities, the relative flexibility ratios can be defined as r(AT) and
r(TA). The ratio parameter we report,
, is the average of these two ratios, given by
1/
= 0.52 (1/r(AT)) + 0.48(1/r(TA))). This average was determined using Eqs. 6 and 7 modified to accept two different ratio parameters. More extensive experiments and analysis that allow for distinctions between basepairs (Okonogi et al., 1997
, 1998
) will be published elsewhere. Moreover, the motion may be highly anisotropic, and the
value reported here represents an average bending amplitude and force constant.
It is not surprising that we find (AT)n sequences more
flexible than average. Others have observed qualitative differences in
binding and have attributed the differences to increased local flexibility in duplex DNA associated with (AT)-rich sequences or
regions. For example, any of a number of permutations of an (AT)-rich
sequence, next to the GC box that binds the MIG1 zinc finger protein,
was found to be essential for high-affinity binding, even though no
single base within this region was conserved (Lundin et al., 1994
).
Lundin et al. (1994)
concluded that "MIG1 recognizes the AT box
directly, but in a way that requires bendable DNA rather than a unique
sequence motif." It is now possible to assign a magnitude to the
increased bending made possible by the presence of (AT)n steps.
The bending in (AT)n sequences increases by 10% for the
same amount of energy, and 20% less energy is required for the same amount of bending, because the bending constant for (AT)n
is 0.8 times that for average DNA and because


i2(
)
= 2(kT/
i). At room temperature, the average
bending between base pairs is ~3°, and a 20% increase in
flexibility increases the mean relative bending angle to