| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
Biophys J, August 2002, p. 1177-1183, Vol. 83, No. 2



and
*VTT Biotechnology, FIN-02044 VTT, Espoo, Finland;
Department of Physical Sciences, University of Helsinki,
FIN-00014 Helsinki, Finland; and
Department of Molecular
Biology and Biotechnology, University of Sheffield, Sheffield S10 2TN,
United Kingdom
| |
ABSTRACT |
|---|
|
|
|---|
A new principle in constructing molecular complexes from the known high-resolution domain structures joining data from NMR and small-angle x-ray scattering (SAXS) measurements is described. Structure of calmodulin in complex with trifluoperazine was built from N- and C-terminal domains oriented based on residual dipolar couplings measured by NMR in a dilute liquid crystal, and the overall shape of the complex was derived from SAXS data. The residual dipolar coupling data serves to reduce angular degrees of freedom, and the small-angle scattering data serves to confine the translational degrees of freedom. The complex built by this method was found to be consistent with the known crystal structure. The study demonstrates how approximate tertiary structures of modular proteins or quaternary structures composed of subunits can be assembled from high-resolution structures of domains or subunits using mutually complementary NMR and SAXS data.
| |
INTRODUCTION |
|---|
|
|
|---|
Spatial rearrangements between and within
proteins during the course of biological function are essential
manifestations of life at the molecular level. Large organizational and
conformational changes often challenge the means currently available
for the determination of three-dimensional structures. Crystals of a
protein in all of its relevant conformations are seldom available, in particular when one or more states in the functional pathway exhibit flexibility. NMR-derived distance information for large complexes is
sparse, especially between subunits. Here we show that a combination of
directional information measured by NMR from weakly aligned systems
(Tjandra and Bax, 1997
; Bax et al., 2001
) with global information about
size and shape embedded in small-angle scattering (SAS) of x-rays or
neutrons (Glatter and Kratky, 1982
; Feigin and Svergun, 1987
) greatly
extends the ability to construct models of larger modular structures in
solution based on high-resolution data.
Small-angle scattering of x-rays or neutrons is sensitive to the
overall size and shape of a protein or its complex (Trewhella, 1997
;
Doniach, 2001
). When employing neutrons (SANS), contrast can be created
from the large difference in the scattering lengths of H and D by
isotopic H/D exchange, perdeuteration of the subunits, and varying the
deuterium oxide content of the solvent (Koch and Stuhrmann, 1979
). On
the basis of the scattered intensity, low-resolution envelopes can be
constructed ab initio by various ways (Stuhrmann 1970
; Chacon et al.,
1998
; Svergun, 1999
). However, provided that the subunit structures of
a quaternary complex are known to high resolution, the arrangement of
the subunits can be searched by finding the best fit between the
experimental scattered intensity I(s) and
amplitudes computed from two-body structures according to
|
(1) |
s is the electron density of the solvent and

b is the contrast of the hydration shell
(Svergun et al., 1995The rotational uncertainty remaining from SAS measurements could be
resolved using NMR. Enhanced anisotropic tumbling of molecules in a
liquid crystal (Tjandra and Bax, 1997
; Hansen et al., 1998
) gives rise
to readily observable residual dipolar couplings,
D(
,
), which depend on the polar coordinates
and
of the internuclear vector relative to the principal axes of the
molecular alignment tensor A:
|
(2) |
is the axial and
A
is the transverse component of
A; i.e., R = A
/A
is the rhombicity (Bastiaan et al., 1987Main-chain dipolar couplings are the most attainable to measure and to
assign (Ottiger et al., 1998
; Permi and Annila 2000
; Permi et al.,
2000
). Values of dipolar couplings relate the N-H bond directions to
the principal axes system (PAS) of the molecular alignment tensor
(A) (Eq. 3). The elements of the traceless and symmetric
A in the molecular frame can be determined by singular value
decomposition, using the measured dipolar couplings and projection
angles of corresponding internuclear vectors derived from the known
domain structures (Losonczi et al., 1999
). The diagonalization of
A makes the molecular and principal frames coincide and
yields a rotation matrix R
O
The complex between calmodulin (CaM) and trifluoperazine (TFP) was
chosen to illustrate the method and to assess the accuracy and
precision that can be obtained. The CaM system is very well defined
structurally and has already required a combination of structural
methods for a correct molecular description of its function. The rigid
connectivity between the N- and C-terminal domains observed in the
crystal (Chattopadhyaya et al., 1992
) was shown not to be retained in
solution using small-angle x-ray scattering (Heidorn and Trewhella
1988
; Heidorn et al., 1989
) and NMR relaxation measurements (Barbato et
al., 1992
) and recently by means of residual dipolar couplings (Chou et
al., 2001
). The two domains are relatively extended and independently
mobile. However, in the presence of models of the natural targets, NMR (Ikura et al., 1992a
; Ikura et al., 1992b
) and x-ray structures (Meador
et al., 1992
) have revealed that the two domains bind such peptides in
a more compact, globular complex. Similarly, hydrophobic ligands can
also shift the equilibrium position from independently mobile domains
toward a compact globular structure (Vandonselaar et al., 1994
; Cook et
al., 1994
; Osawa et al., 1998
, 1999
). According to x-ray scattering
(SAXS) measurements, the CaM-TFP complex predominantly populates a
compact globular form (Matsushima et al., 2000
), in which more than
five times molar excess of TFP is required to reach full occupancy. The
positions of four TFP molecules have been defined by x-ray diffraction
(1LIN) (Vandonselaar et al., 1994
), two of which occupy higher-affinity sites (Matsushima et al., 2000
; Craven et al., 1996
).
| |
MATERIALS AND METHODS |
|---|
|
|
|---|
Calmodulin was produced and purified as described earlier
(Craven et al., 1996
). Samples for NMR measurements were 0.3 mM in
dilute liquid crystals. Liquid crystal media for NMR measurements were
prepared by dissolving to an approximate concentration of 30 mg
ml
1 filamentous phage (Pf1) particles that
align in the magnetic field (Hansen et al., 1998
). The actual aligning
power of the medium was quantified from the deuterium quadrupolar
splitting of deuterium oxide. All spectra were recorded using Varian
800 and 600 NMR spectrometers.
For each residue of CaM the main-chain N-H heteronuclear splittings
between multiplet components were measured from
/
half-filtered 15N,1H correlation spectra
(Cordier et al., 1999
) at 37°C in 18 h. The dipolar contribution
to an N-H splitting was obtained by subtracting the scalar coupling
measured from reference spectra recorded from 0.5 mM sample in water.
The five independent elements of the traceless and symmetric molecular
alignment tensor were determined by singular value decomposition.
The SAXS experiments were carried out with CuK
radiation made monochromatic with a Ni filter and a totally reflecting mirror. Radiation was obtained from a sealed fine-focus x-ray tube and
detected by a linear one-dimensional position-sensitive proportional
counter (MBraun OED 50M). The beam has a narrow profile; together with
detector resolution the instrumental function has FWHM = 0.005 Å
1 horizontally and FWHM = 0.35 Å
1 vertically. The sample was injected with a
syringe in a steel-framed cell that has thin stretched polyimide foils
as flat windows. The protein concentration was 0.3 mM. The measurements
were made at room temperature using 3 h of accumulation time. The
sample-to-detector distance was 165 mm. The background and non-sample
scattering was measured separately and subtracted from each
measurement. The scattered intensity was corrected for the instrumental
broadening, and the distance distribution function was calculated using
the program GNOM (Svergun 1992
).
| |
RESULTS |
|---|
|
|
|---|
The complexation of CaM with TFP molecules causes a large change
in P(r) (Fig. 1),
characterized by a reduction in the radius of gyration
(Rg) from 20.3 to 17.9 Å and in the
longest diameter Dmax from ~65 to 55 Å. This is indicative of the transition to a compact globular form
(Heidorn et al., 1989
; Osawa et al., 1999
).
|
Residual main chain 15N-1H
dipolar couplings (DNH) were measured
in the presence of more than fivefold molar excess of TFP in a dilute
liquid crystalline phase (Fig. 2). The
principal frames of each CaM domain in the complex with TFP were
determined independently, using the co-ordinates of free CaM (1CLL)
rather than the TFP complex form (1LIN) to be consistent with the
notion of building quaternary complexes from unligated subunits.
Residues from the central helix in the vicinity of the mobile region,
residues subject to rapid transverse relaxation, and data points with
poor signal-to-noise ratio were not included. The calculated
D(
,
) values agree closely with the measured data
(Fig. 2). The RMSD is 2.2 and 1.8 Hz for the N- and C-domains,
respectively. The uncertainty in A was estimated on the
basis of the variation in the diagonal elements of A
introduced by a jackknife procedure (Shao and Tu, 1995
), deleting on
average 10% of the measured data points at random 20,000 times. This
simulates the experimental situation where data points are lost due to
poor signal-to-noise ratio or signal overlap. The half-widths and base
widths of Axx, Ayy, and Azz distributions are ~0.5 Hz and 2.0 Hz,
respectively, for both domains. This corresponds to the 1.2-Å RMSD and
standard deviation of 2.6 Å (N-domain) and 1.6-Å RMSD and 1.3-Å SD
(C-domain) between the 20,000 family members. The correlation between
the measured and computed values of
DNH are r = 0.95-0.89
(N-domain) and r = 0.93-0.82 (C-domain). The
uncertainties obtained above are similar in magnitude to those obtained
from Monte-Carlo simulations using 1-Hz imprecision in the measurement
of dipolar couplings. The quality factors (Ottiger and Bax, 1999
) for
the optimum orientations were 0.23 and 0.37 for the N- and C-domains,
respectively. The PAS orientations of the individual domains closely
coincide, as expected for a compact globular system. The correlation
decreases marginally to 0.91 when the N- and C-domain data are fitted
simultaneously. Therefore, changes in the dipolar couplings due to
complex formation can primarily be attributed to changes in the
relative orientation of the domains rather than to changes in domain
structure.
|
Comparisons similar to those above using on the average 80% of the experimental values yielded families of domain orientations with RMSDs (SD) of 2.7 (4.9) and 2.5 (1.9) Å for the N- and C-domains, respectively, and with only 50% of the experimental values 6.0 (6.9) and 6.0 (5.2) Å for the N- and C-domains, respectively. Obviously, the smaller the number of residual dipolar couplings used in the determination the poorer the precision. Also, the choice of nonredundant directions becomes important as is reflected by the increasing deviation among orientations.
Four possible relationships between the two domains of CaM result from
the fitting procedure, because residual dipolar couplings are even
functions of
and
(Eq. 3). In this particular case the
comparatively short linker between the domains of CaM imposes a
covalent constraint that excludes two of the four possibilities. The
remaining two can be readily distinguished from the
2 value when the SAXS data are included. In
general, the feasibility of this procedure for choosing the appropriate
symmetry alternative depends on the overall shape of the complex and
subunits, but the CaM domains are not unusually irregular in shape.
Alternatively, the fourfold degeneracy can be lifted by recording data
from the complex in another orientation utilizing a different liquid
crystal (Al-Hashimi et al., 2000
).
The relative position of the pre-oriented domains was determined from
SAXS data by minimizing the sum of least-squares between the measured
Iexp(s) and computed
I(s), defined as
|
(3) |
(s) is the standard deviation of the recorded
scattering curve. The model intensity I(s) was
obtained from the atomic coordinates by the Debye formula. The
coordinates of the model were transformed by changing the relative
position of the domains according to a given translation vector. The
minimum of
2 equal to 3.5 was sought as a
function of the three interdomain distance parameters with the
nonlinear optimization algorithm of Nelder and Mead. For the radius of
gyration our model yielded Rg = 17.7 Å. A similar comparison between experimental and model P(r) functions was also made yielding similar results.
The precision of the minimum position was considered by computing
2 in a grid with a 1-Å spacing around the
minimum (Fig. 3). The distance between
the domains is obtained with a good precision of ±1 Å because the
SAXS data are intrinsically highly sensitive to
Rg. However, the perpendicular
precision is more modest, ±4 Å on a convex surface, equivalent to a
nearly conical distribution of ±10°.
|
Although the Debye formula does not take into account such important
factors as the excluded volume and the hydration shell of the protein,
these effects are negligible in our case when considering the error in
the scattering intensity caused by the TFP molecules that were not
present in the model. This was inspected by calculating intensities
from the CaM-TFP crystal structure (1LIN). The radius of gyration
Rg computed from 1LIN increases from
15.6 Å to 16.4 Å when the TFP density is removed. The removal of TFP
molecules caused larger differences in the intensity at large
s than the effects of the excluded volume and hydration shell, calculated by CRYSOL (Svergun et al., 1995
). For the minimum conformation obtained by the procedure described above the
2 was also calculated with the program CRYSOL,
which does take the hydration shell and the excluded volume into
account (Eq. 1). This minimum corresponds to
2 = 2.4. In the program CRYSOL, the excluded volume is represented using
a two-dimensional envelope function that may not adequately take into
account the cavity between the domains of the model compared with a
dummy atom representation for the excluded volume.
Comparison to the CaM-TFP crystal structure (1LIN) reveals that the
model based on the SAXS and NMR data has closely similar relative
domain orientations and distance between the domains (Fig.
4). When the 1LIN structure and our best
solution are superimposed on their C
-traces of
residues 4-74 and 86-147 the RMSD is 2.4 Å. The distance between the
centers of domains in the model is ~2 Å larger than in the crystal
structure, almost entirely along the long axis of the shape tensor.
Although this may be a genuine difference between the solution and
crystal structures, other factors, most importantly the lack of TFP
molecules from our model, contribute to the discrepancy. The TFP
molecules were not included, because we did not want to define their
positions a priori. The TFP molecules in 1LIN account for ~10% of
the mass of the complex, meaning that the model is some 20% deficient
in the scattering power at I(0). The approximate positions
of TFP molecules in the model were searched by placing them, one at a
time, at Cartesian grid nodes of 1 Å spacing in x,
y, and z, with three angles
,
, and
stepped in 60° at each node. No van der Waals radii restrictions were applied. Minimum
2 values indicated that the
first and second TFP molecules were localized in the central part of
the model. For the third and fourth TFP molecule the fit improved but
almost with no spatial information. As an alternative, we included an
artificial spherical scatterer in the model between the domains to
account for the scattering power of the TFP molecules. This improved
the fit by ~15%. Rg decreased to
17.5 Å, but most improvements to the fit were at higher s
values.
|
A minor contribution to
2 came from the few
N-terminal residues of CaM that according to NMR data (Barbato et al.,
1992
) and nonexistent electron density (Meador et al., 1992
) are not
structured and therefore not included in the model. In our case where
the lack of TFP molecules from the model accounts for most of the error
there is also a small contribution from the hydration shell. It has
been shown by comparison of SAXS and SANS data that neglecting the
hydration shell will lead to a systematic error in
Rg (Svergun et al., 1998
), resulting
in SAXS data producing a larger apparent structure by 1-2 Å.
The CaM-TFP system therefore is a considerable challenge to the method,
yet despite the incompleteness of the description of its components,
the model is highly similar to the crystal structure (1LIN) (Fig. 4).
| |
DISCUSSION |
|---|
|
|
|---|
The ability to record NMR data to complement SAS data for larger
proteins and their complexes has recently been developed through the
exploitation of relaxation interference between dipole-dipole and
chemical shift anisotropy interactions of the amide nuclei (TROSY)
(Pervushin et al., 1997
; Salzmann et al., 1998
). The TROSY method is
applicable to a large number of NMR experiments including the
measurement of residual dipolar couplings from weakly aligned systems
(Permi and Annila, 2000
; Permi et al., 2000
). Perdeuteration, a further
means of NMR sensitivity enhancement for large complexes that can be
domain specific or introduced segmentally within a protein using
inteins (Yamazaki et al., 1998
), limits the traditional short-range distance data that are obtainable, but assists the recording of residual dipolar couplings. Additionally, perdeuteration will serve to produce contrast for neutron scattering experiments. High
contrast between the constituents of a complex coupled with high
angular definition will further improve the possibilities to build high
quality models from known subunits by SAS and NMR. Importantly for
large systems, our method does not require complete assignment, but a
few tens of unambiguous residual dipolar couplings will suffice (Clore,
2000
). In this way structures of significantly larger protein complexes
in solution can be studied than have been accomplished so far. In
particular, homodimers or homo-oligomers are especially amenable to
such model building owing to the symmetry constraints.
The precision with which orientations of domains can be defined depends
on the number of residual dipolar couplings and their precision
measured in nonredundant directions and on the structural agreement
between the couplings and the individual high-resolution domains
structures determined by x-ray crystallography or NMR spectroscopy. A
dense sampling of directions will also assist the detection of local
structural changes. A portion of the couplings may also be affected by
dynamics within each domain or conformational exchange within a domain
(Tolman et al., 2001
; Meiler et al., 2001
). Methods to cross-validate
the data (Skrynnikov and Kay, 2000
) and ways to refine the models (Chou
et al., 2001
) have been developed. Asymmetry within the constituents of
the complex will improve the precision, because fewer degrees of
freedom remain ambiguous from the SAXS measurements. The completeness
of the model will also determine the precision, but as illustrated
above, considerable incompleteness can be tolerated.
This study of ligand-induced globularization of CaM illustrates the
principle of constructing overall solution structure models from
high-resolution coordinates of domains or subunits on the basis of NMR
data from weakly aligned systems and small-angle scattering data. From
the small-angle scattering perspective, NMR residual dipolar coupling
data serve to reduce angular degrees of freedom. Alternatively, this
could also be achieved using anisotropy in chemical shielding (CSA) to
give the orientation with respect to the molecular alignment frame
(Sanders et al., 1994
; Wu et al., 2001
). From the NMR point of view,
the SAS data serve to reduce the translational degrees of freedom. This
combination of methods circumvents difficulties in the observation and
assignment of short-range (<5 Å) distance restraints that limit
current applications of residual dipolar couplings to rigid-body
dynamics assembly of complexes (Clore, 2000
). We anticipate that the
demand to construct models of larger complexes will increase rapidly as
the protein data bank grows in its structural diversity.
| |
ACKNOWLEDGMENTS |
|---|
We are indebted to Andrea Hounslow and Clare Treritt for advice and assistance.
This work was supported by the Academy of Finland and Technology Agent of Finland (TEKES).
| |
FOOTNOTES |
|---|
Address reprint requests to Dr. Arto Annila, Institute of Biotechnology, University of Helsinki, P.O. Box 56, FIN-00014 Helsinki, Finland. Tel.: 358-9-191-50629; Fax: 358-9-191-50639; E-mail: arto.annila{at}helsinki.fi.
Submitted October 31, 2001, and accepted for publication February 19, 2002.
| |
REFERENCES |
|---|
|
|
|---|
Biophys J, August 2002, p. 1177-1183, Vol. 83, No. 2
© 2002 by the Biophysical Society 0006-3495/02/08/1177/07 $2.00
This article has been cited by other articles:
![]() |
M. V. Petoukhov and D. I. Svergun Global Rigid Body Modeling of Macromolecular Complexes against Small-Angle Scattering Data Biophys. J., August 1, 2005; 89(2): 1237 - 1250. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |