| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
Biophys J, May 2002, p. 2720-2736, Vol. 82, No. 5
-Bundle Proteins

Departments of *Biochemistry and Cell Biology and
Physiology and Biophysics, Center for Structural Biology,
SUNY Stony Brook, Stony Brook, New York 11794-5115 USA
| |
ABSTRACT |
|---|
|
|
|---|
Helix-helix interactions are important for the folding,
stability, and function of membrane proteins. Here, two independent and
complementary methods are used to investigate the nature and distribution of amino acids that mediate helix-helix interactions in
membrane and soluble
-bundle proteins. The first method
characterizes the packing density of individual amino acids in helical
proteins based on the van der Waals surface area occluded by
surrounding atoms. We have recently used this method to show that
transmembrane helices pack more tightly, on average, than helices in
soluble proteins. These studies are extended here to characterize the packing of interfacial and noninterfacial amino acids and the packing
of amino acids in the interfaces of helices that have either right- or
left-handed crossing angles, and either parallel or antiparallel
orientations. We show that the most abundant tightly packed interfacial
residues in membrane proteins are Gly, Ala, and Ser, and that helices
with left-handed crossing angles are more tightly packed on average
than helices with right-handed crossing angles. The second method used
to characterize helix-helix interactions involves the use of helix
contact plots. We find that helices in membrane proteins exhibit a
broader distribution of interhelical contacts than helices in soluble
proteins. Both helical membrane and soluble proteins make use of a
general motif for helix interactions that relies mainly on four
residues (Leu, Ala, Ile, Val) to mediate helix interactions in a
fashion characteristic of left-handed helical coiled coils. However, a
second motif for mediating helix interactions is revealed by the high
occurrence and high average packing values of small and polar residues
(Ala, Gly, Ser, Thr) in the helix interfaces of membrane proteins.
Finally, we show that there is a strong linear correlation between the occurrence of residues in helix-helix interfaces and their packing values, and discuss these results with respect to membrane protein structure prediction and membrane protein stability.
| |
INTRODUCTION |
|---|
|
|
|---|
Membrane and water-soluble proteins commonly fold
into bundles of
-helices. However, the nature and distribution of
the amino acids in these proteins are very different. The difference in the composition of the surface-exposed residues is well known and
simply reflects the environment of the protein, i.e., in soluble proteins polar and charged residues are on the water-accessible surface, whereas in membrane proteins hydrophobic residues cover the
lipid-exposed surface (Rees et al., 1989
). Much less is known about the
nature and distribution of amino acids in the interiors of membrane and
soluble proteins.
There is a long history involving efforts to understand the
folding and architecture of membrane proteins. The idea that membrane proteins had an "inside-out" architecture (Engelman and Zaccai, 1980
) was appealing when it was originally introduced because it
provided an explanation for the mechanism of helix association in
membrane proteins. The recent analysis of known crystal structures, however, clearly shows that membrane proteins do not have polar cores
of amino acids. Rees et al. (1989)
showed that the residues in the
interior of membrane proteins are less hydrophobic, on average, than
the lipid-exposed residues, but are comparable in hydrophobicity to the
residues in the interiors of soluble proteins. They proposed the use of
a helical "hydrophobic moment" (Eisenberg et al., 1984
) as a way to
identify the lipid-exposed surface of transmembrane helices. Their
analysis left open the question concerning how the hydrophilic residues
are distributed in the interior of membrane proteins. More recently,
Stevens and Arkin (1999)
concluded that the hydrophilic moment is
a poor indicator of helix orientation based on an extensive analysis of
known membrane protein structures. They found that helical hydrophilic
moments did not generally point toward the center of mass of the
protein. As a result, there are still unresolved questions involving
the internal architecture of membrane proteins.
Over the past few years, we (Javadpour et al., 1999
; Eilers et al.,
2000
) and others (Langosch and Heringa, 1998
; Russ and Engelman, 1999
;
Adamian and Liang, 2001
; Ulmschneider and Sansom, 2001
) have addressed
how helices pack in membrane proteins using a number of different
approaches. We have developed two methods for studying helix
interactions in membrane proteins. The first method is based on
constructing "contact plots" for all interacting helix pairs in a
membrane protein (Javadpour et al., 1999
). Based on an analysis of four
polytopic membrane proteins we showed that glycine had an unusually
high occurrence in helix interfaces and at helix crossing points. One
limitation of this early study was that few membrane protein structures
were available, a situation that has changed considerably over the past
three years. Moreover, a detailed comparison with soluble proteins was
not made. Finally, the focus of this previous study was primarily on
the role of glycine in transmembrane helix association. A larger data
set allows for a more comprehensive analysis of all amino acids.
The second method we have developed is based on the use of occluded
surfaces to probe amino acid packing. We have shown that membrane
proteins are generally more tightly packed than helical soluble
proteins (Eilers et al., 2000
). The packing analysis strongly suggested
that small and polar residues contribute to tight helix interactions.
However, the origin of the high packing values in membrane proteins
could not be unambiguously established because the packing values were
not separately calculated for interfacial and noninterfacial residues.
In this paper, we revisit the question of how helix interactions differ
between membrane and soluble
-bundle proteins by combining the
two methods in our analysis. We restrict our comparison to only those
soluble proteins classified as
-bundle proteins, because the
-bundle architecture is most similar to that of membrane proteins
and consequently provides the best comparison. With the recent
structure determination of several large membrane proteins and the
inclusion of
-bundle domains in soluble proteins, the current data
set is significantly larger and the average resolution of structures is
higher than that previously used for analyzing helix packing (Eilers et
al., 2000
). We have generated helix contact plots for 11 unique helical
membrane proteins and 23 soluble
-bundle proteins and
-bundle
domains. As a result, the contact plots now allow us to address helix
packing as a function of the location of any residue. In addition,
the helix pairs that are defined in our analysis can be categorized as
having either left- or right-handed crossing angles and parallel or
anti-parallel orientations. This allows us to address differences in
packing and helix interactions as a function of the helix geometry.
By combining the packing and helix contact analyses, we are able to show that small and polar residues serve to mediate tight helix-helix interactions in membrane proteins, and propose that these residues constitute a general packing motif that is well-represented in helical membrane proteins. The results refine how the hydrophilic moment of transmembrane helices relates to the internal architecture of membrane proteins, namely that the hydrophilic moment points between helix pairs rather than toward the center of mass of the protein. Finally, we discuss the use of packing values and interfacial propensities for predicting the relative orientation of transmembrane helices.
| |
METHODS |
|---|
|
|
|---|
Helix packing: method of occluded surfaces
The occluded surface (OS) method for analyzing packing
interactions in proteins has previously been described (Pattabiraman et
al., 1995
; DeDecker et al., 1996
). The OS method calculates packing
values at the level of individual atoms, amino acids, or entire
proteins. Packing values range from 0.0 to 1, corresponding to totally
exposed and totally occluded environments. Hexagonally packed spheres
have a maximum packing value of 0.8 due to the void space that exists
where the spheres are not in direct contact (Richards and Lim, 1993
).
The concept behind the OS method is illustrated in Fig.
1. In the OS calculation, van der Waals
surfaces are drawn around each atom in the protein and normals are
constructed that extend outward until they reach another surface or a
length of 2.8 Å, the diameter of a water molecule. The cutoff of 2.8 Å between amino acid surfaces accounts for the possibility that water
can occupy that space and therefore the corresponding surface is
defined as being nonoccluded (by another amino acid, chromophore, or
prosthetic group). The definition of the OS packing value (Fleming and
Richards, 2000
) takes into account the normalized occluded (or buried)
surface area weighted by the distance to the occluding neighbors. The
OS packing value (PV) for each residue is defined as
|
|
The packing values for individual residues can be directly compared because division by the total molecular surface area normalizes the packing value to account for the various sizes of the amino acids. Moreover, the method works equally well for both buried residues and surface residues. We have seen no systematic bias in packing values based on residue size or interfacial or noninterfacial location. This is illustrated in Fig. 2, which plots the difference in the average packing values for amino acids in interfacial or noninterfacial positions of membrane proteins. The packing value differences are all positive, indicating that the interior positions are more tightly packed. Also, there is no significant difference between the packing value differences for the abundant small residues (e.g., Gly, Ala, Ser, Thr) and large residues (e.g., Phe, Trp, Tyr). (The packing value differences are more variable for charged and highly polar residues that are not abundant in the transmembrane helices of membrane proteins.) This is important in the analysis because amino acids with small volumes tend to have high packing values in transmembrane helices, and a question that immediately arises is whether this results from helix-helix interactions or from small residues being surrounded by large residues on the same helix.
|
The occluded surface calculations were carried out on full protein structures, but the packing values we report represent only the amino acids in helices. The average occluded surface packing value for a protein is the average of all of the individual amino acid packing values for that protein. Prosthetic groups and chromophores were included in the calculations, whereas detergent, lipid, and water molecules were excluded. The calculations were carried out on monomers except for the ion channels and the light harvesting complex, where the functional tetramer (1J95), pentamer (1MSL), and trimer (1KZU) were used. We describe in the database section below how the helices in membrane and soluble proteins were assigned.
Helix contact plots
Helix-helix contacts were evaluated using a modified version of
the program Euler, which calculates backbone-to-backbone distances between transmembrane helices (Javadpour et al., 1999
). The program calculates the interatomic distances between all backbone atoms of each
interacting helix pair. In our analysis, two helices were considered to
be interacting if the minimum backbone-to-backbone distance was between
3 Å and 8 Å and there were at least 100 distances of <8 Å between
backbone heteroatoms. The helix-helix interface is defined by those
residues that occur at a local minimum in the contact plots or within
0.5 Å of a local minimum. Noninterfacial residues are those that do
not satisfy these criteria and include those that are oriented toward
lipids, internal aqueous pockets, or channels in membrane proteins and
toward water in soluble proteins. Since our definition of interacting
helices covers a broad range of interhelical distances, we separately
characterize the interfacial amino acids that have a
backbone-to-backbone separation of
6 Å, and those that have a
backbone-to-backbone separation of >6 Å.
Fig. 3 illustrates the concept behind the
construction of helix contact plots for three different helix
orientations. Fig. 3 A presents the contact plot for helix
4 in subunit L interacting with helix 4 in subunit M of the bacterial
photosynthetic reaction center (1AIJ). Both subunits have five
transmembrane helices, and three of these helices make contact with the
other subunit. Helix 4 in both subunits is in the central position of
the inter-subunit contacts. The two helices cross in the middle of the
membrane, and there are three amino acids (Phe-180, Asn-183, and
Ala-184), which lie in the
6 Å interface. Fig. 3 B
presents the contact plot for helices 2 and 7 in bacteriorhodopsin
(1C3W). The two helices diverge at the level of Pro-50 on helix 2. The
retinal chromophore of bacteriorhodopsin is attached to Lys-216 on
helix 7, which is adjacent to Pro-50. As a result, the open region
between helices 2 and 7 may be of functional importance in forming the retinal binding site. Fig. 3 C presents the contact plot
for helices 1 (
A) and 2 (
B) in cytochrome b of the
cytochrome bc1 complex (1BE3). These two helices coil in a
left-handed geometry, form close contacts along their entire length,
and serve as part of the scaffold for coordinating hemes
bL and bH in cytochrome
b. The helix crossing angles used to characterize left- and
right-handed helix pairs for the analysis in Table 3 were calculated
using the program define_structure (Richards and Kundrot, 1988
).
|
Statistics: test of significance
The z-test was used to evaluate whether the
calculated differences in the average (mean) packing values are
significant or simply result from sample variability. The
z-test evaluates the difference in the mean values of two
sets of data based on the number of elements in the data set and the
standard deviation between the elements. We applied the null hypothesis
to compare the average packing values. The null hypothesis gives
probabilities that the difference of mean values between two
populations originates from sample variability. If the difference is
significant and does not result simply from large sample variability,
then the p-values are low. P-Values of <0.05
indicate that there is a >95% probability that the difference is
significant. In Tables 1 and 2 we divided the results of the
z-test into three classes: p
0.05 (not
significant), 0.01
p < 0.05, and p < 0.01, marked as
, +, and ++, respectively.
|
|
Database of membrane and soluble proteins
The database used for our analysis included 11 membrane and 23 soluble
-bundle proteins. The 11 transmembrane proteins are all
-helical and of known structure with a resolution of
3.5 Å. Only
nonhomologous proteins were used in the analysis. When several crystal
structures of homologous proteins were present in the Protein Data
Bank, the highest-resolution structure was chosen. We included the
cytochrome bc1 complex from Saccharomyces cerevisiae (1EZV) rather than either of two lower-resolution structures (1BE3 and 1BCC). Similarly, the photosynthetic reaction
center from Rhodopseudomonas viridis (1DXR) was included rather than the reaction center from R. sphaeroides (1AIJ) or a lower resolution structure from R. viridis (6PRC). In
the case of the two structures of the light-harvesting complex (1KZU and 1LGH) whose structures have similar resolution (2.5 Å and 2.4 Å,
respectively) and R factors (22% and 21%, respectively), we selected
the complex from R. acidophila (1KZU) whose transmembrane helices have much lower thermal factors (16.1 vs. 31.5, respectively). We included bacteriorhodopsin (1C3W) and excluded halorhodopsin (1E12) because these proteins have the same architecture and their functions can be interconverted by a single amino acid substitution. We
did not include membrane-associated proteins (e.g., Lpp-56) or membrane
proteins that did not include transmembrane helices (e.g., TolC). The
data set was selected from those structures deposited in the Protein
Data Bank as of July 2001.
The Appendix lists the resolution and packing values of the proteins used in our analysis. Given the limited data set for helical membrane proteins, we have chosen to include membrane proteins whose resolution ranges from 1.55 Å (1C3W) to 3.5 Å (1MSL). The wide variation of resolution has the potential to influence the conclusions drawn about membrane protein packing and interhelical contacts. However, the higher-resolution structures are generally associated with higher packing values. The four membrane protein structures with a resolution of 2.2 Å or less have packing values well above the average packing value for soluble proteins. Moreover, the resolution of even the 3.5 Å structure of the mechanosensory channel (1MSL) is sufficient to define the relative orientation of the transmembrane helices for the helix-packing analysis. As indicated above, to account for some uncertainty in atomic positions, we define interfacial residues as those that occur at a local minimum in the contact plots or within 0.5 Å of a local minimum.
For comparison with membrane proteins, we analyzed the family of
soluble proteins classified as
-bundle proteins. These helical soluble proteins are the most similar in architecture to membrane proteins, and consequently provide the best database for comparison. The soluble proteins selected have known x-ray structures and show no
homology. We selected proteins classified as
-bundle proteins in the
CATH database as of May 2001 (http://www.biochem.ucl.ac.uk /bsm/cath_new/index.html; Michie et al., 1996
; Orengo et al., 1997
),
and used only those proteins that have at least three helices of nine
or more residues. The CATH classification of
-bundle proteins are
those that "must have at least 60%
and less than 5%
secondary structure assignment, with at least 50%
-
and less
than 5%
-
secondary structure contacts. The helices lie approximately parallel or antiparallel to one another. Specifically, pairwise angles between the helical axes of approximately 0° and 180° predominate."
The PDB codes for the helical membrane proteins analyzed are as follows: 1C3W (0.451), 1DXR (0.472), 1EUL (0.419), 1EZV (0.413), 1F88 (0.439), 1FX8 (0.466), 1J95 (0.424), 1KZU (0.415), 1MSL (0.387), 2OCC (0.451), 1QLA (0.426). The average packing values for the membrane helices are in parentheses. The hydrophobic boundaries of the helices were assigned based on the position of basic and acidic residues that bracketed the central hydrophobic portion of the helix.
The PDB codes for the soluble
-bundle proteins analyzed are as
follows: 1A17 (0.385), 1B5L (0.411), 2BCT (0.438), 2CCY (0.393), 1DVK
(0.413), 1ECM (0.369), 1FT1 chain A (0.447), 1IHB (0.423), 1LIS
(0.389), 1LRV (0.417), 1POC (0.420), 1VDF (0.387). The average helix
packing values are in parentheses. The helix assignments for soluble
proteins were taken directly from the PDB files. Only helices with nine or more residues were considered.
The PDB codes for the soluble
-bundle domains analyzed are as
follows with the average packing values for the helices in parentheses:
1A26 (0.393), 1A5T (0.390), 1BUC (0.429), 1CHK (0.441), 1CIY (0.407),
1DIK (0.452), 1FUP (0.404), 1KNY (0.408), 1MTY (0.430), 1VNS (0.450),
1YGE (0.400). The helix assignments for the soluble protein domains
were taken directly from the PDB files. Only helices with nine or more
residues were considered.
| |
RESULTS |
|---|
|
|
|---|
Helix packing in membrane and soluble
-bundle proteins
The internal packing of membrane proteins of known structure has
been studied in detail using the method of occluded surfaces. Table 1
presents a comparison of the average residue packing values of 11 helical membrane proteins and 23 soluble
-bundle proteins and
-bundle domains. The average amino acid packing values were
calculated by taking the sum of the packing values for each individual
amino acid of a given type (e.g., all Ala residues in membrane
proteins) and dividing by the total number of those amino acids. The
average protein packing values were calculated by taking the sum
of the packing values for each individual amino acid in a helix of the
protein and dividing by the total number of amino acids.
We first compare the average packing values for amino acids in helices
of membrane proteins (average PV = 0.441) with that for soluble
-bundle proteins and
-bundle domains (average PV = 0.418).
The average packing value for membrane proteins is distinctly higher.
Based on the number of residues that compose the data set and the
standard deviation between the individual amino acid packing values, it
is possible to assess the significance of the difference between the
average or mean packing values using the z-test. The
z-test indicates that there is a >99.9% probability (p < 0.001) that the higher average packing value
calculated for membrane proteins is statistically significant. The
significance of this result is better appreciated when one considers
that 8 of the 11 membrane proteins studied have higher packing values than the average soluble protein packing value. The membrane
protein with the lowest packing value (0.387) is the mechanosensitive channel (1MSL), which has a nonselective central ion pore.
One of the advantages of the OS method is that packing values are calculated for individual amino acids. This allows us to assess how different amino acids contribute to the average protein packing density. Table 1 summarizes the average residue packing values for each of the 20 amino acids. Of note is the observation that glycine has the highest overall packing value in membrane proteins (0.524), followed by proline (0.507) and alanine (0.488). In soluble proteins, Cys (0.497) has the highest overall packing value, followed by Phe (0.491) and Ile (0.482). When comparing the packing value differences between membrane and soluble proteins, the z-test (Table 1, right column) provides a convenient way to assess significance. For instance, the higher packing values for glycine and proline in membrane proteins are significant, whereas the higher packing value observed for alanine (0.488 vs. 0.472) is not significant.
A second way to assess how different amino acids contribute to the
average protein packing value is to calculate the occurrence of each
amino acid type as a function of packing value. Fig.
4 lists the most abundant amino acids
having high packing values (>0.55), intermediate packing values
(0.55-0.30), and low packing values (<0.30). Only amino acids with an
occurrence of >5% are included. The most striking result of this
analysis is the high abundance of Gly, Ser, and Thr in the tightly
packed category for membrane proteins. In contrast, the most tightly
packed residues in soluble
-bundle proteins are Ala, Leu, Val, Gly,
Ile, and Phe. Four of these residues (Leu, Ala, Ile, and Val) have high occurrences in the "a" and "d" positions of left-handed
coiled-coil structures (Cohen and Parry, 1990
). Another interesting
observation is that phenylalanine, which has a low average residue
packing value (0.408) in membrane proteins and a very high packing
value in soluble proteins (0.491), contributes to all three packing ranges in membrane proteins. This implies that Phe is a very versatile amino acid in terms of membrane protein structure.
|
Helix-helix interactions in membrane and soluble
-bundle
proteins using contact plots
The first challenge in characterizing helix-helix interactions is
to define the helix-helix interface and the specific residues that are
involved in mediating helix-helix association. The approach we have
taken is to generate contact plots for all interacting pairs of helices
in a protein structure. The contact plots have several advantages,
including that they are easy to generate and there is a straightforward
visual correlation between the contact plot and the geometry of the
interacting helix pair. The interfacial residues are defined as those
located at a local minimum. For each turn of an
-helix there are
either one or two residues that fit this definition. In an idealized
left-handed coiled coil of helices, our definition would correspond to
the residues at the "a" and "d" positions. The other positions
are considered noninterfacial. These may be involved in helix-helix,
helix-lipid, or helix-water interactions. An alternative definition of
interfacial residues involving contact between Voronoi polyhedra that
share common edges is less restrictive and would include residues in
the "e" and "g" positions of a heptad repeat (Adamian and
Liang, 2001
).
The contact plots shown in Fig. 3 illustrate that helices have a wide
range of relative orientations. The helix interface can be very closely
packed in the region where the helices cross and can be very loosely
packed in the region where the helices diverge. We have constructed
contact plots for 142 interacting helix pairs in membrane proteins and
190 interacting helix pairs in soluble
-bundle proteins and
-bundle domains. Fig. 5 presents the
distribution of the minimum backbone-to-backbone distances between
helices in membrane proteins (filled bars) and soluble proteins
(open bars). The minimum distances range from 3.0 to 7.8 Å. For
soluble proteins, the distribution of minimum distances is symmetric
about an average backbone-to-backbone distance of 5.22 Å. For membrane
proteins, the distribution is clearly different. The higher relative
occurrence of short distances is consistent with the high abundance of
residues with small side chains in the most tightly packed class
(>0.55) in the packing analysis shown in Fig. 4. The average minimum
distance for membrane proteins is 5.10 Å. Based on the
z-test, however, the difference between the average minimum
distances for membrane and soluble proteins is not statistically
significant (p
0.05).
|
To characterize the nature and distribution of amino acids in
interfacial and noninterfacial positions, we divided the 2118 residues
in membrane protein helices and the 2604 residues in soluble
-bundle
protein helices into three categories: those at a local minimum in the
helix contact plots and having a minimum backbone-to-backbone distance
of
6 Å, those at a local minimum in the helix contact plots and
having a minimum backbone-to-backbone distance of >6 Å, and those not
at a local minimum. The first category represents ~50% of the
interfacial residues, and we use the terms interface and interfacial
residues for these amino if not otherwise stated. A 6 Å minimum
distance between backbone atoms corresponds to an axial separation of
~11 Å, and includes most residues that would be either close or
moderately packed in helix interfaces. (For comparison, the average
minimum backbone-to-backbone distance in the long left-handed coiled
coil of GCN4 is 5.7 Å, while the average axial separation for membrane
and soluble proteins is ~9.6 Å).
Table 2 lists the occurrence of contact residues in helix interfaces by
amino acid type for membrane proteins and for soluble
-bundle
proteins and
-bundle domains. The most striking result in Table 2 is
that in membrane proteins, compared to soluble
-bundle proteins,
there are twice as many residues that have occurrences of >5%. This
clearly shows that membrane proteins have a more diverse set of
interactions mediating close helix-to-helix contacts.
Fig. 6 illustrates the results of the
contact plot analysis by listing the most abundant residues with
occurrences of 5% or greater in the three categories defined above. In
the first category (interfacial residues where the minimum backbone
separation is
6 Å), there are eight amino acids in membrane proteins
that are statistically overrepresented, whereas in soluble proteins
there are only four amino acids. For both membrane and soluble
proteins, the most abundant interfacial residue is alanine. Membrane
and soluble proteins diverge at the second most abundant amino acid, which is glycine (12.1%, a total of 82) in membrane proteins and leucine (15.8%, a total of 119) in soluble
-bundle proteins (Table 2, Fig. 6).
|
Finally, a comparison of the results summarized in Table 1 and Fig. 6
shows that in general, small amino acids pack more tightly in membrane
proteins, whereas large amino acids pack more tightly in soluble
-bundle proteins. For instance, Leu, Ile, and Val have higher
occurrences and higher packing values in soluble proteins, whereas Gly,
Ser, and Thr have higher occurrences and higher packing values in
membrane proteins.
Amino acid propensities in helix interfaces
The packing and contact analyses, which are summarized in Figs. 4
and 6, are based on total occurrences. Leu and Ala have the highest
total occurrences in the helices of both membrane and soluble
-bundle proteins (Table 1), and consequently it is not surprising
that these two residues rank high in the analyses. A more complete
picture for how membrane and soluble proteins use different amino acids
to mediate helix interactions is given by calculating amino acid
propensities. Propensities will not be influenced by total
occurrences, but rather will provide an "intrinsic" measure of
whether an amino acid is likely to be found in an interfacial or
noninterfacial position. We define interfacial propensity as the
interfacial occurrence of an amino acid (Table 2) divided by its total
occurrence (Table 1). Table 2 lists the interfacial propensity by amino
acid type for residues with backbone-to-backbone separations of
6 Å.
The amino acids with the highest interfacial propensities are Pro and
Cys in membrane and soluble proteins, respectively.
The drawback of interfacial propensities is that it does not take into account the total interfacial occurrence of an amino acid. As a result, residues that are fairly rare in helices (e.g., Cys, with an occurrence of only 2%) can rank very high. The way to combine the intrinsic propensities with total occurrences is to calculate a "weighted propensity" by multiplying the interfacial (or noninterfacial) propensity by the interfacial (or noninterfacial) occurrence. Table 3 lists the amino acids by weighted propensities for both interfacial and noninterfacial amino acids. Fig. 7 depicts the results for those residues with a weighted propensity of >5%. This analysis complements the analyses based on total occurrences and on propensities alone. The most striking result from Fig. 7 is the high weighted propensities of the small residues Ala, Gly, and Ser for mediating helix-to-helix interactions in membrane proteins.
|
|
Occurrence and packing of amino acids in helices with left- and right-handed crossing angles and with parallel and antiparallel orientations
The first two sections above greatly extend our previous studies using the occluded surface analysis and helix contact plots to characterize amino acid packing and helix-helix interactions, respectively. The results of these methods can now be combined to investigate the differences in packing for helices having left- or right-handed crossing angles and having parallel or antiparallel helix orientations.
Crick (1953)
originally introduced the "knobs-into-holes" model to
describe the role of steric contacts and surface complementarity in
helix-to-helix packing. He found that the optimal packing angle between
helices was at +20°, corresponding to the angle formed for helices
forming left-handed coiled coils. A second preferred packing angle of
70° was also described. Subsequently, this basic model has been
refined (Richmond and Richards, 1978
) and several additional models
have been proposed (Chothia et al., 1977
, 1981
; Walther et al., 1996
).
Most recently, Bowie (1997a)
and Walther et al., (1998)
have shown that
if one accounts for the statistical bias toward crossing angles with
perpendicular orientations, there is a preference for helices to be
aligned in a parallel or antiparallel fashion. Bowie (1997a)
argued
that this preference does not agree with the regular packing models. In
this section we address the question of whether differences are
observed in the nature and distribution of residues in transmembrane
helices having left- and right-handed crossing angles. The results are
compared to soluble
-bundle proteins.
We determined the packing angles for all of the helix pairs used in the
contact analysis. The distribution of angles agrees with the
distributions observed by Senes et al. (2001)
and others (Bowie, 1997b
;
Walther et al., 1998
). Left-handed crossing angles are preferred in
both membrane (61%:39%) and soluble
-bundle proteins (62%:38%),
with antiparallel left-handed orientations being favored in membrane
proteins (42% of all helix pairs). Table 4 lists the packing values and
occurrences for all interfacial residues in helices that have either
left- or right-handed crossing angles for both membrane and soluble
-bundle proteins. The relative occurrences of amino acids in the
interfaces of helices with left- and right-handed crossed angles are
nearly the same for both membrane and soluble
-bundle proteins. The
only notable exception is that of Leu in soluble proteins where there
is a much higher interfacial occurrence in helices with left-handed
crossing angles.
|
Table 4 also lists the average packing values for the interfacial
residues in helices with left- and right-handed crossing angles in
membrane and soluble
-bundle proteins. The data address whether
there are statistically significant differences between left- and
right-handed packing. The average packing value for helices with
left-handed crossing angles in membrane proteins (0.518) is
significantly higher (p < 0.01) than for helices with right-handed crossing angles (0.501). This difference agrees with the
smaller average packing angles that are found in helices with left-hand
crossing angles because smaller crossing angles will allow the helices
to be more closely associated along their entire length. The average
packing value for left-handed helices in membrane proteins is also
significantly higher than the average packing values for helices in
soluble proteins with either left- or right-handed crossing angles.
This may simply reflect the fact that transmembrane helices are much
longer, on average, than helices in soluble proteins, and exhibit a
strong preference for the optimal +20° left-handed crossing angle
characteristic of coiled coils. Bowie (1997a)
proposed that this strong
preference is due to regular helix packing. In left-handed coiled coils
the side chains of one helix pack into the "holes" or "grooves"
on the opposing helix and greatly restrict the helix crossing angle. In
contrast, Bowie (1997a)
found that the helix packing angles in soluble
proteins were more variable and not well described by regular packing
models. This would imply that the side chains would be less tightly
packed and there would be no difference in the packing of helices with
left- or right-handed crossing angles.
We further determined helix packing as a function of the helix
orientation in membrane proteins. In our data set of membrane proteins,
antiparallel orientations (66%) are favored over parallel orientations. This agrees with the previous observation of Bowie (1997b)
. We find that helix pairs with antiparallel orientations are
more tightly packed than helix pairs with parallel orientations and
have an average residue packing value of 0.522 for those amino acids in
the
6 Å interface. In contrast, helix pairs with parallel orientations have an average packing value of 0.500. This difference is
significant (p < 0.001). If we consider all
interfacial residues, the average packing value of helices with
antiparallel orientations (0.495) is still significantly higher
(p < 0.001) than that for helices with parallel
orientations (0.478). These results are in agreement with Bowie (1997b)
who showed that helix pairs with antiparallel orientations tend to have
larger contacting surfaces.
| |
DISCUSSION |
|---|
|
|
|---|
Membrane proteins have a higher diversity of residues in helical
interfaces than soluble
-bundle proteins
The combined contact-packing analysis reveals that membrane
proteins have a much higher diversity of interhelical interactions than
soluble proteins, and that there is a high propensity for small and
polar residues in closely packed helix interfaces. The contact plots
not only reveal which amino acids line the helix-helix interfaces, they
also provide information on pairwise interactions. Tables
5 and
6 present the pairwise
interactions for amino acids in the helix interfaces of membrane and
soluble
-bundle proteins. The pairwise contacts were calculated for
helix interfaces with backbone separations of
6 Å. As might be
expected, there is a much broader distribution of pairwise interactions
in membrane proteins than in soluble
-bundle proteins. Significant
numbers of pairwise contacts are made between Leu, Ala, Val, and Ile in both membrane and soluble proteins, whereas in membrane proteins there
are a large number of contacts that are also made by small and polar
residues.
|
|
The distributions of pairwise contacts were also calculated for
amino acids in helix interfaces of membrane and soluble
-bundle proteins with backbone separations >6 Å (data not shown). The distributions were considerably different than those in Tables 5 and 6
and in both cases were dominated by large residues (mainly Leu, Ile,
Phe, and Val). The highest pairwise interactions in membrane proteins
were Leu-Val (32), Leu-Phe (28), and Leu-Ile (27), while in soluble
proteins Leu-Leu (62) contacts dominated the pairwise interactions,
followed by Leu-Ile (39) and Leu-Ala (34). This analysis complements
the recent work of Adamian and Liang (2001)
who investigated the
pairwise interactions of residues in membrane proteins using Voronoi
constructions to define interacting neighbors. They reached a similar
conclusion that membrane proteins exhibit a high diversity of helix
interactions, and noted that transmembrane helices have a larger
variety of polar-polar interactions than soluble proteins.
The high diversity of helix-helix interactions in membrane proteins is
likely to be related to membrane protein function and structure. Unlike
soluble proteins, where the functional sites are on the protein surface
in active site clefts or grooves, the functional sites in membrane
proteins are often in the protein interior. These sites typically
contain highly polar amino acids. For instance, Lys-216 in the interior
of bacteriorhodopsin is the site of attachment for the protein's
retinal chromophore. The Lys-216 side chain is packed against Pro-50
and Ala-53 in the interface between helices 2 and 7 (see Fig.
3 B). Lysine, along with Asp, Glu, Arg, Asn, and Gln, are
relatively rare in transmembrane helices. The importance of these
highly polar amino acids in forming diverse pairwise interactions is
not reflected in Tables 5 and 6, which list absolute occurrences. These
residues do have high pairwise propensities as seen in the
analysis of Adamian and Liang (2001)
.
Small residues have high propensities for packing in helix interfaces in membrane proteins
The combined contact-packing analysis shows that small and polar
residues have a high weighted propensity to occur in transmembrane helix interfaces and are among the most tightly packed amino acids in
membrane proteins. Our analysis strongly argues that helix interactions
are both qualitatively and quantitatively different between membrane
and soluble proteins. The comparison can be further quantified by
plotting the propensity as a function of amino acid volume (Fig.
8, A and B) and
hydrophobicity (Fig. 8, C and D). For membrane
proteins (Fig. 8 A), there is a rough linear correlation (R = 0.70) between the propensity and volume. This correlation does not hold for soluble
-bundle proteins (Fig. 8 B)
(R = 0.28). The observation that small residues have a high
propensity for lining the interfaces between helices also agrees with
an analysis of surface roughness in helical membrane proteins that
shows that in general the lipid-exposed surface is rough, whereas the
helix-helix interfaces are smooth (Renthal, 1999
).
|
In contrast, for soluble
-bundle proteins and domains (Fig.
8 D), there is a rough linear correlation (R = 0.66)
between the weighted interfacial propensity and hydrophobicity.
This correlation does not hold for membrane proteins (Fig.
8 C) (R = 0.31). This is not surprising because the
folding of soluble proteins is driven by the hydrophobic effect. In
membrane proteins, large hydrophobic residues (Leu, Phe, Val, and Ile)
have the highest occurrences on the lipid-exposed surface of membrane
proteins (Fig. 6).
The fact that small and polar residues emerge from the analysis of
helix interactions in membrane proteins based on both absolute occurrences and propensities emphasizes their importance. Small and
polar residues also occur in the interfaces of soluble proteins; however, neither their absolute occurrences nor interfacial
propensities are as striking as in membrane proteins. There are several
studies where small/polar residue packing is highlighted. Richmond and Richards (1978)
, in a comprehensive analysis of helix packing in sperm
whale myoglobin, described the packing of Gly-25 and Gly-59 at the
crossing point of helices B and E. They suggested that the crossing
angle between helices would be inversely correlated with the volume of
the central residue in a helix. Reddy and Blundell (1993)
showed that
the axial separation between helices is dependent on the volume of
the interfacial residues. Efimov (1979)
showed that the axial
separation was less in helices that were packed in a "polar"
fashion, i.e., where hydrophilic residues lie on one face of a pair of
interacting helices. More recently, Walther et al. (1996)
found that
helices with different axial separations used different packing
patterns. They concluded that Ala has the highest packing flexibility,
in agreement with our results where Ala is found to have a high
occurrence and propensity in the interfaces of both membrane and
soluble
-bundle proteins.
Two general motifs exist for helix-helix interactions in membrane proteins
One of the conclusions that emerges from our analysis is that
there are statistically significant differences between membrane and
soluble
-bundle proteins. The higher diversity of membrane protein
interactions and the propensity of small and polar residues in tightly
packed interfaces suggests that membrane proteins have at least two
general motifs for mediating helix interactions. Both membrane and
soluble proteins exhibit "knobs-into-holes" packing exemplified by
"leucine zippers" (Cohen and Parry, 1990
; Langosch and Heringa,
1998
). Membrane proteins, however, have a second general motif,
exemplified by the dimer interface of glycophorin A (Lemmon et al.,
1992
; Smith et al., 2001
), in which small and polar residues form
smooth surfaces that allow very close approach of the backbones of
interacting helices. In this section we present examples of these two
general motifs in polytopic membrane proteins and discuss recent
studies in which these motifs were found to mediate the dimerization of
membrane proteins with single transmembrane helices.
The first motif, which is common to both membrane and soluble
proteins, is exemplified by the heptad repeat of leucine residues, LxxLxxxLxx, characteristic of leucine zippers. Analysis of left-handed coiled coils shows that the predominant residues in the "a" and "d" positions of this motif are Leu (33%), Ala (16%), Ile (10%), and Val (7%) (Cohen and Parry, 1990
). These four amino acids dominate the core residues involved in helix-helix interactions in soluble proteins (53%, Table 2), and contribute significantly to helix interactions in membrane proteins (42%, Table 2). In an analysis of
three membrane proteins, Langosch and Heringa (1998)
found that
transmembrane helices exhibited knobs-into-holes packing characteristic
of left-handed coiled coils in soluble proteins. They concluded that
helix packing is less compact than in soluble proteins. This correlates
with our results summarized in Fig. 4, which shows that Leu, Val, Ala,
and Ile are the most abundant amino acids in the intermediate packing
region (0.55-0.30) in membrane proteins, whereas they dominate the
high packing region in soluble proteins. Moreover, Langosch and
co-workers showed that a heptad motif of leucine residues can drive the
association of designed membrane proteins with single transmembrane
helices (Gurezka et al., 1999
).
Fig. 9 presents an example of
knobs-into-holes packing characteristic of leucine zippers. The helix
pair is from the structure of the Ca2+-ATPase (1EUL). The
interfacial residues shown are Val-93, Ile-94, Ile-97, Asn-101, and
Val-104 for helix M2, and Leu-793, Leu-797, and Leu-802 for helix M6.
There are several interesting features exhibited by this helix pair.
First, there is an Asn in helix M2 that is involved in interhelical
hydrogen bonding. Asn hydrogen bonding is a hallmark of the leucine
zipper coiled coil of GCN4. Second, there is a
-bulge near Leu-802
in helix M6. The presence of this distortion in the helix does not
disrupt the knobs-into-holes packing arrangement. Finally, the minimum backbone-to-backbone distance is 6.28 Å. As a result, this helix pair
falls into the >6 Å category defined for our contact analysis. In
this category, Leu and Ile have the highest weighted propensities (Fig.
7).
|
The second motif that appears to be common in helix-to-helix
association in membrane proteins is exemplified by the GxxxG motif
observed in glycophorin A. The GxxxG sequence was the most dominant
motif found in a statistical analysis of membrane protein sequences
(Senes et al., 2000
) and dimerization-dependent screens (Russ and
Engelman, 2000
) for helix interactions in membrane proteins having
single membrane spanning helices. In these studies, serine is the most
common residue (after glycine) found at the position of one of the
glycines in the motif. Our analysis strongly suggests that a similar
general motif exists in polytopic membrane proteins where the
interfacial positions are occupied by Gly, Ser, Ala, and Thr.
Fig. 10 presents an example of the
"small and polar residue" motif from the glycerol-facilitator
channel (1FX8). In this protein, there are six full-length
transmembrane helices and two "half" helices (Fu et al., 2000
). The
three helix pairs shown in panels A-C correspond to the six
full-length transmembrane helices. The interfaces of all three helix
pairs are lined by small and polar residues, and have close (
6
Å) backbone contacts. Close glycine-glycine contacts are involved in
each helix pair, and at least one interfacial glycine is highly
conserved across the large family of membrane channels known as
aquaporins, of which the glycerol-facilitator channel is a member (Fu
et al., 2000
). Two of the helix pairs (helices M1-M4 and M2-M6)
involve a Ser residue that is in a position to form an interhelical
hydrogen bond. Ser-92 on helix M4 may hydrogen-bond to Cys-11 on helix M1, and Ser-45 on helix M2 is in a position to hydrogen-bond to the
backbone carbonyl of Ala-192 on helix M6.
|
A unique structural role for small residues in helix interfaces may
involve stabilization of helix dimers via dipolar interactions involving backbone amide C==O and N---H groups or direct
hydrogen-bonding interactions involving the polar side chains of Ser,
Thr, or Cys. Both types of interactions are facilitated by short
interhelical spacing in the region of Gly-Gly and Gly-Ala contacts. The
importance of hydrogen bonding interactions in hydrophobic protein
interiors cannot be overstated because hydrogen bond strengths are much higher in membrane environments (Pace, 2000
).
We have examined all potential hydrogen bonding interactions of transmembrane Ser and Thr residues by looking at the distances between the side chain hydroxyl oxygens and all other heteroatoms within 3.4 Å. Based on this analysis (data not shown), most Ser (70%) and Thr (79%) residues in transmembrane helices hydrogen bond to the i-3 or i-4 backbone carbonyl. Nevertheless, ~20-30% of Ser and Thr residues in membrane proteins are in a position to form interhelical hydrogen bonds. Of the Ser and Thr residues that can form interhelical hydrogen bonds to backbone C==O and N---H groups across the helix interface, the hydrogen-bonding partners are predominantly Ser (28%) and Ala (16%), similar to that seen in the interface of the glycerol facilitator protein.
It is interesting to note in regard to interhelical hydrogen bonding
that Ser and Thr were largely ineffective for driving helix association
in the context of model transmembrane helices containing predominantly
leucine (Gratkowski et al., 2001
; Zhou et al., 2001
). In these studies,
the more polar (and much less abundant) amino acids, such as Asn, Asp,
Glu, and Gln, were able to drive dimerization. We suggest that Ser and
Thr are ineffective in the context of large bulky residues, and that
the "small and polar" motif provides specificity for dimer
formation in both single-pass and polytopic membrane proteins.
A strong correlation exists between helix packing and interhelical propensity
In soluble proteins, protein stability is closely correlated with
the packing of core residues (Richards, 1997
). For instance, increased
packing appears to be one mechanism by which the extremely stable
hyperthermophilic proteins gain increased stability over their
mesophilic counterparts (DeDecker et al., 1996
). One of the motivations
for the current study was to combine the analysis of amino acid packing
in membrane proteins with an analysis of helix contacts to address the
mechanism of membrane protein stability.
The results in Table 2 indicate that the high packing values in membrane proteins are associated with interfacial interactions. The correlation between packing and interfacial interactions can be further quantified by plotting the packing value as a function of the propensity to occur within the helical interface (Fig. 11). A linear correlation (R = 0.84) exists between the packing values determined by the OS method and the interfacial propensity derived from the helix contact plots. These data strongly argue that membrane protein stability (as expressed by packing) has a strong contribution from small and polar interfacial amino acids.
|
Finally, the correlation between packing and interfacial propensity
suggests that interfacial propensities may provide a useful scale for
predicting the relative orientation of transmembrane helices. Helical
faces with a large number of small and polar residues would be
predicted to be closely packed in helix-helix interfaces. Our analysis
agrees with that of Rees et al. (1989)
that the amino acids on the
lipid-exposed surface of membrane proteins are more hydrophobic than
interior residues. Using the hydrophobicity scale proposed by Engelman
et al. (1986)
, the residues in the helix interfaces of membrane
proteins are less hydrophobic on average (
1.07) than the
noninterfacial residues (
1.42). The data also agree with the
conclusions of Stevens and Arkin (1999)
that the hydrophilic moment of
the transmembrane helices is often not oriented toward the center of
the transmembrane helical bundle. Rather, the helical hydrophilic
moment is often oriented between tightly packed helix pairs containing
polar residues, as shown in Figs. 9 and 10. Together, these data
illustrate a high degree of complexity in the internal architecture of
membrane proteins (compared to soluble helical proteins) and reveal
basic strategies used by membrane proteins for forming tight
interactions between hydrophobic helices in membrane environments.
| |
APPENDIX |
|---|
|
|
|---|
Tables 7 and
8 list the packing values and resolution
for the 11 helical membrane proteins and 23 helical
-bundle proteins and protein domains used in our analysis. For the 23 soluble proteins, the average resolution was 2.15 Å (standard deviation 0.34 Å and average R factor of 19.1). For the 11 membrane proteins, the average resolution was 2.43 Å (standard deviation 0.51 Å and an average R
factor of 21.9). Fig. 12 plots the
packing values of the membrane (A) and soluble proteins
(B) used in our analysis as a function of their resolution.
For the membrane protein structures, the packing values tend to be
higher for the higher-resolution structures. The six highest-resolution
structures have packing values of 0.451 (1C3W, 1.55 Å), 0.472 (1DXR,
2.0 Å), 0.466 (1FX8, 2.2 Å), 0.426 (1QLA, 2.2 Å), 0.413 (1EZV, 2.3 Å), and 0.451 (2OCC, 2.3 Å). The packing values for these structures
with the exception of 1EZV are all well above the average packing value
(0.418) of the 23 soluble
-bundle proteins.
|
|
|
| |
ACKNOWLEDGMENTS |
|---|
We thank Erwin London and Stuart McLaughlin for valuable discussions.
This work