help button home button Biophys. J.
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS

This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Noskov, S. Yu.
Right arrow Articles by Lim, C.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Noskov, S. Yu.
Right arrow Articles by Lim, C.

Biophys J, August 2001, p. 737-750, Vol. 81, No. 2

Free Energy Decomposition of Protein-Protein Interactions

Sergey Yu. Noskov* and Carmay Lim*dagger

 *Institute of Biomedical Sciences, Academia Sinica, 11529 Taipei, and  dagger Department of Chemistry, National Tsing-Hua University, 300 Hsinchu, Taiwan


    ABSTRACT
TOP
ABSTRACT
INTRODUCTION
METHODS
RESULTS
DISCUSSION
CONCLUSIONS
REFERENCES

A free energy decomposition scheme has been developed and tested on antibody-antigen and protease-inhibitor binding for which accurate experimental structures were available for both free and bound proteins. Using the x-ray coordinates of the free and bound proteins, the absolute binding free energy was computed assuming additivity of three well-defined, physical processes: desolvation of the x-ray structures, isomerization of the x-ray conformation to a nearby local minimum in the gas-phase, and subsequent noncovalent complex formation in the gas phase. This free energy scheme, together with the Generalized Born model for computing the electrostatic solvation free energy, yielded binding free energies in remarkable agreement with experimental data. Two assumptions commonly used in theoretical treatments; viz., the rigid-binding approximation (which assumes no conformational change upon complexation) and the neglect of vdW interactions, were found to yield large errors in the binding free energy. Protein-protein vdW and electrostatic interactions between complementary surfaces over a relatively large area (1400-1700 Å2) were found to drive antibody-antigen and protease-inhibitor binding.


    INTRODUCTION
TOP
ABSTRACT
INTRODUCTION
METHODS
RESULTS
DISCUSSION
CONCLUSIONS
REFERENCES

To understand molecular recognition of protein surfaces, it is important to obtain the quantitative contributions of the individual forces governing binding affinity and specificity. Consequently, it is useful to develop reliable methods for computing absolute or relative free energies that can dissect the relative magnitudes of different thermodynamics effects. Many such methods have been developed and they fall generally into three classes. The first class includes free energy simulation methods (Kollman, 1996; Tembe and McCammon, 1984) that treat solute and solvent atoms explicitly with an atomic force-field description. These simulation methods have a rigorous statistical mechanics basis and can be used to compute the difference between the binding free energies of two closely related medium-sized compounds (e.g., ligands differing only in a single residue) in solution fairly accurately (Jorgensen and Tirado-Rives, 1995). However, they are less reliable for ligand-protein or protein-protein association due mainly to the problem of sampling the relevant configurations of the solute and solvent (McCammon, 1998).

The second class encompasses empirical approaches that use parameters obtained from a "training set" of known interactions (Andrews et al., 1984; Bohm, 1994; Horton and Lewis, 1992; Weng et al., 1997; Williams et al., 1993). The binding free energy is taken to be a sum of different terms associated with hydrophobic contacts, hydrogen bonds or salt bridges, and conformational entropy loss. The various terms are usually not derived from theoretical backgrounds (e.g., statistical thermodynamics or molecular mechanics), but from available binding data in a statistical manner. The key advantage of empirical methods is their speed. However, these phenomenological scoring functions rely critically on the quality of the training database.

The third class includes methods that treat the solute atoms explicitly and the solvent as a continuum dielectric medium (Kollman et al., 2000). The electrostatic contribution to the solvation free energy is computed using finite difference Poisson-Boltzmann (PB) (Honig and Nicholls, 1995) methods or the Generalized Born (GB) approximation (Still et al., 1990; Qiu et al., 1997), whereas the nonelectrostatic contribution is treated as an empirical function of the solvent-accessible surface area (SASA) (Still et al., 1990). These methods appear promising for certain systems because they represent a good balance between speed (by avoiding sampling of solvent configurations) and accuracy (by incorporating long-range electrostatics, ionic strength, and polarization effects).

Although the computed absolute/relative binding free energies for various systems have been reported to be in close agreement with the respective experimental values, this does not necessarily mean that the underlying theory or scheme for computing the binding free energy is accurate. This is because the agreement may be fortuitous and errors due to experimental issues may mask the errors inherent in computing the various free energy components. For example, in many cases the x-ray structure of the complex, but not its unbound counterparts, is known. In such cases, the binding free energy is often estimated from the atomic coordinates of the complex alone assuming no conformational change upon complexation (rigid-binding approximation) (Novotny et al., 1989; Williams et al., 1991; Horton and Lewis, 1992; Jackson and Sternberg, 1995; Friedman and Honig, 1995; Froloff et al., 1997; Novotny et al., 1997; Olson, 1999). In cases where the atomic coordinates of both free and bound proteins are available, the protein may have been crystallized at a pH that is significantly different from the pH at which the binding free energy was measured. Furthermore, NMR structures are often solved at low pH. However, the free energy for protein association that is mediated by ionizable residues can be very sensitive to pH (Gibas, 1997; Xavier and Wilson, 1998). Discrepancy between the computed and experimental binding free energies has been attributed to the use of low pH structures rather than shortcomings in the free energy function/calculations, like the neglect of van der Waals (vdW) interactions (Krystek et al., 1993).

Partly due to the above issues, the various theoretical studies have not reached a consensus regarding the major force driving the binding process. Pauling concluded that the cooperation of weak vdW and hydrogen-bonding interactions between complementary surfaces over an area sufficiently large to overcome the disrupting influence of thermal agitation govern protein-protein recognition (Pauling, 1974; Pauling and Pressman, 1945). In contrast, Chothia and Janin suggested that vdW and polar interactions contribute little to complex stability, and it is hydrophobicity (the release of interfacial water) that is the major factor stabilizing protein-protein associations (Chothia and Janin, 1975; Kauzmann, 1959). This view is also implicitly assumed in studies that fit the experimental binding free energy to buried surface areas (Horton and Lewis, 1992). Honig and coworkers and other groups found that nonpolar interactions, represented by a free energy-surface area relationship that is assumed to account for the hydrophobic effect and enhanced interfacial packing, provide the major driving force for MHC class I protein-peptide, trypsin-inhibitor and lambda cl repressor protein-DNA association (Froloff et al., 1997; Jayaram et al., 1999; Misra et al., 1998). Note that Chothia and Janin as well as Honig and coworkers did not take vdW interactions into account explicitly in computing the absolute binding free energy.

In this work, Scheme 1 (see next section), in conjunction with continuum dielectric models, has been implemented to compute the absolute free energy of protein-protein association. Our initial goals are twofold: to assess the accuracy of our scheme for computing absolute binding free energies, and to assess the validity of the various assumptions in previous theoretical treatments like the rigid-binding approximation or the neglect of vdW interactions and vibrational entropy (Novotny et al., 1989; Williams et al., 1991; Horton and Lewis, 1992; Vajda et al., 1994; Jackson and Sternberg, 1995; Novotny et al., 1997; Froloff et al., 1997; Olson, 1999). To address our first goal, we chose systems for which experimental binding free energies are available for comparison with the computed values, and accurate structures are available to minimize errors due to experimental issues. In this way, any observed discrepancy between the calculated and experimental results would reflect primarily the assumptions or deficiencies in our methodology rather than inaccuracies in the experimental structures. Specifically, we chose systems for which the crystal structures of both the complex and its components have been solved to <= 1.9-Å resolution to eliminate errors arising from poor quality structures or absence of free protein structures. Having the x-ray structures of the unbound and bound proteins also enable us to account for conformational changes that accompany binding (especially at the interaction surfaces). Also, we verified that the pH at which the free and complex structures were solved is close to the pH at which the corresponding binding free energy was measured to minimize errors due to structural changes at different pH values. To address our second goal, the systems chosen satisfied not only the above criteria, but have also been studied using various approximations.

Only two systems were found to satisfy all of the aforementioned criteria. They are trypsin complexed with bovine pancreatic trypsin inhibitor (BPTI), and the fragment variable (Fv) region of mouse monoclonal antibody, D1.3, bound to hen egg-white lysozyme (HEL). Their experimental binding free energies, Protein Data Bank (PDB) entries (Bernstein et al., 1977) and various properties are listed in Table 1. The interface area is defined as the SASA difference between the complex and its components. Note that both systems correspond to association of positively charged proteins to form a +12e complex with only small changes in conformation, as evidenced by root-mean-square deviation (RMSD) of the backbone atoms in the free x-ray structure from that in the corresponding complex crystal structure of <1 Å (see Table 1). The excellent agreement found between the computed and experimental binding free energies (see Results) allowed us to identify the key forces governing protease-inhibitor and antibody-antigen complexation with interface areas in the range of 1400-1700 Å2. Finally, we compare our findings with results obtained in previous studies for the same or homologous systems as well as for other protein-protein complexes (see Discussion).


                              
View this table:
[in this window]
[in a new window]
 
TABLE 1   Properties of protein-protein complexes studied in this work


    METHODS
TOP
ABSTRACT
INTRODUCTION
METHODS
RESULTS
DISCUSSION
CONCLUSIONS
REFERENCES

Theory

The binding free energy calculations were based on the thermodynamic cycle
<AR><R><C>[<UP>r</UP><SUP><UP>min</UP></SUP>]<SUB><UP>gas</UP></SUB></C><C><UP>+</UP></C><C>[<IT>l</IT><SUP><UP>min</UP></SUP>]<SUB><UP>gas</UP></SUB></C><C> <LIM><OP><ARROW>→</ARROW></OP><UL>&Dgr;G<SUB><UP>gas</UP></SUB></UL></LIM> </C><C>[(<UP>R · L</UP>)<SUP><UP>min</UP></SUP>]<SUB><UP>gas</UP></SUB></C></R><R><C><UP> ↑</UP><SUP><UP>−&Dgr;G</UP><SUB><UP>isom</UP></SUB>(r)</SUP></C><C></C><C><UP> ↑</UP><SUP><UP>−&Dgr;G<SUB>isom</SUB></UP>(l)</SUP></C><C></C><C><UP> ↓</UP><SUB><UP>&Dgr;G</UP><SUB><UP>isom</UP></SUB>(R · L)</SUB></C></R><R><C>[<UP>r</UP>]<SUB><UP>gas</UP></SUB></C><C><UP>+</UP></C><C>[<UP>l</UP>]<SUB><UP>gas</UP></SUB></C><C></C><C>[<UP>R  ·  L</UP>]<SUB><UP>gas</UP></SUB></C></R><R><C><UP> ↑</UP><SUP><UP>−&Dgr;G<SUB>solv</SUB></UP>(r)</SUP></C><C></C><C><UP> ↑</UP><SUP><UP>−&Dgr;G<SUB>solv</SUB></UP>(l)</SUP></C><C></C><C><UP> ↓</UP><SUB><UP>&Dgr;G<SUB>solv</SUB></UP>(R · L)</SUB></C></R><R><C>[<UP>r</UP>]<SUB><UP>sln</UP></SUB></C><C><UP>+</UP></C><C>[<UP>l</UP>]<SUB><UP>sln</UP></SUB></C><C> <LIM><OP><ARROW>→</ARROW></OP><LL>&Dgr;G°</LL></LIM></C><C>[<UP>R  ·  L</UP>]<SUB><UP>sln</UP></SUB></C></R></AR>
In Scheme 1, lower case r and l denote the unbound receptor and ligand conformations, whereas upper case R and L denote the bound receptor and ligand conformations. Note that [x]sln and [x]gas, where x = r, l, or R · L, correspond to the same structure; viz., the relaxed "all-hydrogen" x-ray structure (see next section). The latter was energy minimized in the gas phase to yield [xmin]gas, which corresponds to a local energy minimum that is required to compute gas-phase vibrational frequencies and thus the vibrational entropy change accompanying binding (see Gas-phase Binding Free Energy).

The standard free energy change (Delta G0) for the noncovalent association of two molecules in solution according to Scheme 1 is given by
&Dgr;G°=&Dgr;&Dgr;G<SUB><UP>solv</UP></SUB>+&Dgr;&Dgr;G<SUB><UP>isom</UP></SUB>+&Dgr;G<SUB><UP>gas</UP></SUB>, (1)
where
&Dgr;&Dgr;G<SUB><UP>solv</UP></SUB>=&Dgr;G<SUB><UP>solv</UP></SUB>(R · L)−&Dgr;G<SUB><UP>solv</UP></SUB>(r)−&Dgr;G<SUB><UP>solv</UP></SUB>(l) (2)
and
&Dgr;&Dgr;G<SUB><UP>isom</UP></SUB>=&Dgr;G<SUB><UP>isom</UP></SUB>(R · L)−&Dgr;G<SUB><UP>isom</UP></SUB>(r)−&Dgr;G<SUB><UP>isom</UP></SUB>(l). (3)
The desolvation free energy, -Delta Gsolv, corresponds to the work of transferring the molecule in its solution conformation to the same conformation in the gas phase at 298 K. The isomerization free energy, -Delta Gisom, is the work of transforming the molecule in its x-ray conformation to a nearby local minimum structure in the gas phase. Delta Ggas is the standard free energy change per mole for the noncovalent association of the molecules in the gas phase at 298 K. The calculations of Delta Gsolv, Delta Gisom, and Delta Ggas are described below.

Structures

In computing the absolute free energy according to Scheme 1, x-ray structures of the free proteins and their respective complex were used (see Table 1). The ionizable groups in the free proteins and respective complex were protonated or deprotonated according to available experimental pKa values and the pH of crystallization. All aspartic acid and glutamic acid residues as well as COOH-terminal groups were deprotonated, whereas arginine, lysine, cysteines, and NH2-terminal groups were protonated. Histidine residues were protonated only if their side-chain nitrogen atoms were within 3.5 Å of a hydrogen acceptor in the crystal structures. However, for the proteins studied here, the histidine side chains were found not to be involved in hydrogen bonding. Thus, they were treated as neutral by protonating at Nepsilon 2 or Ndelta 1 according to their local environment in the crystal structure. In trypsin, His40 and His91 were protonated at Nepsilon 2, but His57 was protonated at Ndelta 1 (Jackson and Sternberg, 1995). BPTI has no histidine residues. For the Fv region of mouse monoclonal D1.3 antibody and lysozyme, all the histidines were protonated at Nepsilon 2 (Tanokura, 1983). Only one histidine is located in the interface region in the complex crystal structures; viz., His57 in the trypsin/BPTI complex.

First, hydrogen atoms were added to the crystal structure using the HBUILD module of the CHARMM program (Brooks et al., 1983). The resulting structure was subjected to several steps of minimization using steepest descent followed by adopted-basis Newton-Raphson with strong (10 kcal/mol/Å2) harmonic constraints on all heavy atoms. This relieved close contacts in the protein without disrupting its overall conformation (Philippopoulos and Lim, 1995), as evidenced by RMSDs from the respective crystal structure of <= 0.064 Å (see Table 2). The relaxed all-hydrogen x-ray structures, denoted by [r]sln, [lsln, and [R · L]sln, were used to compute the solvation free energies (see below). For the isomerization step, each all-hydrogen x-ray structure was energy minimized (initially with constraints, then without) using a dielectric constant of one for >= 2500 steps of steepest descent, followed by ~7500 steps of adopted-basis Newton-Raphson until the average force was <2 × 10-6 kcal/mol/Å. The fully minimized [rmin]gas, [lmin]gas, and [(R · L)min]gas structures, which remain close to their respective crystal structure (see Table 2), were used to obtain the gas-phase complexation energies and entropies. All energy minimizations were carried out using the CHARMM program (Brooks et al., 1983) and the version 22 all-hydrogen forcefield (MacKerell et al., 1998).


                              
View this table:
[in this window]
[in a new window]
 
TABLE 2   RMSD of heavy atoms from crystal structures*

In computing Delta G0 using the rigid-binding approximation, the starting conformations of the unbound r and l proteins were obtained from the relaxed all-hydrogen x-ray structure of the R · L complex; i.e., [r]sln = [R]sln and [l]sln = [L]sln. These rigid free structures exhibit heavy atom RMSDs from the respective crystal structures that range from 0.97 to 1.29 Å (see Table 2). The latter value is greater than the motion of protein crystal atoms, 0.3-0.5 Å, derived from their B-factors (Froloff et al., 1997; Luzzati, 1952). Thus, in the rigid-binding approximation, Delta G0 is computed according to
<AR><R><C>[<UP>R</UP><SUP><UP>min</UP></SUP>]<SUB><UP>gas</UP></SUB></C><C><UP>+</UP></C><C>[<IT>L</IT><SUP><UP>min</UP></SUP>]<SUB><UP>gas</UP></SUB></C><C> <LIM><OP><ARROW>→</ARROW></OP><UL>&Dgr;G<SUB><UP>gas</UP></SUB></UL></LIM> </C><C>[(<UP>R · L</UP>)<SUP><UP>min</UP></SUP>]<SUB><UP>gas</UP></SUB></C></R><R><C><UP> ↑</UP><SUP><UP>−&Dgr;G</UP><SUB><UP>isom</UP></SUB>(R)</SUP></C><C></C><C><UP> ↑</UP><SUP><UP>−&Dgr;G<SUB>isom</SUB></UP>(L)</SUP></C><C></C><C><UP> ↓</UP><SUB><UP>&Dgr;G</UP><SUB><UP>isom</UP></SUB>(R · L)</SUB></C></R><R><C>[<UP>R</UP>]<SUB><UP>gas</UP></SUB></C><C><UP>+</UP></C><C>[<UP>L</UP>]<SUB><UP>gas</UP></SUB></C><C></C><C>[R  ·  L]<SUB><UP>gas</UP></SUB></C></R><R><C><UP> ↑</UP><SUP><UP>−&Dgr;G<SUB>solv</SUB></UP>(R)</SUP></C><C></C><C><UP> ↑</UP><SUP><UP>−&Dgr;G<SUB>solv</SUB></UP>(L)</SUP></C><C></C><C><UP> ↓</UP><SUB><UP>&Dgr;G<SUB>solv</SUB></UP>(R · L)</SUB></C></R><R><C>[<UP>R</UP>]<SUB><UP>sln</UP></SUB></C><C><UP>+</UP></C><C>[<UP>L</UP>]<SUB><UP>sln</UP></SUB></C><C> <LIM><OP><ARROW>→</ARROW></OP><LL>&Dgr;G° </LL></LIM></C><C>[<UP>R  ·  L</UP>]<SUB><UP>sln</UP></SUB></C></R></AR>
The Delta G0 was also evaluated using free receptor and free ligand structures that have been modeled from the respective complex by minimizing the receptor R and ligand L conformations using CHARMM with a distance-dependent dielectric constant and constraints on all heavy atoms (denoted by min + cons in Scheme 3). In this case, [r]sln = [Rmin+cons]sln and [l]sln = [Lmin+cons]sln. Their heavy-atom RMSDs from the respective crystal structures are similar to those of the rigid structures (see Table 2).
<AR><R><C>[<UP>R</UP><SUP><UP>min</UP></SUP>]<SUB><UP>gas</UP></SUB></C><C><UP>+</UP></C><C>[<IT>L</IT><SUP><UP>min</UP></SUP>]<SUB><UP>gas</UP></SUB></C><C> <LIM><OP><ARROW>→</ARROW></OP><UL>&Dgr;G<SUB><UP>gas </UP></SUB></UL></LIM> </C><C>[(<UP>R · L</UP>)<SUP><UP>min</UP></SUP>]<SUB><UP>gas</UP></SUB></C></R><R><C><UP> ↑</UP><SUP><UP>−&Dgr;G</UP><SUB><UP>isom</UP></SUB>(R<SUP><UP>min+cons</UP></SUP>)</SUP></C><C></C><C><UP> ↑</UP><SUP><UP>−&Dgr;G<SUB>isom</SUB></UP>(L<SUP><UP>min+cons</UP></SUP>)</SUP></C><C></C><C><UP> ↓</UP><SUB><UP>&Dgr;G</UP><SUB><UP>isom</UP></SUB>(R · L)</SUB></C></R><R><C>[<UP>R<SUP>min+cons</SUP></UP>]<SUB><UP>gas</UP></SUB></C><C><UP>+</UP></C><C>[<UP>L</UP><SUP><UP>min+cons</UP></SUP>]<SUB><UP>gas</UP></SUB></C><C></C><C>[R  ·  L]<SUB><UP>gas</UP></SUB></C></R><R><C><UP> ↑</UP><SUP><UP>−&Dgr;G<SUB>solv</SUB></UP>(R<SUP><UP>min+cons</UP></SUP>)</SUP></C><C></C><C><UP> ↑</UP><SUP><UP>−&Dgr;G<SUB>solv</SUB></UP>(L<SUP><UP>min+cons</UP></SUP>)</SUP></C><C></C><C><UP> ↓</UP><SUB><UP>&Dgr;G<SUB>solv</SUB></UP>(R · L)</SUB></C></R><R><C>[<UP>R<SUP>min+cons</SUP></UP>]<SUB><UP>sln</UP></SUB></C><C><UP>+</UP></C><C>[<UP>L<SUP>min+cons</SUP></UP>]<SUB><UP>sln</UP></SUB></C><C> <LIM><OP><ARROW>→</ARROW></OP><LL>&Dgr; G°</LL></LIM></C><C>[<UP>R  ·  L</UP>]<SUB><UP>sln</UP></SUB></C></R></AR>

Calculations

Solvation free energy Delta Gsolv

The standard solvation free energy, Delta Gsolv, can be expressed as a sum of 3 terms,
&Dgr;G<SUB><UP>solv</UP></SUB>=&Dgr;G<SUP><UP>cav</UP></SUP><SUB><UP>solv</UP></SUB>+&Dgr;G<SUP><UP>vdW</UP></SUP><SUB><UP>solv</UP></SUB>+&Dgr;G<SUP><UP>elec</UP></SUP><SUB><UP>solv</UP></SUB> (4)
The first two terms in Eq. 4 constitute the nonelectrostatic contribution (Delta G<UP><SUB>solv</SUB><SUP>nonel</SUP></UP>) to the solvation free energy due to the work required to create the solute cavity in solution (Delta G<UP><SUB>solv</SUB><SUP>cav</SUP></UP>) and to vdW interactions between the solute and solvent (Delta G<UP><SUB>solv</SUB><SUP>vdW</SUP></UP>). The last term in Eq. 4 gives the electrostatic contribution (Delta G<UP><SUB>solv</SUB><SUP>elec</SUP></UP>) to the solvation free energy.

The nonelectrostatic solvation free energy, Delta G<UP><SUB>solv</SUB><SUP>nonel</SUP></UP>, was approximated by a linear function of the SASA (Lee and Richards, 1971),
&Dgr;G<SUP><UP>nonel</UP></SUP><SUB><UP>solv</UP></SUB>=&Dgr;G<SUP><UP>vdW</UP></SUP><SUB><UP>solv</UP></SUB>+&Dgr;G<SUP><UP>cav</UP></SUP><SUB><UP>solv</UP></SUB> (5)

≈(&ggr;<SUP><UP>vdW</UP></SUP><UP>+&ggr;<SUP>cav</SUP></UP>)×<UP>SASA=&ggr; × SASA</UP>.
gamma  was set to 7.2 cal/mol/Å2. This value in Eq. 5 and the GB model (Eq. 6 below) could reproduce the experimental hydration free energies of a wide range of neutral small molecules (Still et al., 1990) and the experimental free energies of dihydrofolate reductase and trypsin binding (Zou et al., 1999). It is also in accord with that (6.4 ± 1.7 cal/mol/Å2) (Friedman and Honig, 1995) derived from the partition coefficients of linear alkanes between water and the gas phase using AMBER radii to compute the SASA. gamma vdW was set to -38.8 cal/mol/Å2 (Jayaram et al., 1999), the value obtained from experimental vaporization enthalpies of hydrocarbons (Ohtaki and Fukushima, 1992). The difference between gamma  and gamma vdW yields gamma cav = 46.0 cal/mol/Å2 (Sharp et al., 1991). The SASA of the molecule was computed using the GEPOL program (Pascual-Ahuir and Silla, 1990) and CHARMM vdW radii (MacKerell et al., 1998). Hence, Delta Delta G<UP><SUB>solv</SUB><SUP>nonel</SUP></UP> is proportional to the loss in SASA at the protein-protein interface.

The electrostatic solvation free energy, Delta G<UP><SUB>solv</SUB><SUP>elec</SUP></UP>, was computed using two continuum solvent approaches: the analytical GB model (Qiu et al., 1997; Still et al., 1990) and numerical PB methods (Honig and Nicholls, 1995). The former is denoted by Delta G<UP><SUB>solv,GB</SUB><SUP>elec</SUP></UP>, whereas the latter by Delta G<UP><SUB>solv,PB</SUB><SUP>elec</SUP></UP>. In the GB model (Still et al., 1990), Delta G<UP><SUB>solv,GB</SUB><SUP>elec</SUP></UP> is expressed as a sum of Coulombic interactions between each pair of charges (qi, qj) in a solvent of dielectric constant epsilon  and the Born self solvation energy of each individual charge,
&Dgr;G<SUP><UP>elec</UP></SUP><SUB><UP>solv,GB</UP></SUB>=−166<FENCE>1−<FR><NU>1</NU><DE>ϵ</DE></FR></FENCE><LIM><OP>∑</OP><LL><UP>i=1</UP></LL><UL><UP>n</UP></UL></LIM> <LIM><OP>∑</OP><LL><UP>j=1</UP></LL><UL><UP>n</UP></UL></LIM> <FR><NU>q<SUB><UP>i</UP></SUB>q<SUB><UP>j</UP></SUB></NU><DE>f<SUB><UP>GB</UP></SUB></DE></FR>,

f<SUB><UP>GB</UP></SUB>=<RAD><RCD>r<SUP>2</SUP><SUB><UP>ij</UP></SUB>+&agr;<SUB><UP>i</UP></SUB>&agr;<SUB><UP>j</UP></SUB><UP>exp</UP> <FENCE>−<FR><NU>r<SUP><UP>2</UP></SUP><SUB><UP>ij</UP></SUB></NU><DE>4&agr;<SUB><UP>i</UP></SUB>&agr;<SUB><UP>j</UP></SUB></DE></FR></FENCE></RCD></RAD>. (6)
fGB is an effective distance function that depends on the interatomic distances (rij) and the effective Born radii of atoms i and j (alpha ialpha j). In computing the electrostatic solvation free energies, epsilon  was set to 80, the dielectric constant for bulk water, and the atomic charges were taken from the CHARMM version 22 forcefield (MacKerell et al., 1998). The effective Born radii were computed using an empirical scheme first proposed by Still and co-workers (Qiu et al., 1997) and modified for protein solvation calculations by Dominy and Brooks (1999).

The effective Born radius of an atom depends on the atom's specific molecular environment (Babu and Lim, 1999, 2001). It is defined as the atomic radius that would give the Born solvation free energy (Born, 1920) of the atom with unit charge, if all other atoms in the molecule were uncharged and served only to displace the solvent dielectric medium; i.e.,
&agr;<SUB><UP>i</UP></SUB>=−<FR><NU>166</NU><DE>G<SUB><UP>pol,i</UP></SUB></DE></FR>, (7)
where
G<SUB><UP>pol,i</UP></SUB>=<FENCE>1−<FR><NU>1</NU><DE>ϵ</DE></FR></FENCE> <FENCE><FR><NU>1</NU><DE>&lgr;</DE></FR><FENCE>−<FR><NU>166</NU><DE>R<SUB><UP>vdW,i</UP></SUB></DE></FR></FENCE>+P<SUB>1</SUB><FENCE><FR><NU>166</NU><DE>R<SUP><UP>2</UP></SUP><SUB><UP>vdW,i</UP></SUB></DE></FR></FENCE></FENCE> (8)

<FENCE>+<LIM><OP>∑</OP><LL><UP>j</UP></LL><UL><UP>Bond</UP></UL></LIM> <FR><NU>P<SUB>2</SUB>V<SUB><UP>j</UP></SUB></NU><DE>r<SUP><UP>4</UP></SUP><SUB><UP>ij</UP></SUB></DE></FR>+<LIM><OP>∑</OP><LL><UP>j</UP></LL><UL><UP>Angle</UP></UL></LIM> <FR><NU>P<SUB>3</SUB>V<SUB><UP>j</UP></SUB></NU><DE>r<SUP>4</SUP><SUB><UP>ij</UP></SUB></DE></FR>+<LIM><OP>∑</OP><LL><UP>j</UP></LL><UL><UP>Nonbond</UP></UL></LIM> <FR><NU>P<SUB>4</SUB>V<SUB><UP>j</UP></SUB></NU><DE>r<SUP><UP>4</UP></SUP><SUB><UP>ij</UP></SUB></DE></FR> <UP>CCF</UP></FENCE>
and
<UP>CCF</UP>=1.0, <UP>if</UP> <FENCE><FR><NU>r<SUB><UP>ij</UP></SUB></NU><DE>(R<SUB><UP>vdW,i</UP></SUB>+R<SUB><UP>vdW,j</UP></SUB>)</DE></FR></FENCE><SUP>2</SUP>><FR><NU>1</NU><DE>P<SUB>5</SUB></DE></FR>, (9a)

<UP>CCF</UP>=<FENCE>0.5<FENCE>1.0−<UP>cos</UP><FENCE><FENCE><FR><NU>r<SUB><UP>ij</UP></SUB></NU><DE>(R<SUB><UP>vdW,i</UP></SUB>+R<SUB><UP>vdW,j</UP></SUB>)</DE></FR></FENCE><SUP>2</SUP> P<SUB>5</SUB>&pgr;</FENCE></FENCE></FENCE><SUP>2</SUP> (9b)

<UP>if</UP> <FENCE><FR><NU>r<SUB><UP>ij</UP></SUB></NU><DE>(R<SUB><UP>vdW,i</UP></SUB> + R<SUB><UP>vdW,j</UP></SUB>)</DE></FR></FENCE><SUP>2</SUP>≤ <FR><NU>1</NU><DE>P<SUB>5</SUB></DE></FR>
In Eq. 8, rij is the distance from the point charge of interest, i, to an uncharged atom j in the molecule; RvdW,i is the vdW radius of atom i, and Vj = <FR><NU>4</NU><DE>3</DE></FR>pi R<UP><SUB>vdW,j</SUB><SUP>3</SUP></UP> is the volume of atom j. The vdW radii were taken from the CHARMM version 22 all-hydrogen forcefield with the polar hydrogen radii set to 0.8 Å following previous works (Dominy and Brooks, 1999; Qiu et al., 1997). P1, P2, P3, P4, P5 and lambda  are fitting parameters. They have been optimized by Dominy and Brooks (1999) for a combined protein and nucleic acid structure database and the CHARMM version 22 forcefield. The parameters used in this work are P1 = 0.448, P2 = 0.173, P3 = 0.013, P4 = 9.015, P5 = 0.9, and lambda  = 0.705 (Dominy and Brooks, 1999). The reader is referred to the original works for the derivation of Eq. 8 (Qiu et al., 1997) and the fitting procedure to obtain the parameters (Dominy and Brooks, 1999).

The Delta G<UP><SUB>solv</SUB><SUP>elec</SUP></UP> energies have also been evaluated by solving numerically the linearized Poisson-Boltzmann equation using the DelPhi program (Gilson and Honig, 1988). The protein was treated as a low dielectric medium (epsilon int = 1) surrounded by a high dielectric solvent (epsilon out = 80 for water). The ionic strength was set to zero, but increasing it to 0.1 M had little effect on the magnitude of the solvation free energy, as found in previous works (Froloff et al., 1997; Jackson and Sternberg, 1995). The low-dielectric region of the protein was defined as the region inaccessible to contact by a 1.4-Å sphere rolling over the molecular surface, defined by the atomic coordinates and vdW radii. The vdW radii and atomic charges used in the DelPhi calculations were taken from the CHARMM version 22 forcefield. The electrostatic potentials were calculated on a cubic grid with a spacing of 0.5 Å and a 90% grid fill to avoid grid boundary artifacts. The difference between the electrostatic potential calculated in solution (epsilon out = 80) and in the gas phase (epsilon out = 1) yielded the electrostatic contribution to the solvation free energy.

Isomerization free energy Delta Gisom

The isomerization free energy for each molecule was estimated from the energy difference between the relaxed all-hydrogen structure and the respective fully minimized structure (see above); i.e.,
&Dgr;G<SUB><UP>isom</UP></SUB>≈&Dgr;E<SUB><UP>isom</UP></SUB>=<UP>E</UP>(X)−<UP>E</UP>(X<SUP><UP>min</UP></SUP>), (10)
where X = r, l, or R · L, and
E≈E<SUP><UP>vdW</UP></SUP>+E<SUP><UP>elec</UP></SUP> (11)
The energy E was calculated using the CHARMM program (Brooks et al., 1983) with the version 22 all-hydrogen forcefield (MacKerell et al., 1998) with epsilon  = 1 and an atom-based force-shifting function, which shifts the nonbonded forces to zero at 12 Å.

Gas-phase binding free energy, Delta Ggas

The standard gas-phase free energy change is given by
&Dgr;G<SUB><UP>gas</UP></SUB>=&Dgr;E<SUB><UP>gas</UP></SUB>+p&Dgr;V (12)

−T(&Dgr;S<SUP><UP>trans</UP></SUP>+&Dgr;S<SUP><UP>rot</UP></SUP>+&Dgr;S<SUP><UP>vib</UP></SUP>+&Dgr;S<SUP><UP>conf</UP></SUP>)
where the intramolecular energy change, Delta Egas is computed from Eqs. 11 and 13,
&Dgr;E<SUB><UP>gas</UP></SUB>=E((R · L)<SUP><UP>min</UP></SUP>)−E(r<SUP><UP>min</UP></SUP>)−E(l<SUP><UP>min</UP></SUP>) (13)
and pDelta V = -RT = -0.6 kcal.mol at 298 K.

The gas-phase translational and rotational entropies for the free proteins and their complex were calculated from ideal gas partition functions (QtransQrot) using classical statistical mechanics (McQuarrie, 1976).
TS<SUP><UP>trans</UP></SUP>=<FENCE><FR><NU>5</NU><DE>2</DE></FR>+<FR><NU>3</NU><DE>2</DE></FR> <UP>ln</UP><FENCE><FR><NU>2&pgr;mk<SUB><UP>B</UP></SUB>T</NU><DE>h<SUP>2</SUP></DE></FR></FENCE>−<UP>ln</UP>(&rgr;)</FENCE>k<SUB><UP>B</UP></SUB>T, (14)

TS<SUP><UP>rot</UP></SUP>=<FENCE><FR><NU>3</NU><DE>2</DE></FR>+<FR><NU>1</NU><DE>2</DE></FR> <UP>ln</UP>(&pgr;I<SUB><UP>a</UP></SUB>I<SUB><UP>b</UP></SUB>I<SUB><UP>c</UP></SUB>)+<FR><NU>3</NU><DE>2</DE></FR> <UP>ln</UP><FENCE><FR><NU>8&pgr;<SUP>2</SUP>k<SUB><UP>B</UP></SUB>T</NU><DE>h<SUP>2</SUP></DE></FR></FENCE>−<UP>ln</UP>(&sfgr;)</FENCE>k<SUB><UP>B</UP></SUB>T.
In Eq. 14, m is the total mass of the molecule, kB is Boltzmann's constant, T is the temperature (298 K), h is Planck's constant, rho  is the number density (1 M per liter), IA, IB, IC are the 3 principal moments of inertia, and sigma  is the symmetry number (which is equal to 1 for nonsymmetric molecules).

The gas-phase vibrational entropies for the free proteins and the respective complex were calculated from normal mode frequencies nu j based on the expression (McQuarrie, 1976):
TS<SUB><UP>vib</UP></SUB>=<LIM><OP>∑</OP><LL><UP>j</UP>=1</LL><UL>3<UP>N</UP>−6</UL></LIM><FR><NU>hv<SUB><UP>j</UP></SUB></NU><DE>e<SUP><UP>h&ngr;<SUB>j</SUB>/k<SUB>B</SUB>T</UP></SUP><UP>−1</UP></DE></FR>−k<SUB><UP>B</UP></SUB>T <UP>ln</UP>(1−e<SUP>−<UP>h&ngr;<SUB>j</SUB>/k<SUB>B</SUB>T</UP></SUP>). (15)
The normal mode frequencies nu j were computed using the CHARMM program (Brooks et al., 1983) with the version 22 all-hydrogen forcefield (MacKerell et al., 1998) using an iterative diagonalization scheme (Janezic and Brooks, 1995), a dielectric constant of one, and a cutoff of 12 Å to truncate the nonbonded forces. For each structure, there were six zero-frequency modes corresponding to overall translational and rotational motions and no imaginary frequencies.

The conformational entropy change, Delta Sconf, was approximated by the loss of side-chain conformational entropy upon binding, which, in turn, was estimated using the empirical scale of Pickett and Sternberg (1993, 1994). This model assumes that solvent-accessible side chains (with relative SASA > 60%) populate different rotamers, whereas buried side chains (with relative SASA <=  60%) are restricted to one rotamer. The conformation entropy of a given side chain r(Sconf,r) is given by,
S<SUP><UP>conf,r</UP></SUP>=−R<LIM><OP>∑</OP><LL><UP>i=1</UP></LL><UL><UP>N</UP></UL></LIM> p<SUB><UP>i</UP></SUB><UP> ln</UP>(p<SUB><UP>i</UP></SUB>), (16)
where pi is the probability of the side chain in rotameric state i. The values of pi were obtained from the observed distribution of side-chain rotamers in 50 nonhomologous protein crystal structures, taking into account the effects of symmetry and free rotation (Pickett and Sternberg, 1993).


    RESULTS
TOP
ABSTRACT
INTRODUCTION
METHODS
RESULTS
DISCUSSION
CONCLUSIONS
REFERENCES

In computing Delta G0 according to Schemes 1-3, two assumptions were made: the free energy components in Eq. 1 were assumed to be additive; and they were based on a single time-averaged x-ray structure rather than ensemble averaged. Hence, the present scheme is best suited for the noncovalent binding of two proteins that undergo only slight conformational changes to form a high affinity complex, as in the case here (see Table 1). However, for the binding of flexible peptides to proteins, free energy estimates based on a single conformation of the unbound and bound species may not be appropriate.

Tables 3 and 4 summarize the results for D1.3·Hel and trypsin·BPTI binding, respectively. In each case, the binding free energy was computed using three different sets of structures for the free proteins (see Methods): the relaxed all-hydrogen x-ray structures (Scheme 1); rigid [r]sln = [R]sln and [l]sln = [L]sln structures (Scheme 2); and [r]sln = [Rmin+cons]sln and [l]sln = [Lmin+cons]sln (Scheme 3). In what follows, we will first compare the free energies computed using the relaxed all-hydrogen x-ray structures with experiment. Agreement with experiment then allows us to assess the relative contributions from protein-protein versus protein-solvent versus solvent-solvent interactions and electrostatic versus vdW forces to the binding free energy. We then evaluate the accuracy of binding free energies computed using rigid as well as [Rmin+cons]sln and [Lmin+cons]sln free protein structures.


                              
View this table:
[in this window]
[in a new window]
 
TABLE 3   Free energy and its components for the binding of the Fv region of mouse monoclonal antibody D1.3 to hen egg-white lysozyme (Hel)*


                              
View this table:
[in this window]
[in a new window]
 
TABLE 4   Free energy and its components for the binding of trypsin to BPTI*

Comparison between computed and experimental Delta Gexp: GB versus PB

Tables 3 and 4 show that the Delta G<UP><SUB>GB</SUB><SUP>0</SUP></UP> (-11.4 kcal/mol for D1.3·Hel and -18.6 kcal/mol for trypsin·BPTI) based on [r]sln, [l]sln, and [R · L]sln relaxed x-ray structures and Delta Delta G<UP><SUB>solv,GB</SUB><SUP>elec</SUP></UP> are in excellent agreement with the experimental values (-11.4 and -18.1 kcal/mol). In contrast, the Delta G<UP><SUB>PB</SUB><SUP>0</SUP></UP> based on [r]sln, [l]sln, and [R · L]sln relaxed x-ray structures and Delta Delta G<UP><SUB>solv,PB</SUB><SUP>elec</SUP></UP> exhibit larger deviations from the experimental numbers than Delta G<UP><SUB>GB</SUB><SUP>0</SUP></UP>, whereas the Delta G<UP><SUB>GB</SUB><SUP>0</SUP></UP> and Delta G<UP><SUB>PB</SUB><SUP>0</SUP></UP> differ by roughly 20%. The remarkable agreement between Delta G<UP><SUB>GB</SUB><SUP>0</SUP></UP> and experiment indicates that systematic errors involved in computing free energies/energies in the right and left legs of Scheme 1 have largely cancelled. Because the Delta G<UP><SUB>solv,GB</SUB><SUP>elec</SUP></UP> based on x-ray structures are more negative than the respective Delta G<UP><SUB>solv,PB</SUB><SUP>elec</SUP></UP> by <= 7.6%, but the CPU time needed to compute Delta G<UP><SUB>solv,GB</SUB><SUP>elec</SUP></UP> is 5 to 6 times less than that for Delta G<UP><SUB>solv,PB</SUB><SUP>elec</SUP></UP>, the GB formulation together with Scheme 1 appears to be an efficient and reliable way of computing binding free energies and obtaining trends (see below).

Solvation versus gas-phase contributions to binding affinity

Scheme 1 enables the net binding free energy to be dissected into solvation (protein-solvent and solvent-solvent) versus gas-phase (protein-protein) contributions. The latter, which is a sum of Delta Ggas and Delta Delta Gisom terms, is negative, thus favoring complexation, whereas Delta Delta Gsolv is positive and opposes binding (Tables 3 and 4). For both systems, gas-phase vdW and electrostatic protein-protein interactions favor binding (Delta E<UP><SUB>gas</SUB><SUP>vdW</SUP></UP> and Delta E<UP><SUB>gas</SUB><SUP>elec</SUP></UP> negative). Note that Delta E<UP><SUB>gas</SUB><SUP>elec</SUP></UP> is negative even though the receptor and ligand have net positive charges (Table 1). The solvent-solvent cavitation term (Eq. 5) also favors protein-protein complexation (Delta Delta G<UP><SUB>solv</SUB><SUP>cav</SUP></UP> negative) as less work is required to create the complex cavity than the free protein cavities in solution. In contrast to the favorable protein-protein and solvent-solvent interactions, vdW and electrostatic protein-solvent interactions generally oppose binding (Delta Delta G<UP><SUB>solv</SUB><SUP>vdW</SUP></UP> and Delta Delta G<UP><SUB>solv</SUB><SUP>elec</SUP></UP> positive) due to the cost of desolvating the unbound proteins.

Decomposition of Delta G0 into component energies

Tables 3 and 4 show that, for both systems, Delta Delta G<UP><SUB>solv</SUB><SUP>nonel</SUP></UP> (-10.4 kcal/mol for D1.3·Hel and -12.0 kcal/mol for trypsin·BPTI) roughly cancels the gas-phase conformational entropy term, -TDelta Sconf (9.3 kcal/mol for D1.3·Hel and 11.9 kcal/mol for trypsin·BPTI). This finding can be rationalized if Delta Delta G<UP><SUB>solv</SUB><SUP>nonel</SUP></UP> approx  -TDelta Delta Ssolvent, based on the fact that Delta Delta G<UP><SUB>solv</SUB><SUP>nonel</SUP></UP> is related to the hydrophobic effect, which, at room temperature, is dominated by the solvent entropy term. The observed cancellation can then be attributed to the increase in the solvent entropy upon complexation (as protein side chains at the interface release water molecules), and the concomitant decrease in the solute entropy (as protein side chains at the interface lose torsional degrees of freedom upon interacting with other residues) (Novotny et al., 1989). To verify whether the observed cancellation of Delta Delta G<UP><SUB>solv</SUB><SUP>nonel</SUP></UP> and TDelta Sconf is general, these two quantities were also computed for other systems for which x-ray structures are available for both the free and bound proteins (although the structures for these systems may not be as accurate because they do not satisfy all the criteria specified in the Introduction, Delta Delta G<UP><SUB>solv</SUB><SUP>nonel</SUP></UP> and TDelta Sconf are not very sensitive to the structures because they depend on the SASA, see Tables 3 and 4). The results in Table 5 show that Delta Delta G<UP><SUB>solv</SUB><SUP>nonel</SUP></UP> and -TDelta Sconf oppose each other and their difference is less than 1.5 kcal/mol, indicating that Delta Delta G<UP><SUB>solv</SUB><SUP>nonel</SUP></UP> and -TDelta Sconf generally offset one another. It should be emphasized that the observed Delta Delta G<UP><SUB>solv</SUB><SUP>nonel</SUP></UP> and TDelta Sconf compensation rests on the magnitude of the coefficient gamma , 7.2 cal/mol/Å2, used in this work (see below).


                              
View this table:
[in this window]
[in a new window]
 
TABLE 5   Changes in the side-chain conformational entropy (TDelta Sconf) and nonelectrostatic solvation free energy (Delta Delta G<UP><SUB>solv</SUB><SUP>nonel</SUP></UP>) in various protein-protein associations

Because the Delta Delta G<UP><SUB>solv</SUB><SUP>nonel</SUP></UP> and -TDelta Sconf terms offset one another in Tables 3 and 4, the remaining components of the binding affinity can be partitioned into net gas-phase protein-protein vdW effects (Delta Evdw), net protein-protein and protein-solvent electrostatic effects (Delta Gelec), and the sum of translational, rotational, and vibrational entropy changes (-TDelta Strans+rot+vib). The magnitudes of Delta EvdW, Delta Gelec, and TDelta Strans+rot+vib in Tables 3 and 4 are sizable, indicating that all three components contribute to the overall binding affinity, so that
&Dgr;G°∼&Dgr;E<SUP><UP>vdw</UP></SUP>+&Dgr;G<SUP><UP>elec</UP></SUP>−T&Dgr;S<SUP><UP>trans+rot+vib</UP></SUP>. (17)
The Delta G<UP><SUB>GB</SUB><SUP>0</SUP></UP> computed using Eq. 17 are -9.7 kcal/mol for D1.3·Hel and -17.9 kcal/mol for trypsin·BPTI, which underestimate the experimental numbers by 1.7 and 0.2 kcal/mol, respectively. Of the three terms in Eq. 17, only the gas-phase protein-protein vdW interactions favor binding (Delta Evdw negative). Furthermore, the magnitude of Delta Evdw is larger than that of Delta Gelec or TDelta Strans+rot+vib for both D1.3·Hel and trypsin·BPTI complexation. These findings are consistent with the experimental observation that enthalpy drives D1.3·Hel binding from measurements of enthalpy and entropy changes by titration calorimetry (Bhat et al., 1994). Hence, close packing and shape complementarity play important roles in noncovalent protein association. The net electrostatic interactions generally oppose formation of complexes with an interface area between 1400 and 1700 Å2 (Delta Gelec positive). This finding does not mean that electrostatic interactions are unimportant for protein binding, but shows that the gain in hydrogen bonding and charge-charge protein-protein interactions across the interface is offset by the loss of favorable electrostatic protein-solvent interactions. Although translational and rotational entropy loss inevitably oppose complexation (Delta Strans+rot negative), vibrational entropy can favor or disfavor protein-protein association: Delta Svib is negative for D1.3·Hel complexation, but is positive for trypsin-BPTI binding (see also Discussion).

Because the magnitude of the TDelta Strans+rot term is similar for two ligands of roughly equal volumes, l and l', binding to a common receptor r, their binding free energy difference will be given by
&Dgr;&Dgr;G°∼&Dgr;E<SUP><UP>vdw</UP></SUP><UP>−&Dgr;</UP>E′<SUP><UP>vdw</UP></SUP>+&Dgr;G<SUP><UP>elec</UP></SUP> (18)

<UP>−&Dgr;</UP>G<UP>′</UP><SUP><UP>elec</UP></SUP>+T&Dgr;S<SUP><UP>vib</UP></SUP>−T&Dgr;S′<SUP><UP>vib</UP></SUP>,
where the prime denotes the thermodynamic change for binding of l'. Hence, the discrimination between two similarly shaped ligands, which defines specificity, is governed by close packing of the protein surfaces so that the proper hydrogen bonds can be formed (McCammon, 1998). Eq 18 shows that the affinity of a drug ligand l' for its receptor protein with an interface area between 1400 and 1700 Å2 can be improved by mutations that increase the magnitude of Delta E'vdw, but decrease the magnitude of Delta G'elec.

Evaluation of the rigid-binding approximation

The RMSDs of the heavy atoms in the free x-ray structure from the corresponding complex crystal structure is <1.3 Å for both systems studied (Table 1). Therefore, the structural changes upon complexation are relatively small and it seems reasonable, to a first approximation, to neglect any conformational changes upon protein-protein association; i.e., to assume [r]sln = [R]sln and [l]sln = [L]sln. In fact, previous studies used this rigid-binding approximation to compute the free energy for D1.3·Hel and trypsin·BPTI binding (Tables 3 and 4). The binding free energy computed using Scheme 2 does not agree with the experimental value (Tables 3 and 4, column 3), and is even quite positive for the binding of D1.3 antibody to Hel. The discrepancy between theory and experiment arises from both protein-protein as well as protein-solvent electrostatic interactions. The rigid free receptor and free ligand conformations isomerize to local minima, which have less favorable electrostatic interactions than the local minima derived from the relaxed x-ray structures of the free proteins. Consequently, the Delta E<UP><SUB>gas</SUB><SUP>elec</SUP></UP> values computed using the rigid free receptor and free ligand structures (-242 and -315 kcal/mol) are more negative than the respective numbers in the second column of Tables 3 and 4 (-106 and -228 kcal/mol). In contrast, the Delta Delta G<UP><SUB>solv</SUB><SUP>elec</SUP></UP> values computed using the rigid free structures (119 and 198 kcal/mol) are more positive than the respective numbers in column 2 of Tables 3 and 4 (98 and 85 kcal/mol).

In the case of D1.3 antibody binding to Hel, the relative component contributions to the binding free energy computed using Scheme 2 do not agree with those obtained using Scheme 1. In particular, Scheme 2 predicts that the dominant contribution to the net binding free energy for D1.3·Hel complexation is not Delta EvdW, but the entropy term, TDelta Strans+rot+vib, in contrast to Scheme 1 and experimental observations (see above). These results show that, even if the structural changes upon complexation are known to be small, the rigid-binding approximation (Scheme 2) will generally not yield accurate binding free energy, and may also not reveal the dominant contribution to the binding free energy.

Evaluation of free Rmin+cons receptor and free Lmin+cons ligand structures

In cases where the free receptor or free ligand structures have not been experimentally solved but their complex crystal structure is known, the former can be predicted from the latter by minimizing the receptor and ligand conformations in the complex (see Structures section above) provided that binding results in minimal conformational changes. Like the rigid structures, the [r]sln = [Rmin+cons]sln and [l]sln = [Lmin+cons]sln structures predict a positive binding free energy for D1.3·Hel complexation, whereas they severely overestimate the magnitude of the free energy for trypsin·BPTI binding. However, they yield trends in the relative component contributions to the binding free energy that are similar to those based on the relaxed x-ray structures (Tables 3 and 4). The magnitude of the component contributions based on the [Rmin+cons]sln and [Lmin+cons]sln structures are in-between the respective numbers based on the x-ray and rigid structures (Tables 3 and 4). This suggests that approximating [r]sln and [l]sln by [Rmin+cons]sln and [Lmin+cons]sln, respectively, in cases where slight conformational rearrangements accompany complexation, as in the cases studied here, can reveal qualitative features in the binding free energies.

Dependence of component free energies on free and complex structures

The results using Schemes 1-3 show that accurate structures are needed to obtain reliable absolute binding free energies (Sharp, 1998). This sensitivity of the atomic coordinates is expected because electrostatic and vdW interactions depend on the interatomic distances. Hence, the Delta Evdw and Delta Gelec terms computed using different free structures vary significantly (see Tables 3 and 4). The vibrational entropy term is also sensitive to the local minimum structure used to compute the frequencies (see Table 6). In contrast, the Delta Delta G<UP><SUB>solv</SUB><SUP>nonel</SUP></UP> and TDelta Sconf terms computed using different free structures do not vary as much as they depend on the solvent-accessible solvent surface area. These findings are in accord with previous works (Froloff et al., 1997; Jackson and Sternberg, 1995; Novotny et al., 1997).


                              
View this table:
[in this window]
[in a new window]
 
TABLE 6   Lowest four nonzero vibrational frequencies (cm-1) and vibrational entropies (TS in kcal/mol) from normal mode calculations for the fully minimized structures*


    DISCUSSION
TOP
ABSTRACT
INTRODUCTION
METHODS
RESULTS
DISCUSSION
CONCLUSIONS
REFERENCES

Below, we compare our above findings with results obtained in previous studies for the same or homologous systems and for other protein-protein complexes.

Comparison of the various free energy decomposition schemes

The scheme used here to compute the free energy for protein-protein complexation is closest in spirit to that used by Jayaram et al. (1999) for protein-DNA complexation. The key difference lies in the isomerization step in Scheme 1, which was introduced to obtain local-minimum structures (with no imaginary frequencies) for normal mode frequency calculations. In contrast, Jayaram et al. computed first the isomerization/adaptation of the protein structure in the free state to the conformation in the complex state (the adaptation energy), and, subsequently, their desolvation (see Scheme 4). As a result, the gas-phase complexation step corresponds to species that are not in energy minima, thus their normal mode frequencies cannot be computed.