help button home button Biophys. J.
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS

This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Perera, L.
Right arrow Articles by Pedersen, L. G.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Perera, L.
Right arrow Articles by Pedersen, L. G.

Biophys J, December 2000, p. 2925-2943, Vol. 79, No. 6

Modeling Zymogen Protein C

Lalith Perera,* Charles Foley,* Thomas A. Darden,dagger Darrel Stafford,Dagger Timothy Mather,§ Charles T. Esmon,§ and Lee G. Pedersen*dagger

 *Department of Chemistry, University of North Carolina, Chapel Hill, North Carolina 27599-3290;  dagger National Institute of Environment Health Science, Research Triangle Park, North Carolina 27709;  Dagger Department of Biology, University of North Carolina, Chapel Hill, North Carolina 27599-3290;  §Howard Hughes Medical Institute, Oklahoma Medical Research Foundation, 825 NE13, Oklahoma City, Oklahoma 73104 USA




    ABSTRACT
TOP
ABSTRACT
INTRODUCTION
COMPUTATIONAL PROCEDURE
RESULTS AND DISCUSSION
SUMMARY
REFERENCES

A solution structure for the complete zymogen form of human coagulation protein C is modeled. The initial core structure is based on the x-ray crystallographic structure of the gamma -carboxyglutamic acid (Gla)-domainless activated form. The Gla domain (residues 1-48) is modeled from the x-ray crystal coordinates of the factor VIIa/tissue factor complex and oriented with the epidermal growth factor-1 domain to yield an initial orientation consistent with the x-ray crystal structure of porcine factor IXa. The missing C-terminal residues in the light chain (residues 147-157) and the activation peptide residues 158-169 were introduced using homology modeling so that the activation peptide residues directly interact with the residues in the calcium binding loop. Molecular dynamics simulations (Amber-particle-mesh-Ewald) are used to obtain the complete calcium-complexed solution structure. The individual domain structures of protein C in solution are largely unaffected by solvation, whereas the Gla-epidermal growth factor-1 orientation evolves to a form different from both factors VIIa and IXa. The solution structure of the zymogen protein C is compared with the crystal structures of the existing zymogen serine proteases: chymotrypsinogen, proproteinase, and prethrombin-2. Calculated electrostatic potential surfaces support the involvement of the serine protease calcium ion binding loop in providing a suitable electrostatic environment around the scissile bond for IIa/thrombomodulin interaction.



    INTRODUCTION
TOP
ABSTRACT
INTRODUCTION
COMPUTATIONAL PROCEDURE
RESULTS AND DISCUSSION
SUMMARY
REFERENCES

Protein C (PC), a vitamin K-dependent (VKD) plasma serine protease zymogen (Mammen et al., 1960; Stenflo, 1976), is synthesized mainly in the liver and in endothelial cells (Tanabe et al., 1991) as a single chain polypeptide. The majority of the protein is converted into a two-chain disulfide-linked zymogen during secretion from the cell (Foster et al., 1990; Grinnell et al., 1991). The structure of PC is highly homologous to other VKD blood coagulation factors VII, IX, and X. However, once activated by thrombin bound to thrombomodulin on the endothelial cell surface, it functions as a natural anticoagulant (for reviews on PC and its activation, see Esmon and Esmon, 1984; Esmon, 1989, 1995a,b; Tuddenham and Cooper, 1994; Suzuki, 1995; Davie, 1995). Genetic defects in the PC pathway are associated with an increased risk for venous thrombosis (Reitsma, 1997). Deficiency of PC is one of the most common causes of an inherited autosomal dominant thrombophilia.

The zymogen (see Fig. 1) contains a pre-pro leader peptide (absent in the mature protein), a gamma -carboxyglutamic acid (Gla) domain of 9 Gla residues, a short helical hydrophobic amino acid stack, two epidermal growth factor (EGF)-like domains, a linking peptide between the light chain and the heavy chain, an activation peptide, and a trypsin-like SP domain in which the catalytic triad is located at His-211(cn57), Asp-257(cn102), and Ser-360(cn195) (Fernlund and Stenflo, 1982; Stenflo and Fernlund, 1982; Foster and Davie, 1984; Foster et al., 1985, 1987). Post-translational modification removes the dipeptide Lys-156-Arg-157, so that the single chain form is converted into a two-chain molecule linked by a disulfide bond; 80% of the zymogen PC circulating in plasma is in this form (Foster et al., 1990; Grinnell et al., 1991). The activation, fluorescence properties and amidolytic activity of the single- and two-chain protein are identical (Rezaie and Esmon, 1995). Also, carboxylation of Glu residues in the amino terminal Gla domain (Stenflo, 1976; Kisiel et al., 1976), hydroxylation of an Asp residue in the EGF1 domain (Drakenberg et al., 1983; Fernlund and Stenflo, 1983) and glycosylation (McClure et al., 1992; Grinnell et al., 1991) are post-translational events.




View larger version (32K):
[in this window]
[in a new window]
 
FIGURE 1   Primary amino acid sequence of human PC deduced from Foster et al. (1985). The domains are labeled according the definitions used in the text. The catalytic residues are also (hatched) circled. Bold-faced letters indicate the residues whose positions were initially modeled. gamma , gamma -carboxyglutamic acid residues; beta , beta -hydroxyaspartic acid.

Activation occurs when a negatively charged peptide (Lys-158-Arg-169) is released by the cleavage of the peptide bond between Arg-169 and Leu-170. Activated protein C (APC), in the presence of cofactor protein S (PS), proteolytically inactivates both factors Va and VIIIa on platelets and endothelial cells (Fay et al., 1991; Kalafatis et al., 1994). Recently, it has been shown that in addition to the function of factor V as a procoagulant after its activation by thrombin, factor V plays an important role in the anticoagulant system as a phospholipid (PL)-bound cofactor to APC (Dahlback and Hildebrand, 1994). It is also reported that factor V acts synergistically with PS as PL-bound cofactors in the presence of calcium ions (Dahlback and Hildebrand, 1994; Shen and Dahlback, 1994; Shen et al., 1997a; Dahlback, 1997). In a study using fluorescence resonance energy transfer, Yegneswaran et al. (1997) concluded that the function of PS as a cofactor for APC is to alter the active site location of APC above the membrane surface. Yegneswaran et al. (1997) also showed that the chimera protein obtained by replacing the Gla domain of APC by that of prothrombin alters the active site location above the membrane surface. This chimera protein alone shows the same inactivation rate of factor Va as the APCwild-type/PS complex. Factor Xa also protects factor Va from inactivation by APC (Nesheim et al., 1982; Jane et al., 1991). Both active and precursor forms of factor X bind to factor Va with equal efficacy and both compete with APC for its factor Va binding site (Jane et al., 1991). In addition to its anticoagulant activity, APC has also been shown to have profibrinolytic (Zolton and Seegers, 1973; Comp and Esmon, 1981; De Fouw et al., 1988), anti-ischemic, and anti-inflammatory (Esmon et al., 1991) activities.

The solution of the x-ray crystal structure of a significant portion of APC (Mather et al., 1996) provides an opportunity to complete the zymogen structure by theoretical techniques. Major advances of past decade in the quality of force fields (Cornell et al., 1995; Cheatham et al., 1995), molecular dynamics methodology (Essmann et al., 1995), and parallel computers make such a substantial undertaking possible. The hypothesis is that molecular dynamics (MD) simulations, if firmly based on structure, are now reasonably accurate for time scales of nanoseconds.

We have developed herein a solvent-equilibrated model for the complete zymogen PC in the presence of bound calcium ions. The initial structure is based on the 2.8 Å resolution x-ray crystal structure available for the Gla-domainless human APC (Mather et al., 1996). Homology modeling, described below in computational procedure, is used to introduce the Gla domain, the following aromatic amino acid stack, and the activation peptide. Molecular dynamics (MD) simulations are used to obtain an accurate solution structure of calcium-bound human PC. In the present model, the carbohydrate chains are not included at the four Asn-linked glycosylation sites that are post-translationally glycosylated in the circulating zymogen. Also, the post-translationally removed Lys-156-Arg-157 dipeptide in PC is present in the model, although this segment does not appear to play a role in activation (Rezaie and Esmon, 1995). Comparison of the modeled zymogen structure with the x-ray crystal structure of APC may clarify functional activities of the enzyme form. Consequently, the structural changes between the zymogen (solution model) and the enzyme (x-ray) are compared with three other zymogen/active SP systems for which x-ray crystal data are available for both zymogen and active forms. Possible functional roles of Gla and EGF1 domains and the high affinity calcium ions bound to EGF1 and SP domains are also discussed. Whereas earlier modeling work focused on the activated thrombin (IIa)-thrombomodulin (TM)-PC complex in the absence of solvent or calcium ions (Knobe et al., 1999) and on a static model that included the activation peptide and SP domain only (Fisher et al., 1994), our objective has been to obtain an accurate description for the solution structure, including state of the art dynamics, of zymogen PC in its complete calcium-bound, fully hydrated configuration. The coordinates for the model are available from the authors on request.



    COMPUTATIONAL PROCEDURE
TOP
ABSTRACT
INTRODUCTION
COMPUTATIONAL PROCEDURE
RESULTS AND DISCUSSION
SUMMARY
REFERENCES

Model construction

The x-ray crystal coordinates of human APC (Protein Data Bank (pdb) entry, 1aut), along with the crystallographic water molecules, were used to model the initial structure of the zymogen form of human PC. The Gla domain and the following aromatic helical stack were absent in the x-ray crystal coordinates, as was the activation peptide. A flow chart of the model building procedure, described below in detail, is given in Fig. 2. The structure of the Gla domain of PC and the following aromatic helical stack, was constructed by replacing the necessary residues of the x-ray crystal structure of factor VIIa (pdb entry 1dat) using SYBYL 6.4 (Tripos, Inc., St. Louis, MO). The x-ray crystal structure of factor VIIa was determined in the presence of tissue factor (TF) and the Gla domain does, in fact, interact directly with TF, potentially causing a reorientation in the Gla-EGF1 orientation. Thus, the Gla-EGF1 orientation found in the factor VIIa/TF complex may be unique for that particular system and may not necessarily reflect the orientation of a general Gla-EGF interaction. The crystal structure is available for another VKD protein, porcine factor IXa, but the electron density beyond the helical stack toward the N-terminus of the Gla domain was not sufficient to resolve the Gla domain structure (Brandstetter et al., 1995). In our current model, we construct the Gla segment based on the x-ray crystal structure of the factor VIIa Gla domain, but link it to the x-ray crystal structure of Gla-domainless PC so that the Gla-EGF1 orientation is similar to that of the porcine factor IXa (pdb entry 1pfx). PC residues Gln-49,Cys-50,Cys-69-Arg-91 and factor IXa residues Gln-50, Cys-51, and Cys-62-Leu-84 (including beta -hydroxy aspartate (Hya)-71) were used in the backbone atom alignment. Also, residues in the helical stack at the C-terminal end of the Gla domain served to position the Gla domain to the Gla domainless PC. Because the Gla domain of the x-ray crystal structure of factor VIIa was mutated to create the Gla domain of PC, the relative calcium positions were preserved. Using the calcium ion coordinated to the EGF1 domain of factor VIIa as a template, a calcium ion was introduced to the Gla-EGF1 connecting region of PC. An additional calcium ion was introduced to the catalytic region (Mather et al., 1996) at a site similar to the calcium binding site in trypsin.




View larger version (45K):
[in this window]
[in a new window]
 
FIGURE 2   A flow chart summarizing the model construction of zymogen PC from the x-ray crystal structure (Mather et al., 1996) of calcium-free Gla-domainless APC.

During activation, a 12-residue polypeptide is cleaved from the zymogen form of PC. In addition to the cleaved segment, modeling to determine the coordinates for the final 9 residues from the C-terminus of the light chain of APC (absent in the x-ray coordinate file due to disordered electron density) was necessary. The complete activation loop was added in two steps. First, it was necessary to redefine the N-terminus of the SP domain, since this fragment is inserted into the body of the protein in the structure of APC (x-ray). The N termini of available SP x-ray crystal structures in their activated forms are remarkably similar. Fortunately, x-ray crystal structures exist for the zymogen forms of several SPs; chymotrypsinogen (Wang et al., 1985), prethrombin-2 (Vijayalakshmi et al., 1994), and proproteinase E (Gomis-Ruth et al., 1995). In these zymogen structures, the backbone atoms of key residues starting at residue 20 (chymotrypsin numbering (cn)) are remarkably similar when superimposed. In Fig. 3, the N-terminal segment of the SP domains of PC, chymotrypsin (pdb entry 4cha), alpha -thrombin complexed with hirugen (pdb entry 1hah), porcine pancreatic elastase (pdb entry 1btu) are compared. Backbone atoms of the residues 20-25 (cn numbering) are used for the alignment. Also shown are the zymogens; chymotrypsinogen (pdb entry 2cga), prethrombin-2 complexed with hirugen (pdb entry 1hag), and proproteinase E in a ternary complex with procarboxypeptidase A and chymotrypsinogen (pdb entry 1pyt). The N-termini of the active forms of these SPs align well. The residues at position 16 in chymotrypsinogen and prethrombin-2 are found to have a similar orientation. In proproteinase E, however, residue 16 is orientationally rotated ~120°. Finally the residue at position 16 in all of the activated SP systems is found at a relative orientation of 120° to both chymotrypsinogen/prethrombin-2 and proproteinase. Thus, we were able to reconstruct the N-terminus region of the catalytic domain of the zymogen PC by overlaying it on the N-terminus of any of these three zymogen systems. We chose to reconstruct the N-terminus region of PC by using the corresponding residues of proproteinase E as a template. This choice generally orients the initial model for the activation peptide toward the putative calcium ion binding site in the catalytic domain (see later discussion). The loop search algorithm in SYBYL6.4 Biopolymer package (Tripos) was then used to introduce the AP between the C-terminus of the light chain and the N-terminus of the catalytic domain. Twenty-five different loops were generated, of which 10 appeared reasonable. Of these, a conformation that presented the pre-PC linking dipeptide (Lys-156-Arg-157) as well as the scissile activation bond well exposed was chosen. This conformation accommodates interaction of the AP with residues that directly coordinate the SP domain bound calcium ion. All necessary hydrogen atoms were added using the Protonate module of AMBER 5.0 (Case et al., 1997).




View larger version (33K):
[in this window]
[in a new window]
 
FIGURE 3   The Ca alignment of the N-terminal residues of activated serine proteases: chymotrypsin, alpha -thrombin, bovine pancreatic elastase, and PC. All the N-termini are well superimposed for the above active serine proteases; N labels the N-terminal Calpha of the active forms. The Calpha values of zymogens (prethombin-2, chymotrypsinogen, and proproteinase E) are also shown here. Alignment uses residues 20-27(cn) for all proteins. In chymotrypsinogen and prethrombin-2, the residues corresponding to the AP of PC are found to the left in the figure; the corresponding residues of proproteinase E exhibit a rotation of 120° to the right.

Simulation protocol

We initially energy minimized the structure around the PC/factor IX Gla domain splice point (residues 34-46) and the structure surrounding the calcium ion bound to the EGF1 domain, since part of the helical stack (residues Ser-42, Asp-46, and Asp-48) contacts this calcium ion. Since the remainder of the protein structure was based on the energy minimized x-ray crystal structure, no additional segment-wise energy minimization runs were necessary for backbone refinements. However, energy minimization of the side chains was employed (500 steepest descent steps followed by 10,000 conjugate gradient steps). The protein (along with the crystallographic water molecules and the calcium ions) was then solvated in a box of water molecules so that the box boundaries were at least 12.5 Å away from any protein atom. Water molecules used in the solvation of the protein for which the oxygen or hydrogen atoms were within 2.0 Å of any atom in the protein were excluded. The central simulation box contained 409 residues of the zymogen PC, 25,010 water molecules, and 9 calcium ions. In addition to the calcium ions, six sodium ions were introduced as free non-protein-bound counterions. These counterions did not coordinate with any of the protein residues at any time during the MD simulation. The simulation system was thus electrically neutral. The total number of atoms in this box was 81,399. In the first step, only the added water molecules and counterions were energy minimized at constant volume (10,000 conjugate gradient steps), then all atoms except the protein were subjected to energy minimization (another 10,000 conjugate gradient steps) and finally, the whole system was energy minimized (10,000 conjugate gradient steps). All atoms except those of the protein were subsequently subjected to a slow heating procedure to bring the temperature of the system to 300 K. In this heating procedure, the temperature of the system was raised by 50° at a time in seven intervals of 1.5 ps each. After 25 ps of a constant volume-constant temperature MD simulation, the system was reminimized (5000 conjugate gradient steps). After another heating run of 10 ps to bring the temperature back to 300 K, a constant-temperature-constant volume MD run was performed for 25 ps. Finally, a constant pressure-constant temperature protocol was adopted to simulate 2900 ps of MD.

In the present study, we used the second generation of AMBER force field (Cornell et al., 1995) in conjunction with the particle mesh Ewald method (Essmann et al., 1995) to accommodate long range interactions in all of the solution simulations. The TIP3P model was used to represent the water molecules. The MD trajectory calculations were done with AMBER version 5 (Case et al., 1997). Non-bonded cutoff for the direct sum was chosen to be 8.0 Å. The time step was 1 fs with the nonbonded interactions updated at every step. This choice of updating nonbonded interactions and the periodical removal of the translational and rotational motions of the center of mass (at every 2.5 ps) prevents artifacts such as the "flying ice cube" (Harvey et al., 1998).



    RESULTS AND DISCUSSION
TOP
ABSTRACT
INTRODUCTION
COMPUTATIONAL PROCEDURE
RESULTS AND DISCUSSION
SUMMARY
REFERENCES

Global aspects of the simulation

Our model for the zymogen form of human PC in solution is based on the x-ray crystal structure of APC (Mather et al., 1996). The Gla domain, the helical stack following the Gla domain, the AP, and the calcium ions were aligned to the x-ray crystal structure based on homology considerations. The production runs were performed at constant pressure and constant temperature. The density of the system was found to fluctuate around 0.996 g/cc during the final 500 ps of the 2.9-ns trajectory.

The root mean square deviations (RMSDs) of backbone atoms from their initial positions (t = 0 ps) have been used to measure the stability of the simulation and to provide insight into possible structure fluctuations (Hamaguchi et al., 1992; Li et al., 1996; Perera et al., 1997). The RMSDs for the complete protein and for certain selected substructural domains are presented in Fig. 4. The evaluation of RMSDs of substructural domains is carried out after aligning appropriate residues from a conformation at time t, with that at t = 0 ps. For example, only Gla domain residues are used for alignment in the calculation of RMSD for that domain. The sharp rise observed during the first 800 ps in the RMSD of all residues tends to flatten out but is subject to transient fluctuations. The magnitude of this RMSD curve, however, does not continue to increase for the final 2000-ps segment, implying that the protein structure is stable over this time scale. Individual domains show relatively smaller RMSDs and fluctuations. Small fluctuations around the average values over the complete time period of 2900 ps is an indication of stable secondary structure.




View larger version (37K):
[in this window]
[in a new window]
 
FIGURE 4   The root mean square deviations (RMSDs) of the backbone atoms. (a) All amino acid residues (dark solid line), the Gla residues 1-35 (filled circles), the EGF1 residues 48-92 (open triangles) and the EGF2 residues 93-136 (open squares). (b) The AP (residues including the connecting region; 137-169) (open triangles) and the SP domain (residues 170-409) (filled circles). (c) Combined Gla/EGF1 region (residues 1-91) (open squares), combined EGF1/EGF2 region (residues 48-136) (filled circles) and the entire light chain (residues 1-136) (dark solid line).

We conclude that on the 3-ns time scale there are no significant structural changes when going from crystal to solution. The EGF1 domain displays slightly larger magnitude in RMSD. Similar large RMSD is found in the initial 1500 ps for the AP. The final 1-ns segment of the AP is remarkably flat, implying that this region has reached its equilibration during the first 1500 ps of dynamics. Although the initial segment of the RMSD of the EGF1 domain rises sharply during the first 200 ps, an eventual stabilization is observed for this domain. Since part of the EGF1 domain (second disulfide loop containing a Hya residue) was necessarily modeled, it is not surprising that the early time RMSD values were elevated. Likewise, the significant changes to the coordinates of the N-terminus residues of the EGF1 domain, modeled so as to align the Gla-EGF1 orientation consistent with that of factor IXa, contribute to early time RMSDs. Residues 137-169 (including AP residues 158-169) show somewhat larger fluctuations during the first 1500-ps segment, but these become subdued during the latter part of the trajectory. In general, domain-domain movements are apparent in that the RMSDs increase as domains are jointly considered (RMSD of light chain > RMSD of {Gla + EGF1} or {EGF1 + EGF2} > RMSD of Gla or EGF1 or EGF2).

The relative dynamics of individual residues (alpha -carbons) that may not contribute to observable reorganization in secondary structure can be investigated by computing the atomic B-factors (York et al., 1994) using the variance, < Delta r2> , in the simulations through the relationship B = [8pi 2/3]< Delta r2> ). The simulation B-factors were calculated using the coordinates of the last 200-ps segment of the MD trajectory (Fig. 5). Since there is a tumbling of the molecule within the central box, one must first remove this motion in obtaining the average structure during the last 200 ps. The simulation B-factors were computed by two different procedures; first by optimally aligning all backbone atoms with those of the t = 0 ps configuration (shown in solid lines) and secondly by aligning backbone atoms of each domain with the corresponding atoms of the t = 0 ps configuration (shown in dotted lines). It is clear that the latter procedure is desired if domain-domain movement takes place. Relatively smaller B-factors are apparent for the catalytic domain (residues 170-409), indicating a stable solution structure for this domain for the zymogen form (Fig. 5 b). Although not exact, the experimental pattern for the peak positions is largely conserved in the calculated B-factors indicating the atoms with large motion in the x-ray crystal structure remain mobile during the simulation. The patterns in the calculated B-factors remain the same regardless of the two methods used to calculate them, but the calculations performed using the entire molecule for alignment show slightly elevated B-factors. The segment 1-12 (Omega -loop residues) at the N-terminus of the light chain shows relatively large B-factors if the entire molecule is used in the alignment. There is a noticeable shift, however, in the baseline from the N-terminal Gla region to the end of EGF1 domain. This shift is not observed when the B-factors are calculated using the alignment of individual domains. This baseline shift originates from the interdomain motion among Gla, EGF1, and EGF2 domains. Relatively larger peaks observed in the dotted curve may be attributed to the intradomain rearrangements. For example, the Omega -loop as a region can move without yielding a large perturbation to the Gla domain because of its protruding arrangement and, indeed, this region shows larger B-factors. The same is true for the central region of EGF1 domain where several residues are involved in the binding of a calcium ion.




View larger version (38K):
[in this window]
[in a new window]
 
FIGURE 5   B-factors calculated for backbone Calpha -atoms using the last 200-ps segment of the MD trajectory. (a) Light chain. (b) SP domain. The B-factors were calculated by using the entire protein as a template for alignment (dark solid lines) or after aligning individual domains and combining data (dotted line). The domain definitions are given in the text. X-ray B-factors are also shown (open circles). Interdomain motion, unless removed, confounds B-factor evaluation.

Light chain

The light chain of PC (Fig. 6 a) consists of three domains: Gla (residues 1-37, Fig. 6 b), EGF1 (residues 49-91, Fig. 6 c), and EGF2 (residues 92-136, Fig. 6 d). A helical stack (residues 37-48) connects the Gla domain to the first EGF domain. The residues of this stack participate in the coordination of the calcium ion bound to the first EGF1 domain. Following the EGF2 domain, a 21-residue peptide (residues 137-157) connects the EGF2 domain to the activation peptide (residues 158-169). This 21-residue peptide chain is also linked to the SP domain through a disulfide bond.




View larger version (28K):
[in this window]
[in a new window]
 
FIGURE 6   (a) The light chain backbone structure created from the last snapshot at t = 2900 ps of the trajectory. Calcium ions; light solid spheres. The Gla and EGF1 domains stack vertically; the EGF2 domain is positioned perpendicular to the hypothetical long axis of Gla and EGF1 domains. The SP domain (not shown) positions on top of the EGF2 domain. (b) The Gla domain backbone structure generated at t = 0 ps (dark tube) is superimposed with the final snapshot of the backbone atoms (light tube) of the Gla domain. The calcium ions correspond to the final structure. (c) Same comparison as in b, but for the EGF1 domain. (d) Same comparison as b, but for the EGF2 domain. All figures are generated using the program Molscript (Kraulis, 1991).

The Gla domain and the helical stack (residues 1-47) were missing from the N-terminal region in the x-ray crystal structure of human APC (Mather et al., 1996). Since the reported 3-D structure was of the activated form, the AP is also absent (residues 158-169) and, due to diffuse electron density, a peptide fragment from the C-terminal region of the light chain was missing (residues 147-157). We reconstructed the missing portions in our simulated protein using homology modeling. The stability of the individual domains can be examined by computing the differences between the backbone alpha  carbon positions in the initial and the final configurations after 3-D alignments. In Fig. 7 a, the deviations of Calpha positions are measured by aligning a given domain structure at the final configuration to the corresponding domain structure at t = 0 ps. On the average, most residues remain near their initial Calpha positions (deviations around 1 Å). Smaller deviations occur for the residues in the EGF2 domain. The segments that exhibit appreciable Calpha deviations include the N-terminal region of the EGF1 domain (residues 47-56). This region involved considerable modeling to generate the structure, particularly for the calcium ion in the EGF1 domain. The Calpha deviations can also be used as a measure for the domain separations. In Fig. 7 b, each domain of the final solution structure is aligned separately to the corresponding domains of the solution structure at t = 0 ps. We display the curves after separate helical stack, EGF1 and EGF2 alignments. Once the helical stack (residues 34-48) is aligned (dark solid line), domain motions in both sides are visible via their increased Calpha distances. Similar behavior is observed for EGF1 (dotted line) and EGF2 (light solid line) alignments. We conclude that the Gla, EGF1, and EGF2 domains have relative movements during the trajectory, but each largely retains an individual domain structure similar to the initial structure.




View larger version (45K):
[in this window]
[in a new window]
 
FIGURE 7   (a) Deviations of alpha -carbons of the corresponding light chain residues in the active and zymogen (solution model) structures after optimal alignments. Each domain is separately aligned. (b) Calpha deviations for the corresponding residues in the active and zymogen forms after aligning only one domain at a time. Domain alignments are shown by dark solid line, the helical hydrophobic stack (residues 35-48); dotted line, the EGF1 domain (residues 48-91); and light solid line, the EGF2 domain (residues 92-136).

The Gla, EGF1, and EGF2 domains do not directly interact via hydrogen bonds or salt bridges. Only the residues in the connecting regions make direct hydrogen bonds with the residues in the adjacent domains. Flexibility around the connecting region results, though the integrity of the individual domains are maintained. Even though there is interdomain motion, the length of the protein is relatively unaltered, because the motions are largely around the long axis of the molecule.

In the initial modeling, the Gla-EGF1 orientation of porcine factor IXa was used to orient the Gla domain of PC with its EGF1 domain. However, in the final structure of PC, an orientation distinct from that of factor IXa is observed. The light chain of PC, when compared with the corresponding segments of factors VIIa (in the TF-bound form) and IXa, has evolved to a shape between that of the light chain of elongated factor VIIa and of curved factor IXa.

Gla domain

The Gla domain of human PC was modeled by making necessary amino acid residue substitutions to the x-ray crystal structure of factor VIIa (Banner et al., 1995, 1996) calcium and TF bound, pdb entry 1dan. The calcium and crystal water molecules were retained unmodified. In the initial configuration, in which the Gla-EGF1 orientation is similar to the x-ray crystal factor IXa-assumed alignment, there are no direct hydrogen bonds, van der Waals contacts or salt bridges among the residues of the two domains. As has been indicated by the smaller magnitudes and fluctuations in the RMSD as well as the small deviations of Calpha positions, the secondary structure of the Gla domain is not altered from its original form (see Fig. 6 b, in which the Gla domains of PC in the final structure and factor VIIa are compared).

The Omega -loop and the unique H-bond network at Ala-1, which is tucked into the interior of the protein and makes H-bonds with residues Gla-16, Gla-20, Ile-21 and Gla-26, is maintained. Also, the Gla-calcium network that is found in the structure of the Gla domains of VKD proteins (prothrombin and factor VIIa) is well-preserved in the present simulation. Of the seven calcium ions present in contact with Gla residues, four have two or fewer coordinations with water molecules. The remaining three Gla residues, which are more solvent accessible, are coordinated with three or more water molecules throughout the trajectory calculation. The three hydrophobic residues (Phe-4, Leu-5, and Leu-8) in the Omega -loop extend away from the protein, oriented for potential insertion into the lipid membrane.

Compared to the other VKD blood coagulation proteins, human PC has a low PL (phosphatidylserine/phosphatidylcholine) binding affinity (Kd ~ 1500 nM). Bovine PC and human and bovine factor VIIa have been shown to have even lower binding affinities toward lipid surfaces (Kd > 15,000 nM; McDonald et al., 1997a,b; Shen et al., 1997b). It has been proposed (Shen et al., 1997b) that the lower affinity of bovine PC is largely due to the presence of proline at position 10 in place of His in human PC. However, in an experiment in which the first 46 residues in the N-terminal region of PC is replaced by that of factor VII (Geng and Castellino, 1997), the activated form of the modified protein was found to be very similar to that of its wild-type counterpart in plasma anticoagulant activity, as well as activity toward inactivation of coagulation factor VIIIa. All VKD coagulation proteins except factor VII and PC contain a Gla residue in position 33. McDonald et al. (1997) suggest that position 33 which is occupied by either Gln (in PC) and Arg (in factor VII), may be responsible for the low binding affinities. Also, the mutation of Ser-11, a residue unique to human PC (in all other VKD coagulation proteins it is Gly), to Gly yields increased membrane affinity (Shen et al., 1998). The residues discussed above are highlighted in Fig. 8. Assuming that the lipid binding of the Gla domain is through a combined mechanism of PL insertion of Phe-4, Leu-5, and Leu-8 (Christiansen et al., 1994, 1995a; Christiansen and Castellino, 1994; Jalbert et al., 1996) and calcium bridging, one can assign the calcium ion plane to be coplanar with the lipid head groups. Given this conformation, His-10 could have an increased interaction strength with the anionic lipid head groups since it can be positively charged. Conversely, Ser-11 is placed so that it blocks any incoming lipid headgroup from bridging with a nearby calcium ion. Such a situation would reduce calcium ion exposure with the charged lipid head groups, thereby possibly reducing the lipid binding affinity of human PC.




View larger version (33K):
[in this window]
[in a new window]
 
FIGURE 8   A snapshot of the Gla domain (zymogen) viewed perpendicular to the helical hydrophobic stack (residues 35-48) at t = 2900 ps. Important residues described in the text are marked using the stick representation of bonds. The solid spheres represent the calcium ions. gamma , gamma -carboxyglutamic acid.

Gla-26-Lys is a functionally defective mutation in human PC found in the plasma of patients with hereditary thrombophilia (Nishioka et al., 1996). This mutant exhibits lowered activation rates and anticoagulant activity. Even though defective APC and normal APC both displayed similar PS binding, defective PC and its activated form do not bind to phospholipids or cultured human endothelial cells. The finding of Nishioka et al. (1996) suggests that a Gla-26-dependent conformation is required for the binding of PC or APC to PL membrane, the IIa/TM complex and the surface of endothelial cell PC/APC receptor, but not for PS interactions. In the zymogen PC conformation, we find that Gla-26 is substantially involved in the Gla-calcium network apparently required for membrane binding (Colpitts et al., 1995). It is directed toward the interior of the protein where several of the buried calcium ions are located, forming four ionic bonds with three different calcium ions. Gla-26, along with Gla-7, Gla-16, and Gla-29, coordinates three calcium ions, whereas the other Gla residues have only one or two calcium ions in their coordination shells. Particularly, Gla-26, in concert with Gla-7 and Gla-16, may play an indirect role on the formation of the Omega -loop, since all bind to a calcium ion that is in the immediate solvation shell of Ala-1. Since the initial calcium ion positions were obtained from the x-ray crystal structure of FVII/TF complex, and the Gla residues are well preserved between FVII and PC, and also based on our previous computational work on factor IX calcium positioning that was used to guide the NMR experimental data (Li et al., 1997), we are confident that the calcium positions in our model after 3 ns may be near to what one might find in the actual system.

The location of the active site of membrane-bound APC relative to a PL surface has been determined using fluorescence resonance energy transfer (Yegneswaran et al., 1997). In the absence of PS, the distance is found to be 94 ± 3 Å. The incorporation of PS, however, reduced this distance to 84 Å. Subsequent addition of PS did not change the distance of the active site from the lipid surface for factors Xa, IXa, and VIIa. The reduction in distance was attributed to the cofactor function that apparently relocates the active site of APC. In the present simulation, the active site of the zymogen form of PC is about 89 Å above the hypothetical membrane surface. To provide this conservative estimate, we assume that the calcium ion plane resides on the membrane surface for bridging interactions and the three hydrophobic residues (Phe-4, Leu-5, and Leu-8) are inserted into the lipid membrane.

EGF1 domain

The main function of the EGF domains is likely to provide protein-protein or protein-cell interactions (Hogg et al., 1992). Cheng et al. (1997) evaluated the importance of the functional interactions of the EGF1 residues of PC with activators and substrates. They noted through mutational studies that the charged residues of the EGF1 domain and the calcium ion binding site maintain the structural integrity of this module but these residues have little direct functional interaction with activators or substrates. Also, the calcium ion binding to the Gla domain does not seem to be influenced by calcium ion binding to the EGF1 domain (Geng et al., 1996; Cheng et al., 1997).

The EGF1 domain of PC differs somewhat from the corresponding domain of other VKD coagulation proteins, with the presence of an additional small loop (residues 59-63) created by a disulfide bond. This domain shows the largest RMSD among the individual domains present in the light chain (Fig. 4), but the alignment of the x-ray crystal and solution backbone structures (Fig. 6 c) does not reveal any global restructuring in this domain. The Calpha deviations (Fig. 7) for the EGF1 domain indicate that a significant contribution to the RMSD for this domain may arise from residues 52-57. Also, above average Calpha deviations are found in the region bracketing Hya-71. The calcium ion present near the EGF1 domain (recall that we have modeled and equilibrated this structural element) is doubly coordinated with Hya-71 through two side chain carboxylate oxygens. The only other residue from the EGF1 domain to coordinate this calcium ion is Ile-73, via the backbone carbonyl oxygen. Ser-42, Asp-46, and Asp-48 from the connecting region between Gla and EGF1 domains also coordinate with this calcium. One stable water molecule is found in its coordination shell (Fig. 9 a). This calcium ion maintains a stable eight-member coordination shell. As was seen in the model of factor VIIa (Perera et al., 1999), the geometry of the expanded coordination shell is a pentagonal bipyramid in which carbonyl O of Ser-42, OD2 of Asp-46, both OD1 and OD2 of Hya-71, and carbonyl O of Ile-73 define the pentagon; a water O is located at the bottom vertex. OD1 and OD2 of Asp-48 share the other vertex. The equatorial oxygen distances are found within 2.1 to 3.1 Å of the neighboring ligands and the top and bottom vertex oxygens maintain their distances to other complexing oxygens between 3.0 and 3.8 Å. The water oxygen distances are biased toward larger values, indicating that the calcium ion maintains a one-sided, glove-like network with the protein atoms. The EGF1 calcium ion appears to provide a pivot point for Gla-EGF1 relative motion. In contrast, the SP domain bound calcium ion (Fig. 9 b) is tightly surrounded by 7 ligand oxygens in a square-bipyramidal environment throughout the simulation.




View larger version (18K):
[in this window]
[in a new window]
 
FIGURE 9   The final zymogen structure coordination spheres of the high affinity calcium ions bound to (a) the EGF1 domain and (b) the SP domain.

Christiansen et al. (1995) showed that the helix-breaking mutation of Ser-42-Pro results in a ~35% reduction in anticoagulant activity of APC (Christiansen et al., 1995b). In an assessment of the role of the helical stack (residues 34-43), substitution of the helical stack of PC by that of factor IX did not result in substantial changes in the Gla domain-related calcium-dependent properties, suggesting that the helical structural element plays a more important role than the specific amino acids. The backbone of the peptide fragment containing Hya-71 remains perpendicular to the helical stack axis of hydrophobic residues (see Fig. 6 a). If we assume our present model, and if the hydrophobic residues Phe-4, Leu-5, and Leu-8 do insert themselves into the PL surface, then the Gla and EGF1 domains can be viewed as a cylindrical column built on the inserted residues. We have observed a relative motion between the Gla and EGF1 domains throughout the simulation. Interestingly, we found smaller relative Gla-EGF1 motion in our simulation study for the light chain of factor VIIa (Perera et al., 1999) in the absence of TF. The EGF1-bound calcium ion and its coordinating residues are not in direct contact (no salt bridges or hydrogen bonds) with residues that bind to the calcium ions of the Gla domain. Thus, the simulation structure is consistent with the experimental work of Cheng et al. (1997), who concluded that no significant influence on calcium ion binding of the Gla domain is found due to the EGF1 calcium binding. Finally, the bulk of the EGF1 domain residues made no hydrogen bonds or salt bridges with the neighboring Gla and EGF2 domains. However, Arg-91 in the connecting region between the EGF1 and EGF2 domains maintains strong contacts with both Glu-85 (in EGF1) and Asp-101 (in EGF2). The lack of a number of significant contacts with neighboring domains, however, suggests the possibility of relatively low energy torsional motions near the domain connecting regions.

EGF2 domain

The EGF2 domain (Fig. 6 d) is a stable region that does not appear to undergo significant restructuring during the activation process. In fact, except for a few residues in the N-terminal connecting region, most EGF2 residues exhibit Calpha deviations smaller than 1.0 Å from the x-ray crystal structure (Fig. 7). In addition to the hydrogen bonds found in the activated form of the x-ray crystal structure (Asn-102/Arg-286, His-107/Thr-371, Tyr-108/Leu-278, and Cys-109/Arg-286) between the EGF2 and SP domains, a hydrogen bond evolves between the EGF2 and SP domains of the predicted solution structure of the zymogen form (Asp-101/His-369). Despite the nearness of EGF2 to the catalytic domain, direct contacts are limited.

Activation peptide (AP)

The AP of PC is the shortest (only 12 residues from Asp-158 to Arg-169) of the VKD proteins that release an AP. It contains 6 acidic residues and only one basic (Lys) residue, and is therefore highly negatively charged. On the other hand, the segment connecting the AP to the EGF2 domain (residues 137-157) contains 8 Lys and Arg residues and only one acidic Glu residue (charge = +7e). This situation creates a remarkable dipolar environment for this segment of this protein. In fact, bovine PC carries only four Asp residues whose charge is counterbalanced by two basic residues present in the AP region, making this region still a slightly acidic one. Still, the 23 amino acid residues that connect the AP with the EGF2 domain of bovine PC contain 10 Lys and Arg residues to one Glu (charge = +9e). APs of both human and bovine forms of factors IX (35 AP residues) and X (52 AP residues) are 3 to 4 times larger than that of PC and contain an excess of 6 to 10 acidic residues. For factors IX and X, the amino acid residues connecting the AP to the EGF2 domain contain an excess of only 3 or 4 basic residues. Such differences among PC and other VKD proteins in charge distributions around the scissile bond may require different conditions for the initial approach of the zymogen to its enzyme for its activation. For example, thrombin alone can activate PC but activation rates are increased by at least 1000-fold in the presence of TM. TM may thus enhance the electronegative environment around the active site of thrombin. In this section, we analyze the predicted structure of the AP (158-169) along with the linker peptide (residues 137-157) that connects the EGF2 domain to the AP. Despite the fact that this segment is modeled, RMSDs were found to be comparable with the other regions. Fig. 10 provides a snapshot (at t = 2.9 ps) of this region along with the backbone ribbon of the initial configuration (t = 0 ps).




View larger version (21K):
[in this window]
[in a new window]
 
FIGURE 10   The activation peptide (residues 158-169) along with the connecting region (residues 146-157) that follow the EGF2 domain. The backbone (ribbon) at t = 0 ps and the backbone atoms (sticks) at t = 2900 ps are displayed.

PC is activated by IIa bound to TM on endothelial cells (Esmon et al., 1993). Positively charged groups near the catalytic site cleft of thrombin may directly interact with the negatively charged AP in PC activation. Mutations of several acidic residues in or near active cleft of thrombin (Glu-217(cn)-Ala, Glu-192(cn)-Gln, and Glu-39(cn)-Lys) result in increased IIa/PC interactions but have a little effect on those with IIa/TM (Le Bonniec and Esmon, 1991; Le Bonniec et al., 1991; Rezaie and Esmon, 1993). Mutations of the acidic residues either to basic or neutral residues increase the electropositivity of the electrostatic surface around the thrombin catalytic cleft and favorably accommodate the negatively charged AP residues adjacent to the scissile bond through enhanced electrostatic interactions. Such movement would require the cleavage site of the AP of zymogen PC to be exposed to the incoming thrombin (or the IIa/TM complex). The AP cleavage site is readily available for such interactions in the present simulation structure (solvent accessibilities are 114 and 107 Å2 for Arg-169 and Leu-170, respectively). However, the P1' (Arg-169) residue in the current final configuration has its side chain in a salt bridge with Glu-232 in the calcium binding loop; this interaction must complete with the P1'-S1' interaction that purportedly occurs at the active site of IIa.

In the present model, several H-bonds are found between the AP and SP domains including several with the calcium ion binding residues in the SP domain. Both Arg-229 and Arg-230 of the SP domain interact with Asp-167 of the activation peptide through H-bonds. Rezaie and Esmon (1992) observed a twofold decrease in the maximum calcium-dependent fluorescence quenching due to the mutation of Asp residues (to Gly) found at P3 (Asp-167) and P3' (Asp-172) positions (scissile bond = Arg-169---Leu-170). Our simulation shows that Asp-167 can actively participate in the stabilization of the SP calcium ion binding loop. Also, in our current model Asp-172 interacts with Arg-314 via a strong salt bridge. Similarly, Lys-233 interacts with both Asp-161 and Glu-163. This network enhances the availability of acidic residues in the calcium ion binding loop (residues 225-235) for their calcium ion binding function by moving side chains of the basic residues in this loop away from the acidic residues. In addition, Trp-231 and Gln-165 also form a H-bond. Although the current simulation is of benchmark size and duration, complete exploration of the AP conformations may require a longer time scale than can be simulated with current computer capabilities.

Serine protease (SP) domain

Because a major aim of the present simulation is to estimate a reasonably accurate structure for the solution form of the calcium-bound zymogen, comparison of the SP domain of the modeled zymogen form with its crystallized activated form may be useful in obtaining information about restructuring that must occur near the catalytic residues on activation.

The SP domain of human PC consists of 240 residues. The three most important residues (His-211, Asp-257, and Ser-360, rendered in Fig. 11) define the functional role of a SP. It is common, in SP domains of both coagulant and anticoagulant SPs in the blood coagulation cascade, to observe the same global fold. The catalytic residues are located at the junction between two interconnected six-stranded beta -barrel domains. The SP domain of PC, along with trypsin, coagulation factors II, VII, IX, and X, possess a calcium ion binding site.




View larger version (60K):
[in this window]
[in a new window]
 
FIGURE 11   The catalytic domains of PC in the zymogen model (dark line) and the active x-ray crystal structure (light line). Catalytic residues in the zymogen model structure are rendered. Also, loops with observed movements (APC right-arrow PC) in the backbone structure are labeled.

In modeling the SP domain for the zymogen form of PC, two major modifications to the x-ray crystal configuration of the activated form were introduced. Structural similarities found among the crystal configurations of the SPs in the blood coagulation cascade in their active and zymogen forms were used in the creation of the N-terminal connection to the AP. All hemostatic proteases circulate in blood in the inactive zymogen form and become active due to specific cleavage after Arg-15(cn) in the SP. Several VKD coagulation proteins (PC, factors IX and X) contain two cleavage sites, and the removal of a small peptide takes place during the process of their activation. Once cleaved, the newly created N-terminus of the SP domain becomes embedded in the SP interior. This structural element apparently facilitates the charge transfer from Asp-102(cn) via His-57(cn) to Ser-195(cn) required for PC catalytic activity. In the zymogen x-ray crystal structure of proproteinase E (and for two other zymogens considered), the N-terminal residues of the catalytic domain points away from the protein with activation leading to insertion of the N-terminus near the active site. The structure of proproteinase E was adopted for the modeling of the N-terminal connecting region of PC (the structure of three residues, Leu-170(cn16), Ile-171(cn17), and Asp-172(cn18), are adapted by this implementation), thereby introducing a significant change to the SP domain of the APC template. Secondly, a calcium ion, which is critical for zymogen activity, was introduced to the region described in the literature (Mather et al., 1996). We shall analyze the structural consequences of these modifications.

As has been seen from Fig. 4, the RMSDs (zymogen versus APC) calculated for the SP domain are small and comparable with those of Gla and EGF2 domains indicating a relatively stable solution structure (see also Fig. 11). Considering that PC is in the zymogen form with no inhibitor present in the simulation (as opposed to the x-ray crystal structure of APC with inhibitor present), this result is rather unexpected. The RMSDs suggest that rearrangement in the N-terminal region of the SP domain of APC does not lead to large changes in the overall solution structure. There are, however, differences in the side chain orientations (PC versus APC) and several loops undergo movement.

The side chain motion can be studied with the help of B-factors calculated for side chain heavy atoms. In Fig. 12, we plot the side chain B-factors of the SP domain residues averaged over the final 200 ps of the trajectory. The configurations at each picosecond were aligned with the t = 0 ps structure, and B-factors were then calculated for all heavy atoms. Side chain B-factors of each residue were calculated by averaging over all side chain heavy atoms present. The experimental B-factors (averaged over the side chain heavy atoms of each residue) are also given in Fig. 12 for comparison. The backbone B-factors (Fig. 5) calculated from simulation are smaller in magnitude than the corresponding B-factors calculated in x-ray experiments while the computed side chain B-factors from the simulation show either comparable or larger magnitudes than the x-ray experiment. It is reasonable that the solvent environment may lead to larger thermal fluctuations that translate to larger B-factors (than experiment) observed for side chains in the simulation. The general pattern of fluctuations in the calculated B-factors of the zymogen, however, is similar to that observed in the x-ray crystal structure for APC. The solvent accessibilities (and their changes) are shown in Fig. 12 to ascertain whether there is a correlation between the B-factors and the solvent accessibilities. The changes in the solvent accessibility (Delta SAA = SAA in the solution structure (zymogen) - SAA in the x-ray crystal structure (activated)) can identify residues that are newly exposed to the solvent or new interactions that may accompany activation. The patch containing three Lys residues, Lys-191(cn37) to Lys-193(cn39), has large B-factors and also large solvent accessibilities. This segment is recognized to be crucial for activation by IIa/TM complex but not by IIa alone (Gerlitz and Grinnell, 1996). Though all of these Lys residues are highly solvent accessible, once activated, Lys-192(cn38) loses part of its solvent exposure. In another positively charged patch of residues, Lys-217(cn62) and Lys-218(cn63) show large side chain B-factors along with high solvent accessibilities in zymogen PC. Upon activation, these two residues become even more solvent-exposed (for Lys-217(cn62) from 172 to 206 Å2 and for Lys-218 from 94 to 158 Å2). In the zymogen model Lys-217(cn62) forms a H-bond with neighboring residue Ser-216(cn61) and Lys-218(cn63) makes H-bonds with Leu-219(cn64) and Asp-239(cn84). In the x-ray crystal form (APC), both are unbound.




View larger version (41K):
[in this window]
[in a new window]
 
FIGURE 12   (a) Side chain B-factors averaged over the heavy atoms of side chains of each residue of the catalytic domain (solid line) are compared with those from the x-ray crystal structure of APC (shaded histograms). (b) Solvent-accessible areas and (c) changes in the solvent-accessible areas in going from zymogen to APC {Delta (SAA) = SAAzymogen - SAAAPC}.

Cation binding to the SP domain

Rezaie et al. (1992) reported that the binding of the calcium ion at the high affinity binding site in the SP domain of PC results in a conformational change that is required for activation by the IIa/TM complex (Rezaie and Esmon, 1992; Rezaie et al., 1992). This calcium ion binding site was predicted to be near the 10 residue loop from Glu-225(cn70) to Glu-235(cn80) (Mather et al., 1996). This segment closely follows the structure of the calcium ion binding loop of trypsin (Bode and Schwager, 1975). In our initial modeled structure, the calcium ion was introduced to a six-center coordination environment by ligating it with OE1 of Glu-225(cn70), OE1 of Glu-232(cn77), OE1 and OE2 of Glu-235(cn80), carbonyl oxygens of Asp-227(cn72), and Arg-230(cn75). A seventh ligation position was occupied by a water molecule as the simulation progressed. The coordination shell of the calcium ion bound to the SP domain is more compact than that observed for the EGF1-domain bound calcium ion (see Fig. 9b). Mutation of Glu-235(cn80)-Lys results in a form that no longer requires calcium for rapid activation by the IIa/TM complex (Rezaie et al., 1994). Presumably the calcium ion binding site is disrupted. This mutation may, however, favorably change the electrostatic potential around this putative calcium site for incoming IIa/TM complex, much as the calcium ion does in the wild-type structure. It remains to be shown if occupancy of a calcium ion in this site is required for physiological functioning of APC (He and Rezaie, 1999).

PC activation is also upwardly regulated by sodium ion binding (He and Rezaie, 1999). He and Rezaie (1999) identified a loop in APC similar to the sodium ion binding loop of thrombin, and concluded that this loop may be allosterically linked with the divalent cation binding loop of the protease. We find that the sodium ion binding region predicted by He and Rezaie (1999), residues 385-390(cn221-225), is strongly electronegative via electrostatic potential surface computations (not shown). This loop (residues 385-390) is remote from the putative calcium binding site and we find no residues simultaneously shared by both putative ion binding sites. We find, however, that the putative sodium ion binding site is near the scissile bond of PC activation (within 10 Å). Thus, sodium ion binding to this site could modulate the electrostatics around the scissile bond, thereby enhancing the interaction of the active site cleft of thrombin. Further computational studies are currently under way in our laboratory to elucidate the effect of this sodium ion on the SP domain structure.

Comparison of PC/APC with other known zymogen/active SP pairs

Calpha deviations for the solution zymogen model and x-ray crystal structure of APC for the SP domain are given in Fig. 13. The residues that had crystal contacts with neighboring APC molecules in the crystal configuration are marked in this figure along with the residues that had inhibitor contacts. Since the crystal contacts and inhibitor contacts are removed in our solution model structure of the zymogen, the residues that participated in contacts relax to yield somewhat larger Calpha deviations (x-ray crystal APC x-ray versus solution PC). On the other hand, for more than half (133) of the residues, including two of the catalytic triad residues, His-211(cn57) and Asp-257(cn102), in the SP domain have Calpha deviations <1 Å and only 48 residues (20% of the total in this domain) have values >2 Å. The broad peak that brackets residues 301-313(cn142-158) is likely due to removal of crystal contacts, as most of the residues become exposed to the solvent upon solvation. Other residues with large deviations participate in forming hydrogen bonds and salt bridges with the residues in the AP or the reconstructed N-terminal part of the catalytic domain, for example, hydrogen bonds and salt bridges between Asp-167-Asn-313(cn150), Asp-172(cn18)-Arg-314(cn151), and Lys-174(cn20)-His-303(cn144)). Residues 350-358(cn186-183) and 383-389(cn219-224) contribute to peaks in the Calpha deviations. These segments are linked by a disulfide bridge, are near the 301-315(cn142-152) loop, and bracket residues that have inhibitor contacts and crystal contacts in the x-ray crystal structure. The segment 385-390(cn221-225) is the putative sodium ion binding site and the absence of a sodium ion in the current simulation may contribute to some disorder. Almost all regions that show elevated Calpha deviations contain residues exposed to solvent. Thus, differential solvation apparently provides the most significant contribution to the Calpha deviations observed in the present simulation.




View larger version (43K):
[in this window]
[in a new window]
 
FIGURE 13   Deviations of the alpha -carbon atoms of the corresponding SP domain residues of PC in the zymogen form and the active form after the best alignment. PC numbering is used. Residues with crystal contacts (open squares) and inhibitor contacts (diamond) in the x-ray crystal configuration of the active form are marked. Filled circles indicate the presence of additional residues (in APC) if aligned with chymotrypsin.

The Calpha deviations between PC (modeled) and APC (x-ray) are compared (Fig. 14) with three other zymogen/activated SP protein pairs for which the structures exist: a) chymotrypsin (pdb entry 2cga)/chymotrypsinogen (pdb entry 4cha), b) prethrombin-2(pdb entry 1hag)/alpha -thrombin (pdb entry 1hah) (both active and zymogen forms are complexed with hirugen), c) bovine proproteinase E (pdb entry 1pyt)/porcine pancreatic elastase (pdb entry 1btu). The active site residues are already in their activated conformations in all of the activated systems, as is the case of PC. Once the SP domain residues of PC are aligned with the homologous residues of chymotrypsin, additional residues of PC that do not have corresponding residues in chymotrypsin are found. These inserted residues are marked in Fig. 13 (dark circles in five separate segments). In Fig. 14, only the first residue in each group was used to facilitate the comparison. Removal of the Ca deviations corresponding to these inserted residues from the plot does not change the general characteristics of the plot (compare Figs. 13 and 14). Thus, the inserted residues of PC do not appear to alter the structure common to these SP domains. The overall picture that emerges from these comparisons is that the deviations in the Ca values (PC versus APC) are similar to the three available SP pairs.




View larger version (34K):
[in this window]
[in a new window]
 
FIGURE 14   Comparison of the Calpha deviations of PC (active versus zymogen; shaded histogram) and (a) chymotrypsin/chymotrypsinogen, (b) alpha -thrombin/prethrombin-2, and (c) elastase/proproteinase systems (dark solid lines). The backbone atoms of residues 24-238(cn) are used to align the zymogen with its active form.

Once superimposed, the backbone positions of the catalytic triad residues of PC in the simulation structure are essentially indistinguishable from the x-ray crystal structure (RMSD of 0.5 Å) of APC. Comparison of the x-ray crystal structures of the other zymogen/active SP pairs shows the same fixed structure for catalytic triad residues. Unaltered catalytic triad backbone atom positions among the activated and zymogen serine proteases imply that during activation, the local environment around the preformed catalytic triad in the zymogen form of PC undergoes electrostatic and steric changes suitable for enzymatic function. Such local changes could provide the specificity of the enzymatic reaction to a particular serine protease.

In the systems for which x-ray crystal data are available, the loop cn186-194 has been proposed (Blow and Steitz, 1970) to play a key role in regulating enzyme activity. In the active state, the N-terminus of the SP domain turns inward after cleavage and interacts via a salt bridge with Asp-194(cn), a neighbor of the catalytic residue Ser-195(cn) (Blow and Steitz, 1970). A similar situation is observed for APC: Leu-170(cn16) contacts Asp-359(cn194), also through a salt bridge(Mather et al., 1996). The carboxylate group and the amido nitrogen of Asp-194 in chymotrypsin (Wang et al., 1985) and other SPs have an observable rotation (approximately 140°) around the Calpha -C bond of Asp-194(cn) between zymogen and active forms (see Fig. 15, a-c).




View larger version (30K):
[in this window]
[in a new window]
 
FIGURE 15   Rearrangements around the s1 specificity pockets after zymogen activation: zymogen (dark tracing) and active protease (light tracing). (a) Chymotrypsin/chymotrypsinogen. (b) Thrombin/prethrombin-2. (c) Elastase/proproteinase E. (d) APC/PC at t = 1500 ps. (e) APC/PC at t = 2900 ps.

We initiated the PC simulation at a conformation similar to the activated protein. Under normal simulation conditions, it is not unreasonable to expect that an extremely long trajectory calculation would be necessary to complete all of the conformational changes in PC that are similar in magnitude to the three experimental systems (zymogen/activated pairs). Currently available computational resources cannot handle such lengthy simulations. Ultimately, improvements in sampling techniques may speed this process. We observe a small rotation for the carboxylate group and the amido nitrogen of Asp-194 (about 27°) during the first 1500 ps of the simulation (Fig. 15 d). Increase in the solvent accessibilities of the neighboring residues to Asp-194(cn) occurs. The Calpha deviations at t = 1500 ps indicate that structural changes are underway and in a direction leading to steric hindrance of the substrate binding site. The rate of the Asp-194(cn) conformational change may be down-regulated by the Asp-194(cn) (side chain)/Cys-356(cn191) (backbone) H-bond in the activated form and/or by the water molecules H-bonded to the protein. The time scale of these refolding events may require a much more lengthy simulation before the penultimate conformational changes are observed. We subsequently forced this reverse transition (activated during the 1500- to 1600-ps segment of the trajectory) through step-wise constrained dynamics. All constraints were relaxed after 1600 ps. Such a forced change might have led to large positional fluctuations. The system, however, responded favorably to this implemented change, and no aberrations were observed in the following dynamics, e.g., no large fluctuations in RMSD or B-factors. Changes in the relative energies of the system before and after this transformation are indistinguishable from the noise level. The final configuration is given in Fig. 15 e. It corresponds well with the zymogen orientations of this fragment in the three experimental zymogen structures.

Because of the observed differences in substrate specificity, it is reasonable to expect dissimilarities near the specific binding pockets. For instance, Met-192 of chymotrypsin moves from a deeply buried position in the zymogen to the surface in the active form. In contrast, the SAA of Glu-357(cn192) decreases from 124 Å2 in zymogen PC to 88 Å2 in APC. All the neighboring residues in APC similarly have decreased SAAs. The region of the substrate binding site in chymotrypsin that accommodates aromatic and bulky nonpolar groups is not fully formed in its zymogen (see Fig. 15 a). Fig. 15 e displays that in the predicted zymogen form of PC, the loop comprising residues 186-194(cn) restricts the accessibility of the active site. This is consistent with the chymotrypsin/chymotrypsinogen case. The "oxyanion hole" at the catalytic serine side chain required to stabilize the proposed tetrahedral transition state in chymotrypsin is incomplete in chymotrypsinogen (Stryer, 1988). The same is true for the other two pairs as well as for our final solution model of PC. Analysis of the APC and chymotrypsin x-ray crystal structures yields similar structural arrangements around the catalytic serine.

Activation

Electrostatic potential surfaces can be useful in alignment of interacting regions for incoming molecules. Specifically, large electropositive or electronegative regions can reasonably attract oppositely charged entities through long-range electrostatic interactions. The analysis of the calculated electrostatic potential surfaces of the enzyme and substrate separately may guide the relative alignment of the substrate/enzyme complex. We have evaluated the electrostatic potential surface of the calcium-bound PC zymogen model (our substrate) using the popular code Grasp (Nicholls et al., 1991; results not shown). Electrostatic potential surfaces of the IIa/TM complex (pdb entry 1dx5) and IIa alone, after removing TM from the IIa/TM complex, were also calculated.

The picture that emerges for the formation of the catalytic complex from consideration of the structures and their electrostatic potential surfaces is as follows. The active site cleft of IIa is relatively electronegative. The concentration of acidic residues near the PC scissile bond naturally imposes a barrier for interaction with the active site of IIa. Due to the SP calcium ion, Glu residues in the loop become interior residues, while basic residues in this loop become exposed and partially neutralize the acidic residues around the scissile bond. Likewise, the basic residues in the fragment connecting the AP to the EGF2 domain similarly provide additional charge balancing for suitable IIa interaction. We find that, viewed externally via Grasp, the scissile bond region is actually positive. In fact, the comparison of the electrostatic potential surfaces of the IIa/TM complex with bare IIa after TM is removed, shows significant enhancement in the electronegativity around the IIa catalytic site due to TM. Since the binding of TM does not induce marked allosteric structural changes in the vicinity of IIa catalytic site (Fuentes-Prior et al., 2000