help button home button Biophys. J.
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS

Originally published as Biophys J. BioFAST on February 24, 2005.
doi:10.1529/biophysj.104.058446
This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow All Versions of this Article:
biophysj.104.058446v1
88/5/3133    most recent
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Lee, M. C.
Right arrow Articles by Duan, Y.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Lee, M. C.
Right arrow Articles by Duan, Y.
Biophysical Journal 88:3133-3146 (2005)
© 2005 The Biophysical Society

Large-Scale Conformational Dynamics of the HIV-1 Integrase Core Domain and Its Catalytic Loop Mutants

Matthew C. Lee *, Jinxia Deng {dagger}, James M. Briggs {dagger} {ddagger} and Yong Duan § ¶

* Department of Chemistry and Biochemistry, University of Delaware, Newark, Delaware 19716; {dagger} Department of Biology and Biochemistry and {ddagger} Department of Chemical Engineering, University of Houston, Houston, Texas 77204-5001; and § University of California, Davis, Genome Center and Department of Applied Science, University of California, Davis, California 95616

Correspondence: Address reprint requests to Yong Duan, Tel.: 530-754-7632; Fax 530-754-9658; E-mail: duan{at}ucdavis.edu.


    ABSTRACT
 TOP
 ABSTRACT
 INTRODUCTION
 THEORETICAL METHODS
 RESULTS AND DISCUSSION
 CONCLUSIONS
 ACKNOWLEDGEMENTS
 REFERENCES
 
HIV-1 integrase is one of the three essential enzymes required for viral replication and has great potential as a novel target for anti-HIV drugs. Although tremendous efforts have been devoted to understanding this protein, the conformation of the catalytic core domain around the active site, particularly the catalytic loop overhanging the active site, is still not well characterized by experimental methods due to its high degree of flexibility. Recent studies have suggested that this conformational dynamics is directly correlated with enzymatic activity, but the details of this dynamics is not known. In this study, we conducted a series of extended-time molecular dynamics simulations and locally enhanced sampling simulations of the wild-type and three loop hinge mutants to investigate the conformational dynamics of the core domain. A combined total of >480 ns of simulation data was collected which allowed us to study the conformational changes that were not possible to observe in the previously reported short-time molecular dynamics simulations. Among the main findings are a major conformational change (>20 Å) in the catalytic loop, which revealed a gatinglike dynamics, and a transient intraloop structure, which provided a rationale for the mutational effects of several residues on the loop including Q148, P145, and Y143. Further, clustering analyses have identified seven major conformational states of the wild-type catalytic loop. Their implications for catalytic function and ligand interaction are discussed. The findings reported here provide a detailed view of the active site conformational dynamics and should be useful for structure-based inhibitor design for integrase.


    INTRODUCTION
 TOP
 ABSTRACT
 INTRODUCTION
 THEORETICAL METHODS
 RESULTS AND DISCUSSION
 CONCLUSIONS
 ACKNOWLEDGEMENTS
 REFERENCES
 
The human immunodeficiency virus type-1 integrase (IN) is a key enzyme in the replication cycle of the HIV-1 virus (Asante-Appiah and Skalka, 1997Go). It is responsible for inserting the retroviral cDNA into the host genome, a required step for efficient viral replication. Because of its pivotal role in the replication cycle, IN has gained much attention as a novel target for drug development in recent years (Bushman, 1995Go; Chen et al., 2002aGo,bGo; Pani et al., 2002Go; Pluymers et al., 2001Go; Sayasith et al., 2001Go). However, drug discovery programs targeting IN have not been very successful. High-throughput screening programs have so far only identified one series of active compounds (Dayam and Neamati, 2003Go) and structure-based methodologies have been hampered by the lack of complete and detailed structural data. The difficulty in obtaining structural information of IN stems from the low solubility of the full-length IN, which presents a major challenge for x-ray and NMR analysis.

Despite the lack of detailed structural information, the basic outline of the IN's structural organization has been known for some time. Biochemical studies (Engelman and Craigie, 1992Go; Johnson et al., 1986Go) have revealed that IN is organized into three domains: the N-terminal domain, the core domain, and the C-terminal domain. Mutagenesis studies have located the catalytic site to the core domain and identified a conserved three-residue motif (D,D35-E) common in many other retroviral INs and transposases (Engelman and Craigie, 1992Go; Kulkosky et al., 1992Go). Thus, the core domain has been the focal point of the structure-function investigations.

Based on functional analogy, a possible catalytic mechanism patterned after that of the Escherichia coli DNA polymerase I has been proposed (Beese and Steitz, 1991Go). However, crystallographic analysis of the core-domain has shown that the three-dimensional structure around the active site has a high degree of flexibility (Bujacz et al., 1996Go; Dyda et al., 1994Go; Greenwald et al., 1999Go), and the true active conformation of IN is still not completely determined. Without more detailed structural information about the structural and dynamical properties of the active site, the structural mechanism of the IN remains unsolved. One of our long-term objectives is to understand the structural mechanism through theoretical modeling. To help better define the scope of this study, we briefly summarize the relevant experimental and theoretical findings of the IN to date.

As a DNA manipulating enzyme, IN catalyzes two major reactions during the integration process: 3'-processing and strand transfer. In the first step, known as 3'-processing, just after the viral RNA genome is reverse transcribed into double-stranded linear cDNA (Brown et al., 1987Go; Lobel et al., 1989Go) by reverse transcriptase, IN recognizes two conserved nucleotides (5'-CA-3') and removes two terminal nucleotides (5'-GT-3') 3', exposing the 3'-hydroxyl group of A for use in nucleophilic attack in the next step of catalysis (Gallay et al., 1995Go). After 3'-processing, IN stays bound to the viral DNA and forms a preintegration complex. In the second reaction, known as strand transfer or joining, IN catalyzes a concerted insertion of the viral DNA into the host genome using the previously exposed 3'-hydroxyl groups of the viral DNA. A 5-bp gap between the insertion points results in a 5-bp duplicate sequence flanking each side of the inserted provirus.

Because these two steps utilize the same active site and yet involve different substrates, it has been postulated that the active site conformation of IN may be different for each step. This view is supported by a recent study in which selective inhibition of the strand transfer reaction but not the 3'-processing reaction was observed (Espeseth et al., 2000Go). Further evidence of the conformational flexibility arose from the crystallographic analysis of the core-domain, which showed that the three-dimensional structure around the active site has a high degree of flexibility (Bujacz et al., 1996Go; Dyda et al., 1994Go; Greenwald et al., 1999Go).

In the available crystal structures of IN, the core domain consists of an {alpha}/ß-fold with five ß-strands at the center, sandwiched by six {alpha}-helices (Fig. 1). On each side of the ß-core is a surface loop. The first loop (139-152) is located near the conserved catalytic triad containing several conserved residues. We refer to it here as the catalytic loop. Three conserved acidic residues (D64, D116, E152) form a catalytic triad motif commonly found in other retroviral INs and transposases (Engelman and Craigie, 1992Go; Kulkosky et al., 1992Go) right below the catalytic loop (see Table 2 for schematics). Because of its proximity to the active site, we are most interested in the dynamics of this loop.



View larger version (61K):
[in this window]
[in a new window]
 
FIGURE 1  Schematics of the IN core domain. Backbone of the loop is colored in yellow (residues 141–148), the two hinge residues are colored in blue; the green sphere is the Mg2+; the three conserved catalytic residues are highlighted in red; the two prolines in the loop are also highlighted in stick representation. The figure was prepared using the PyMol software (DeLano, 2002Go).

 

View this table:
[in this window]
[in a new window]
 
TABLE 2  Number of snapshots and relative populations identified from the 40-ns wild-type trajectory

 
In a recent mutagenesis study on the core domain by Greenwald et al. (1999)Go, the flexibility of the catalytic loop overhanging the active site was found to correlate with enzymatic activity. In that experiment, the glycine residues at each end of the catalytic loop (G140 and G149) were mutated into alanines. The prevalent effect of the mutations is to decrease the loop flexibility as shown by crystallographic analysis that demonstrated the reduced temperature factors of the mutants in the loop region. Because of the reduced flexibility, G149A and G140A mutants exhibited ninefold and 18-fold reduction in activity, respectively, and the G140A/G149A double mutant was virtually inactive. Thus, it is now clear that the flexibility of the loop is essential for IN's enzymatic activity.

Yet, despite accumulation of several crystallographic structures of the core domain, this important catalytic loop has often been found to be either completely disordered or poorly ordered in crystals, and its structure and dynamics could have been influenced by the neighboring units through crystal contacts. Much less is known about the flexibility of the loop. What are the major modes of its native motion and how do the loop hinge mutations affect the native modes of dynamics? What are the conformations accessible to the loops in solution? How does the energy landscape look in the vicinity of the preferred (native) conformation? Answers to these questions are central to our understanding of IN's function. However, these questions cannot be easily derived from the crystal structures. It is also interesting to understand the crystal packing effect on the dynamics of exposed loops.

In this study, we conducted an extensive set of molecular dynamics (MD) simulations to explore the conformational space around the crystallographic native state of IN's core domain. To circumvent the limited conformational sampling ability of molecular dynamics simulations at room temperature, we chose a three-stage simulation strategy. In the first stage, we used multiple-trajectory short-time simulations. By combining the sampling ability of the multiple trajectories, we expect to sample more conformational space than single trajectory of the same length. In the second stage, we selected the most interesting trajectory in which the loop reached positions closest to the active site from the ensemble of the multiple trajectories and extended it to a much longer timescale to probe more fully the time-dependent aspect of the dynamics.

By using this two-tiered approach, we successfully followed the loop from its open state to a relatively stable closed state. The closed conformation was observed after ~8 ns of simulation in one trajectory and took another ~2 ns to become completely closed. Once closed, the loop remained in that state for nearly 30 ns. To further enhance the sampling around the closed conformation, we applied the locally enhanced sampling (LES; Elber and Karplus, 1990Go) method to the closed state conformation. The enhanced sampling allowed us to observe the reopening event of the loop within ~4 ns of LES simulation time.

These simulations were further complemented by the simulations of the three loop hinge mutants (G140A, G149A, and G140A/G149A). Our results indicated that this large-scale loop motion was not observed in the mutant structures within the simulation timescale. Comparison of the major conformational states sampled by the three mutants to that sampled by the wild-type showed that the differences are mainly concentrated in the catalytic loop region. Because the gating motion was significantly hampered (completely eliminated in the case of the double mutant), we believe that this conformational dynamics is functionally relevant and is likely to play a role in catalysis.

In the following sections, we present the details and results of our simulation and discuss the implications of our findings in the context of IN structure-dynamics-function relationship and IN specific inhibitor design.


    THEORETICAL METHODS
 TOP
 ABSTRACT
 INTRODUCTION
 THEORETICAL METHODS
 RESULTS AND DISCUSSION
 CONCLUSIONS
 ACKNOWLEDGEMENTS
 REFERENCES
 
Model building
Four molecular systems were constructed, the wild-type IN, the G140A mutant, the G149A mutant, and the G149A/G140A double mutant. The wild-type model was constructed from chain B of the crystal structure 1QS4. This structure contains one Mg2+ ion coordinated by D64 and D116, but has two unresolved residues (I141 and P142) in the catalytic loop. Although, two metal ions are thought to be present for the catalysis (the second one is thought to be between E152 and D64), we chose to include single Mg2+ in the simulations to probe the dynamics of the core domain in its apo-state when only one metal ion is bound. The missing coordinates were homology modeled from chain B of 1BIS, which contains these two missing residues, using the SwissPDB software (Guex and Peitsch, 1997Go). The completed models were then subjected to 200 cycles of steepest descent energy minimization using the GROMOS force field available in the SwissPDB software, allowing only residues 140–143 to move while holding the rest of the protein fixed. The tleap module in AMBER7 (Case et al., 2002Go) was used to prepare the input files for simulations. The Duan et al. (2003)Go force field (also known as ff03) was selected to represent the molecular mechanical potentials. Our experience has shown that this force field has a reasonable balance between the {alpha}-helix and ß-strand conformations and therefore is more appropriate for extended-time simulations of proteins whose structure contains both {alpha}- and ß-structural elements. Hydrogens were added with the tleap program as well, and protonation states of the ionizable side chains were assigned according to values previously predicted (Lins et al., 1999Go) by the UHBD (Madura et al., 1995Go) program. (The neutral H114 was protonated at the {varepsilon}-position, whereas the rest of the ionizable side chains were kept at their standard protonation states.) The completed model was then solvated with TIP3P water (Jorgensen et al., 1983Go) in a box measuring 73 x 77 x 60 Å3. The system was neutralized with two Cl ions. Both mutations introduced in the 1QS4 structure to aid in crystallization (F185K and W131E) were changed back to their wild-type identities since we are interested in the dynamics of the domain in its native form. The parameters of Cl and Mg2+ ions were taken from the standard AMBER database.

The three mutant models were constructed by replacing the appropriate residues in the above wild-type model with the mutant residues using the SwissPDB software. These mutated structures were then energy minimized, solvated, and neutralized as outlined above.

Explicit solvent molecular dynamics simulations
The initial solvated structures were first subjected to 200 steps steepest descent energy minimization, whereas the solute atoms, including both the protein and the Mg2+ ion, were restrained by a harmonic potential with a force constant of 100.0 kcal/mol/Å2. After the initial solvent minimization, the entire system was minimized using 200 steps of steepest descent minimization without harmonic restraints.

The minimized structures were then subjected to an equilibration protocol in which the temperature of the systems was gradually raised from 100 K to 300 K over a 10-ps period while holding both the volume and temperature constant, followed by another 10-ps of solvent density adjustment at 300 K by holding the temperature and pressure constant while allowing the volume to change. The initial velocities were assigned randomly from a Maxwellian distribution at 100 K. At the end of the equilibration, the average temperature of the final 5 ps was ~300 K, and the average density was ~1.0 g/ml. Long range electrostatic interactions were treated with the particle mesh Ewald (Darden et al., 1993Go) method. Periodic boundary conditions were applied via both nearest image and the discrete Fourier transform implemented as part of the particle mesh Ewald method. All bonds involving hydrogen atoms were restrained using the SHAKE (Ryckaert et al., 1977Go) algorithm, allowing larger time steps (2 fs) to be taken. Global translation and rotation of the system (solvent and solute) was removed every 100 integration steps during the simulation.

The initial 20-ps stage was designed to equilibrate those particles that were added during the initial model-building process, including water molecules and hydrogen atoms, and to allow the systems to be solvated adequately. It was not intended to bring the system into "thermodynamic equilibrium". Therefore, the initial 20-ps trajectories were discarded and were followed by the production stage in which both pressure (1.0 ATM) and temperature (300 K) were held constant by Berendsen's coupling scheme. This allowed us to monitor the conformational transitions from the early stages. On the other hand, the first 5 ns of trajectories were excluded from more quantitative analyses, such as B-factor calculation, population, and distribution, to allow adequate equilibration of the system. In the latter cases, the effective equilibration phase was more than 5 ns in each trajectory.

A set of 44 independent simulations (11 for each model system) was conducted using the same simulation protocol. These multiple trajectories allow us to study events that are one order of magnitude slower than the simulation time of the individual trajectories. In this case, the 11-trajectory sets allowed us to study the events of ~100-ns timescale. The differences among the trajectories for each system were the initial velocities, which were assigned by choosing different random number seeds. These trajectories sampled different regions in the phase space and conformational space, as clearly shown in root mean-square deviation (RMSD) plots (Fig. 2). These multiple trajectories were intended to examine the consistency of the observations and to maximize the sampling ability with limited simulation timescales.



View larger version (63K):
[in this window]
[in a new window]
 
FIGURE 2  RMSD versus time. The horizontal axes represent RMSD values in Å, and the vertical axes represent time in nanoseconds. The RMSDs were calculated using C{alpha} atoms only and fitted to the initial structure of each simulation. The black lines represent the RMSD of the whole protein, and the shaded lines represent the RMSD without the loop region (residue 140–149, 186–196).

 
Explicit solvent molecular dynamics simulations with LES
In this set of simulations, we took one structure from each model as our starting structures for LES simulations. The wild-type structure was taken from the final snapshot of the 40-ns extended MD simulation. Because none of the mutant loops were closed, we arbitrarily took the final structures of the last run (run 11) as starting points. We then divided the loop into three regions (140-143, 144-146, and 147-149) and replaced each segment with five duplicate copies by using the addles module in AMBER7. All copies belonged to the same region and had the same initial conformation as that of the template structure but were given different initial velocities to permit divergence. Four nanoseconds of data were collected for each model.

Conformation clustering
A heuristic clustering approach was used to characterize the snapshots of the MD simulations based on the C{alpha}-RMSD of the loop region. In this method, a snapshot may become a member of its closest cluster if the C{alpha}-RMSD is smaller than a given cutoff (1.5 Å), otherwise a new cluster is created. The C{alpha}-RMSD was calculated after rigid body alignment of the loop and the two bracing secondary elements (the ß-strand N-terminal to the loop and the {alpha}-helix C-terminal to the loop) with respect to the average structure of the cluster. This method is semilinear and is rather efficient for clustering large data sets. However, it is a heuristic method and is inherently approximate. The error margin is related to the cutoff used in the clustering; clusters generated using a small cutoff can be highly accurate. To further enhance the accuracy, the clusters were filtered by removing structures that were far away from the average of the cluster as measured by the RMSD. The removed snapshots were then compared to the existing clusters.


    RESULTS AND DISCUSSION
 TOP
 ABSTRACT
 INTRODUCTION
 THEORETICAL METHODS
 RESULTS AND DISCUSSION
 CONCLUSIONS
 ACKNOWLEDGEMENTS
 REFERENCES
 
Multiple trajectory (10-ns) results
Fig. 2 shows the main-chain RMSD of the protein core, excluding the loop regions, and of the whole protein for all 44 trajectories. The core RMSD averaged over individual trajectories ranged from 0.89 Å to 1.27 Å with an overall 1.05 ± 0.09 Å when averaged over all 44 trajectories. The RMSD indicated that the overall structures were well maintained; the protein core was quite stable throughout the simulations. This high stability is clearly demonstrated by the B-factors calculated from the simulations (discussed later). On the other hand, the loops showed notably higher degrees of fluctuations, as exemplified by the overall RMSD of the whole protein. When the entire protein was considered, the average main-chain RMSD ranged from 5.00 Å to 2.56 Å for an overall average of 3.27 ± 0.41 Å. The large RMSD indicates that loops underwent conformational changes.

Crystal structures of the wild-type (1B2D), G149A mutant (1B92), and G149A/G140A double mutant (1B9F) were solved by Greenwald et al. (1999)Go. The structure of the single mutant G140A has yet to be solved due to poor crystallization. In Fig. 3 we compared the crystallographic B-factors to those calculated from our MD trajectories by where is the mean-square atomic fluctuation averaged over the last 5 ns of the 11 simulations on each of the wild-type and mutants. The correlation between the experimental values and the MD calculated values, excluding the loop regions, were in the range of 0.7 to 0.8.



View larger version (26K):
[in this window]
[in a new window]
 
FIGURE 3  Calculated B-factors from the MD simulations and the B-factors obtained from the indicated Protein Data Bank file. Horizontal axes are residue indices, and vertical axes are B-factors in Å2.

 
One notable difference is found in the interface loop region where the crystallographic B-factors in all of the crystal structures are generally lower than the MD B-factors. This is mainly due to crystal packing effects. In the case of 1QS4, the interface loop is packed against another subunit. In the case of 1B9D and 1B9f, the loops are packed against structures from neighboring unit cells due to crystallographic symmetry. By comparison, the MD simulations mimicked the protein in solution phase; there were no protein neighbors to form stabilizing interactions with the interface loop, hence the interface loop was allowed a greater degree of freedom.

Although the B-factor offers a convenient yardstick by which one can compare experimental and simulation results, one should note that the crystallographic B-factor measures both the thermal fluctuation (including conformational heterogeneity of the proteins) and global translation and rotation. In our calculated B-factors, however, the global translation/rotation has been removed by rigid-body alignment. Therefore, one might expect that the calculated B-factors should be smaller than the experimental ones. However, one should also consider other factors, such as crystal packing, which can play a role and can reduce the flexibility of the parts involved in the crystal contacts. Notwithstanding these differences, comparison with crystallographic B-factors offers a qualitative assessment of the simulations.

To summarize the nearly 40,000 snapshots, we performed a clustering analysis using the heuristic algorithm outlined in the Methods section. Note that in this analysis, only the conformational states of the catalytic loop were clustered to highlight the conformational transitions of the loop observed in the simulations. The clustering was based on the C{alpha}-RMSD computed by aligning the loop and the two bracing secondary elements, ß-5 (residues 135–149) and {alpha}-4 (residues 150–160), from each end of the loop.

Figs. 4–7GoGoGo show the representative structures of the clusters identified. As shown in the figures, the majority of the loop conformations are distributed in various types of open conformations. Among the clusters generated from the wild-type simulations (Fig. 4), clusters 6 and 8 are in the closed form. The main difference between these two clusters is at the P142/P145. The loop conformational change in cluster 8 is mainly localized on residue P142, whereas in cluster 6, both P142 and P145 have changed position.



View larger version (55K):
[in this window]
[in a new window]
 
FIGURE 4  Representative structures from the 11 multiple-trajectory MD simulations of the wild-type. The percentage values indicate the percentage of snapshots belonging to each cluster.

 


View larger version (39K):
[in this window]
[in a new window]
 
FIGURE 5  Representative structures from the 11 multiple-trajectory MD simulations of the G140A mutant. The percentage values indicate the percentage of snapshots belonging to each cluster.

 


View larger version (33K):
[in this window]
[in a new window]
 
FIGURE 6  Representative structures from the 11 multiple-trajectory MD simulations of the G149A mutant. The percentage values indicate the percentage of snapshots belonging to each cluster.

 


View larger version (50K):
[in this window]
[in a new window]
 
FIGURE 7  Representative structures from the 11 multiple-trajectory MD simulations of the G140A/G149A double mutant. The percentage values indicate the percentage of snapshots belonging to each cluster.

 
The closed conformation makes up ~3% of all snapshots (Fig. 4, clusters 6 and 8). Given a total of 110 ns simulations, we would estimate a time constant of ~3.6 µs, assuming a two-state kinetics. Thus, the event is a relatively slow process. On the other hand, the small fraction of closed conformation observed in the LES simulation (discussed later) suggests that the loop favors the open conformation. This is in agreement with the currently available crystallographic structures whose resolved portion of the loop is mostly found in the open conformation.

Another interesting conformation is cluster 7 shown in Fig. 4 in which the loop is bent backward with Y143 pointing away from the active site, making contact with the hydrophobic patch consisting of residues I60, V79, and A80. This provides a possible explanation for the reduced activity of Y143L mutation (Table 1). A possible scenario is that the L143 of the mutant makes stronger contact with the hydrophobic patch and stabilizes the loop into the open conformation.


View this table:
[in this window]
[in a new window]
 
TABLE 1  Catalytic activity of various IN core domain single-point mutants found in the literature.

 
Extended-time MD results
To explore the loop motion in greater detail, we extended the wild-type trajectory that showed the closed conformation to ~40 ns (including the initial 10 ns) and closely monitored the conformational state of the loop. Fig. 8 a shows the time course of C{alpha}-RMSD of this extended wild-type simulation. To facilitate tracking the position of the loop, we defined two planes (one for the loop, one for the active site) and monitored the angle and distance between the two planes as a measure of the gating motion. The loop plane is made up of the two hinge-C{alpha} atoms and the Y143-C{alpha} atom; the active site plane is made up of the two hinge-C{alpha} atoms and the Mg2+ atom. Fig. 8 b shows the gate behavior as a function of time.



View larger version (18K):
[in this window]
[in a new window]
 
FIGURE 8  (a) RMSD versus time plot of the 40-ns wild-type simulation. The black line represents the C{alpha}-RMSD including the loops, and the shaded line represents the C{alpha}-RMSD excluding the loops. (b) Loop position versus time. The shaded line represents the angle of opening of the gate, and the black line represents the distance between the tip of the loop to the center of the catalytic site.

 
Starting from the initial open conformation, the loop began to dip forward toward the catalytic triad at ~8 ns. By 10 ns, the loop was completely bent over and assumed a closed conformation. It remained in the closed conformation for another 10 ns and then opened slightly for a brief moment at ~20 ns before closing again for another 20 ns. We stopped the simulation after seeing that the loop attempted to open up (see Fig. 4, cluster 5) for a second time at ~40 ns when the loop appeared to move upward but did not return to the completely open position found in the initial structure.

To characterize the snapshots of this long simulation, we also performed a clustering analysis using all snapshots from this trajectory. Four major clusters were found in this trajectory, two in the closed state and two in the open state. Fig. 9 shows the backbones of these four states superposed on one another and the position of Y143 in each structure.



View larger version (40K):
[in this window]
[in a new window]
 
FIGURE 9  Wall-eyed stereogram of the four representative conformations identified from the 40-ns wild-type simulation. The catalytic loop is color coded to correspond to the color coding scheme of Table 2 (maroon, cluster 1; blue, cluster 2; red, cluster 3; and pink, cluster 4). The three conserved catalytic residues are colored by atom (C, green; O, red; and H, white). The Mg2+ ion is shown in sphere representation.

 
In extending the simulation, our goal was to gain a more complete view of the loop dynamics and to gain insights into the perturbations due to the mutations. To facilitate our analysis of the motion, we applied essential dynamics (ED) analysis. ED analysis (Amadei et al., 1993Go) is a motion analysis technique that was developed specifically to reduce the complexity of the data to aid in the extraction of meaningful insights. In this analysis, the principal components of the protein's motion are extracted from the variance-covariance matrices where xi and xj are, respectively, the ith and jth dimension of the coordinates from which the fast modes of local motions and thermal fluctuations are filtered out to reveal the slower, correlated modes of motions that are more likely to be relevant to biological function.

In our analysis, we selected the C{alpha} atoms to describe the backbone conformations, and the variance-covariance matrices were calculated by averaging over the simulations. The results of the ED analysis are the collection of eigenvectors that represent the correlated modes of motion and their associated eigenvalues that specify the amplitudes of the motion. Earlier studies have shown that, when reduced to the essential space, the first few eigenvectors are sufficient to describe most of a protein's correlated motions (de Groot et al., 1996Go). Herein we present only the first essential modes for the extended-time simulations. For comparison, we also show the representative ED modes for the three mutants.

In Fig. 10, we show the maximum (red) and minimum (yellow) projected structures of the first essential mode for each of the four systems. They were superimposed to highlight the extrema of each essential motion. In the wild-type simulation, the primary mode of motion is the opening and closing of the catalytic loop. The red and yellow spheres in the middle of the loop represent Y143 at the minimum and maximum positions, respectively. This residue occupies the tip of the loop and has been implicated in catalysis (Beese and Steitz, 1991Go; Chen et al., 2000Go; Esposito and Craigie, 1998Go; van Gent et al., 1993Go). In the "loop closing" mode, the Y143 was brought closer to the other three catalytic residues (also depicted in spheres of red and yellow) to form a four-point active site configuration. The distance between the C{alpha} atoms of Y143 in the two projected structures is 13.8 Å. This large-scale motion has never been observed in previous MD simulations largely due to the smaller timescales sampled in those simulations, and hence represents a new possible active site conformation. The blue and magenta spheres represent the positions of the two loop hinges, and the bottom two green spheres represent the positions of T66 and S119. We show these two residues as anchoring points around the active site to facilitate the visual comparison.



View larger version (57K):
[in this window]
[in a new window]
 
FIGURE 10  First essential modes from ED analysis. The figures were generated by projecting the entire molecular dynamics trajectory onto the first eigenvector of each respective trajectory. The minimum structure is colored in yellow, and the maximum is red. The motions of each of the shown essential modes are bounded by these two extrema. The red and yellow spheres in the middle of the loop show Y143 at the minimum and maximum positions, respectively. The blue and magenta spheres represent the positions of the two loop hinges. The two green spheres are T66 and S119. The three catalytic residues are shown in red and yellow spheres.

 
In comparison to the wild-type, the catalytic loop in the three mutant systems showed significant reduction in mobility. The G149A single mutant still retained some residual wild-type loop motion with a 7.07 Å maximum separation between the open and closed conformations (C{alpha} of P145), but the catalytic loop in the G140A mutant was virtually stationary and the double mutant showed a slight backward bending motion that formed a more open active site configuration.

The geometry of the IN catalytic loop is asymmetric and contains one proline residue near each end of the loop (P142 and P145). Because proline side chains are rigid in nature, these two residues are responsible for much of the ordered internal structure of the loop. In the open conformation of the loop, the C{gamma} atom of P142 points toward the active site, whereas the C{gamma} atom of P145 points away from the active site. In the completely closed conformation, the orientations of these two residues become reversed. As in most ß-{alpha} types of loops (Oliva et al., 1997Go), the region near the C-terminal end, around P145, has a more helical character and the region near the N-terminal end, around P142, has more of an extended and ß-character. During the simulations, residues P145, Q146, and S147 form a transient 310-helix ~20% of the time in the wild-type, 50% in G149A, 37% in G40A, and 12% in the G140A/G149A double mutant. Available mutagenesis data show that P145 and Q148 are particularly sensitive to mutations, suggesting that the transient 310-helical region plays a role in IN's function. Because of the extensive hydrogen-bonding interactions involved, we hypothesize that the hydrogen-bonding network in this region might contribute to the loop motion.

The role of Y143 in catalysis has received some attention recently. Based on the proposed structural arrangement of the active site of E. coli polymerase I (Beese and Steitz, 1991Go), it has been suggested that the role of Y143 may be to similarly stabilize the activated water molecule. Mutagenesis studies have shown that mutations at this position shift the preference of nucleophile during the 3'-processing reaction from water to alcohol (van Gent et al., 1992Go, 1993Go; Vink et al., 1991Go), which appears to support this hypothesis.

In our simulations, we observed that the wild-type Y143 side chain has a high degree of mobility as shown in Fig. 11. Interestingly, in the three mutant simulations, the orientations of Y143 are predominately pointing toward the active site. This change of orientation was previously interpreted as evidence that Y143 plays a significant role in catalysis, and the inward-pointing conformation of Y143 was generally assumed to be the active conformation. Here our simulation suggests that the loop flexibility is necessary to position Y143 such that it is in close proximity to the substrate DNA when the loop is in the closed form. Thus, the function of Y143 is closely linked to the dynamics of the loop.



View larger version (42K):
[in this window]
[in a new window]
 
FIGURE 11  Y143 {chi}-angle distribution. Positive {chi}1 corresponds to the phenol group of Y143 pointing away from the active site, and positive {chi}2 corresponds to the hydrogen of the hydroxyl group on the phenol ring pointing away from the active site.

 
Closed to open transition by LES
The limited simulation time, although already considerably long in comparison to the typical simulations on systems of similar size, precluded the possibility of the observation of reversible open-close events. This could largely be due to kinetically trapped closed or open conformations. In the extended-time simulation, once the loop entered into the closed conformation, it never completely opened up again throughout the 40-ns trajectory. In other 10-ns simulations, despite the large degree of conformational flexibility in the loop regions, reversible conformational changes were rare because of the timescale. One interesting question was the following: can the loop in the completely closed conformation open up again? This question can be more effectively addressed by advanced sampling techniques such as LES (Elber and Karplus, 1990Go; Roitberg and Elber, 1991Go) without resorting to exceedingly long conventional MD simulations. The LES method is based on a mean-field theory in which enhanced sampling is achieved through making multiple copies of parts of the protein. As a result of enhanced sampling, the barriers separating local minima in the LES calculation are lower, which enhances the probability of barrier-crossing events. In these simulations, the multiple copies are allowed to diverge in the simulations by assigning different initial velocities.

The starting structures of the LES simulations were taken from the final structures from either the extended simulations or from one of the trajectories. These structures were already well equilibrated in terms of side-chain orientation and solvent environment because they were subjected to at least 10 ns of MD simulation. We observed that the divergence of the copies in all four systems leveled off after ~1 ns of simulation, and in the case of the double mutant, there was even a slight decrease (data not shown). In the wild-type LES simulations, we observed that the loop returned to its open conformation after 1.5 ns and remained open for the remainder of the simulation. In the three mutants, the LES simulation also resulted in the open form of the loop. In short, the LES simulations suggest that the open conformation is the preferred state of the catalytic loop. This is in agreement with the experimental observations that the loops are in the open state. Nevertheless, our simulations, starting from the open conformation, were able to sample the closed conformation.

The mobility observed in the simulations highlights the need to take into account protein dynamics to design effective inhibitors. In the case of IN, the conformational flexibility is intimately linked to the active site conformation. The high degree of flexibility implies plasticity of the active site and the ability to accommodate changes (e.g., mutation). It also implies that the active site may undergo considerable conformational changes upon binding to ligands. If so, one may wish to consider the correlated motions between ligands and IN to design effective inhibitors. In the closed conformation, the loop actually forms a canopy overhanging the active site, forming a deeper pocket than the open form. We speculate this conformation may be used to identify ligands that may stabilize the closed conformation of IN. The benefits of such a ligand are twofold. In addition to the obvious benefit that it may be a lead compound for further inhibitor design, it may also serve as an experimental tool to investigate the dynamics of the loop.

In light of the high degree of conformational variability exhibited by the loop, one may conclude that the functionally important catalytic loop is much less ordered than the rest of the protein. One may also anticipate that binding to the substrate can reduce the mobility and make the loop more ordered. Because of close proximity, the active site significantly changed its shape when the loop moved from the open form to the closed form (Fig. 12). Clearly, for efficacy of drug design, these structural variations should be explored.



View larger version (98K):
[in this window]
[in a new window]
 
FIGURE 12  Molecular surface representation of three representative structures from the wild-type simulations and the crystal structure 1QS4, chain A, with the inhibitor 5CITEP superimposed in the active site using the orientation and position found in the crystal structure. The blue patches represent residues K156, K159 known to interact with DNA; the red patches represent the three conserved catalytic residues; the yellow patches represent the catalytic loop; and the green patches represent the newly identified alternative binding trench for 5CITEP from docking studies by Schames et al. (2004)Go. Some residues belong to both the loop region and the trench region (F139, G140, P142, and Q148).

 
We superimposed 5CITEP onto the binding site of the wild-type open and closed conformations obtained from clustering analysis. The location of 5CITEP also overlaps in between the E152 and D64 where the second metal ion is supposed to coordinate during catalysis. This suggests a possible mechanism for 5CITEP's inhibitory action. Note that the green patch actually makes up the underside of the loop and becomes buried in the closed conformation. In the open form, this "trench" is fully exposed and forms a very well-defined shape for binding of ligands, but in the closed form, the trench is completely buried. These conformations may offer a better platform for further structure-based inhibitor design efforts. We speculate that a bidentate ligand fitting into both binding sites may be a stronger inhibitor that can bind across the grooves formed by residues C65, T66, H67, Q92, Q148, E152, K156, and K159. An example of such a ligand is the L-chicoric acid that has two rigid wings joined by a common flexible center. This may provide an explanation of why the L-chicoric acid is so far the most potent IN inhibitor.

Inasmuch as the active form of IN in vivo requires at least a dimer, the conformational change involved may not be limited to the secondary or tertiary structural levels; it may likely extend to the quaternary structural level as indicated by the dynamic nature of the interface loop (data not shown). But because the full-length structure of the IN is still unavailable and the exact functional multimeric state of IN has not been unambiguously determined, our research focus is on understanding the structure and dynamics of the core domain.


    CONCLUSIONS
 TOP
 ABSTRACT
 INTRODUCTION
 THEORETICAL METHODS
 RESULTS AND DISCUSSION
 CONCLUSIONS
 ACKNOWLEDGEMENTS
 REFERENCES
 
The IN is an interesting enzyme both because it is clinically important and because it is a member of the transposase/IN family of enzymes that are believed to be the main driving forces behind evolution. In this study, we focused on understanding the structure-function-dynamics relationships of the catalytic core domain, paying special attention to the catalytic loop. Starting from the experimental 5CITEP-bound crystal structure, our simulations demonstrated large-scale conformational changes of the catalytic loop. Analysis of protein loop conformations is a notoriously difficult task because they do not have easily identifiable regular geometric patterns. Until recently, analysis and classification of loops were done manually, and languages used to describe these structures remained mostly qualitative (Espadaler et al., 2004Go; Oliva et al., 1997Go, 1998Go; Turcotte et al., 2001Go). In our work, we have attempted a systematic characterization of the loop dynamics in an effort to further our understanding of this important enzyme. We summarize the lessons learned as follows:

The wild-type catalytic loop has a slow mode of motion that closes and opens the space around the active site, and the transient formation of a 310-helical structure by the residues around P145 appears to be a major influence in the dynamics of the loop. Because the hinge mutations sterically hinder the loop from closing, the associated loss of activity strongly suggests that this closing-opening conformational change is functionally important. The seven conformations identified by clustering analysis provide a starting point for further work on docking and virtual screening studies. The dynamics of Y143 has been further clarified in our simulations and is closely linked to the loop conformation. Judging from the high flexibility rendered by the catalytic loop, one may speculate that the role of the loop is to provide the mobility to allow Y143 to access the substrate easily and to allow easy release of the products.


    ACKNOWLEDGEMENTS
 TOP
 ABSTRACT
 INTRODUCTION
 THEORETICAL METHODS
 RESULTS AND DISCUSSION
 CONCLUSIONS
 ACKNOWLEDGEMENTS
 REFERENCES
 
We gratefully acknowledge the usage of VMD, Pymol, and SwissPDB Viewer. Computer time was provided by the Pittsburgh Supercomputer Center.

This work was supported by research grants from the National Institutes of Health (GM64458 and GM067168 to Y.D., GM56553 to J.M.B.) and the Robert A. Welch Foundation (J.M.B.).


    FOOTNOTES
 
Matthew C. Lee's present address is U.S. Patent and Trademark Office, Alexandria, VA 22313.

Jinxia Deng's present address is Dept. of Pharmaceutical Sciences, University of Southern California, School of Pharmacy, Los Angeles, CA 90089.

Submitted on December 21, 2004; accepted for publication February 4, 2005.


    REFERENCES
 TOP
 ABSTRACT
 INTRODUCTION
 THEORETICAL METHODS
 RESULTS AND DISCUSSION
 CONCLUSIONS
 ACKNOWLEDGEMENTS
 REFERENCES
 
Amadei, A., A. B. Linssen, and H. J. Berendsen. 1993. Essential dynamics of proteins. Proteins. 17:412–425.[CrossRef][Medline]

Asante-Appiah, E., and A. M. Skalka. 1997. Molecular mechanisms in retrovirus DNA integration. Antiviral Res. 36:139–156.[CrossRef][Medline]

Beese, L. S., and T. A. Steitz. 1991. Structural basis for the 3'-5' exonuclease activity of Escherichia coli DNA polymerase I: a two metal ion mechanism. EMBO J. 10:25–33.[Medline]

Brown, P. O., B. Bowerman, H. E. Varmus, and J. M. Bishop. 1987. Correct integration of retroviral DNA in vitro. Cell. 49:347–356.[CrossRef][Medline]

Bujacz, G., J. Alexandratos, Z. L. Qing, C. Clement-Mella, and A. Wlodawer. 1996. The catalytic domain of human immunodeficiency virus integrase: ordered active site in the F185H mutant. FEBS Lett. 398:175–178.[CrossRef][Medline]

Bushman, F. 1995. Targeting retroviral integration. Science. 267:1443–1444.[Free Full Text]

Cannon, P. M., W. Wilson, E. Byles, S. M. Kingsman, and A. J. Kingsman. 1994. Human immunodeficiency virus type 1 integrase: effect on viral replication of mutations at highly conserved residues. J. Virol. 68:4768–4775.[Abstract/Free Full Text]

Case, D. A., D. A. Pearlm, J. W. Caldwell, T. E. Cheatham 3rd, J. Wang, W. S. Ross, C. Simmerling, T. Darden, K. M. Merz, R. V. Stanton, and others. 2002. AMBER. Version 7. University of San Francisco, San Francisco, CA.

Chen, I. J., N. Neamati, and A. D. MacKerell Jr. 2002a. Structure-based inhibitor design targeting HIV-1 integrase. Curr. Drug Targets Infect. Disord. 2:217–234.[CrossRef][Medline]

Chen, I. J., N. Neamati, and A. D. MacKerell Jr. 2002b. Structure-based inhibitor design targeting HIV-1 integrase. Curr. Drug Targets Infect. Disord. 2:217–234.[CrossRef][Medline]

Chen, J. C., J. Krucinski, L. J. Miercke, J. S. Finer-Moore, A. H. Tang, A. D. Leavitt, and R. M. Stroud. 2000. Crystal structure of the HIV-1 integrase catalytic core and C-terminal domains: a model for viral DNA binding. Proc. Natl. Acad. Sci. USA. 97:8233–8238.[Abstract/Free Full Text]

Darden, T., D. York, and L. Pedersen. 1993. Particle mesh Ewald: an N·log(N) method for Ewald sums in large systems. J. Chem. Phys. 98:10089–10092.[CrossRef]

Dayam, R., and N. Neamati. 2003. Small-molecule HIV-1 integrase inhibitors: the 2001—2002 update. Curr. Pharm. Des. 9:1789–1802.[CrossRef][Medline]

de Groot, B. L., D. M. van Aalten, A. Amadei, and H. J. Berendsen. 1996. The consistency of large concerted motions in proteins in molecular dynamics simulations. Biophys. J. 71:1707–1713.[Abstract/Free Full Text]

DeLano, W. L. 2002. The PyMOL Molecular Graphics System. Version 0.95. DeLano Scientific, San Carlos, CA.

Drelich, M., R. Wilhelm, and J. Mous. 1992. Identification of amino acid residues critical for endonuclease and integration activities of HIV-1 IN protein in vitro. Virology. 188:459–468.[CrossRef][Medline]

Duan, Y., C. Wu, S. Chowdhury, M. C. Lee, G. Xiong, W. Zhang, R. Yang, P. Cieplak, R. Luo, T. Lee, and others. 2003. A point-charge force field for molecular mechanics simulations of proteins based on condensed-phase quantum mechanical calculations. J. Comput. Chem. 24:1999–2012.[CrossRef][Medline]

Dyda, F., A. B. Hickman, T. M. Jenkins, A. Engelman, R. Craigie, and D. R. Davies. 1994. Crystal structure of the catalytic domain of HIV-1 integrase: similarity to other polynucleotidyl transferases. Science. 266:1981–1986.[Abstract/Free Full Text]

Elber, R., and M. Karplus. 1990. Enhanced sampling in molecular dynamics: use of the time-dependent Hartree approximation for a simulation of carbon monoxide diffusion through myoglobin. J. Am. Chem. Soc. 112:9161–9175.[CrossRef]

Engelman, A., and R. Craigie. 1992. Identification of conserved amino acid residues critical for human immunodeficiency virus type 1 integrase function in vitro. J. Virol. 66:6361–6369.[Abstract/Free Full Text]

Engelman, A., Y. Liu, H. Chen, M. Farzan, and F. Dyda. 1997. Structure-based mutagenesis of the catalytic domain of human immunodeficiency virus type 1 integrase. J. Virol. 71:3507–3514.[Abstract]

Espadaler, J., N. Fernandez-Fuentes, A. Hermoso, E. Querol, F. X. Aviles, M. J. Sternberg, and B. Oliva. 2004. ArchDB: automated protein loop classification as a tool for structural genomics. Nucleic Acids Res. (database issue) 32:D185–D188.[Abstract/Free Full Text]

Espeseth, A. S., P. Felock, A. Wolfe, M. Witmer, J. Grobler, N. Anthony, M. Egbertson, J. Y. Melamed, S. Young, T. Hamill, and others. 2000. HIV-1 integrase inhibitors that compete with the target DNA substrate define a unique strand transfer conformation for integrase. Proc. Natl. Acad. Sci. USA. 97:11244–112449.[Abstract/Free Full Text]

Esposito, D., and R. Craigie. 1998. Sequence specificity of viral end DNA binding by HIV-1 integrase reveals critical regions for protein-DNA interaction. EMBO J. 17:5832–5843.[CrossRef][Medline]

Gallay, P., S. Swingler, C. Aiken, and D. Trono. 1995. HIV-1 infection of nondividing cells: C-terminal tyrosine phosphorylation of the viral matrix protein is a key regulator. Cell. 80:379–388.[CrossRef][Medline]

Gerton, J. L., S. Ohgi, M. Olsen, J. DeRisi, and P. O. Brown. 1998. Effects of mutations in residues near the active site of human immunodeficiency virus type 1 integrase on specific enzyme-substrate interactions. J. Virol. 72:5046–5055.[Abstract/Free Full Text]

Greenwald, J., V. Le, S. L. Butler, F. D. Bushman, and S. Choe. 1999. The mobility of an HIV-1 integrase active site loop is correlated with catalytic activity. Biochemistry. 38:8892–8898.[CrossRef][Medline]

Guex, N., and M. C. Peitsch. 1997. SWISS-MODEL and the Swiss-PDBViewer: an environment for comparative protein modeling. Electrophoresis. 18:2714–2723.[CrossRef][Medline]

Johnson, M. S., M. A. McClure, D. F. Feng, J. Gray, and R. F. Doolittle. 1986. Computer analysis of retroviral pol genes: assignment of enzymatic functions to specific sequences and homologies with nonviral enzymes. Proc. Natl. Acad. Sci. USA. 83:7648–7652.[Abstract/Free Full Text]

Jorgensen, W. L., J. Chandrasekhar, J. Madura, and M. L. Klein. 1983. Comparison of simple potential functions for simulating liquid water. J. Chem. Phys. 79:926–935.[CrossRef]

Kulkosky, J., K. S. Jones, R. A. Katz, J. P. Mack, and A. M. Skalka. 1992. Residues critical for retroviral integrative recombination in a region that is highly conserved among retroviral/retrotransposon integrases and bacterial insertion sequence transposases. Mol. Cell. Biol. 12:2331–2338.[Abstract/Free Full Text]

Leavitt, A. D., L. Shiue, and H. E. Varmus. 1993. Site-directed mutagenesis of HIV-1 integrase demonstrates differential effects on integrase functions in vitro. J. Biol. Chem. 268:2113–2119.[Abstract/Free Full Text]

Lins, R. D., J. M. Briggs, T. P. Straatsma, H. A. Carlson, J. Greenwald, S. Choe, and J. A. McCammon. 1999. Molecular dynamics studies on the HIV-1 integrase catalytic domain. Biophys. J. 76:2999–3011.[Abstract/Free Full Text]

Lobel, L. I., J. E. Murphy, and S. P. Goff. 1989. The palindromic LTR-LTR junction of Moloney murine leukemia virus is not an efficient substrate for proviral integration. J. Virol. 63:2629–2637.[Abstract/Free Full Text]

Madura, J. D., J. M. Briggs, R. C. Wade, M. E. Davi, B. A. Luty, Ilin, J. Antosiewicz, M. K. Gilson, B. Bagheri, L. R. Scot, and others. 1995. Electrostatics and diffusion of molecules in solution: simulations with the University of Houston Brownian dynamics program. Comput. Phys. Commun. 91:57–95.[CrossRef]

Oliva, B., P. A. Bates, E. Querol, F. X. Aviles, and M. J. Sternberg. 1997. An automated classification of the structure of protein loops. J. Mol. Biol. 266:814–830.[CrossRef][Medline]

Oliva, B., P. A. Bates, E. Querol, F. X. Aviles, and M. J. Sternberg. 1998. Automated classification of antibody complementarity determining region 3 of the heavy chain (H3) loops into canonical forms and its application to protein structure prediction. J. Mol. Biol. 279:1193–1210.[CrossRef][Medline]

Pani, A., A. G. Loi, M. Mura, T. Marceddu, P. La Colla, and M. E. Marongiu. 2002. Targeting HIV: old and new players. Curr. Drug Targets Infect. Disord. 2:17–32.[CrossRef][Medline]

Pluymers, W., E. De Clercq, and Z. Debyser. 2001. HIV-1 integration as a target for antiretroviral therapy: a review. Curr. Drug Targets Infect. Disord. 1:133–149.[CrossRef][Medline]

Roitberg, A., and R. Elber. 1991. Modeling side chains in peptides and proteins: application of locally enhanced sampling and simulated annealing methods to find minimum energy conformations. J. Chem. Phys. 95:9277–9287.[CrossRef]

Ryckaert, J. P., G. Ciccitti, and H. J. Berendsen. 1977. Numerical integration of the cartesian equations of motion of a system with constraints: molecular dynamics of n-alkanes. J. Comput. Phys. 23:327–341.[CrossRef]

Sayasith, K., G. Sauve, and J. Yelle. 2000. Characterization of mutant HIV-1 integrase carrying amino acid changes in the catalytic domain. Mol. Cells. 10:525–532.[Medline]

Sayasith, K., G. Sauve, and J. Yelle. 2001. Targeting HIV-1 integrase. Expert Opin. Ther. Targets. 5:443–464.[CrossRef][Medline]

Schames, J. R., R. H. Henchman, J. S. Siegel, C. A. Sotriffer, H. Ni, and J. A. McCammon. 2004. Discovery of a novel binding trench in HIV integrase. J. Med. Chem. 47:1879–1881.[CrossRef][Medline]

Tsurutani, N., M. Kubo, Y. Maeda, T. Ohashi, N. Yamamoto, M. Kannagi, and T. Masuda. 2000. Identification of critical amino acid residues in human immunodeficiency virus type 1 IN required for efficient proviral DNA formation at steps prior to integration in dividing and nondividing cells. J. Virol. 74:4795–4806.[Abstract/Free Full Text]

Turcotte, M., S. H. Muggleton, and M. J. Sternberg. 2001. Automated discovery of structural signatures of protein fold and function. J. Mol. Biol. 306:591–605.[CrossRef][Medline]

van den Ent, F. M., A. Vos, and R. H. Plasterk. 1998. Mutational scan of the human immunodeficiency virus type 2 integrase protein. J. Virol. 72:3916–3924.[Abstract/Free Full Text]

van Gent, D. C., A. A. Groeneger, and R. H. Plasterk. 1992. Mutational analysis of the integrase protein of human immunodeficiency virus type 2. Proc. Natl. Acad. Sci. USA. 89:9598–9602.[Abstract/Free Full Text]

van Gent, D. C., A. A. Oude Groeneger, and R. H. Plasterk. 1993. Identification of amino acids in HIV-2 integrase involved in site-specific hydrolysis and alcoholysis of viral DNA termini. Nucleic Acids Res. 21:3373–3377.[Abstract/Free Full Text]

Vink, C., E. Yeheskiely, G. A. van der Marel, J. H. van Boom, and R. H. Plasterk. 1991. Site-specific hydrolysis and alcoholysis of human immunodeficiency virus DNA termini mediated by the viral integrase protein. Nucleic Acids Res. 19:6691–6698.[Abstract/Free Full Text]




This article has been cited by other articles:


Home page
Biophys. JHome page
C. N. Alves, S. Marti, R. Castillo, J. Andres, V. Moliner, I. Tunon, and E. Silla
A Quantum Mechanic/Molecular Mechanic Study of the Wild-Type and N155S Mutant HIV-1 Integrase Complexed with Diketo Acid
Biophys. J., April 1, 2008; 94(7): 2443 - 2451.
[Abstract] [Full Text] [PDF]


Home page
J. Virol.Home page
K. Shimura, E. Kodama, Y. Sakagami, Y. Matsuzaki, W. Watanabe, K. Yamataka, Y. Watanabe, Y. Ohata, S. Doi, M. Sato, et al.
Broad Antiretroviral Activity and Resistance Profile of the Novel Human Immunodeficiency Virus Integrase Inhibitor Elvitegravir (JTK-303/GS-9137)
J. Virol., January 15, 2008; 82(2): 764 - 774.
[Abstract] [Full Text] [PDF]


Home page
Biophys. JHome page
N. Nunthaboot, S. Pianwanit, V. Parasuk, J. O. Ebalunode, J. M. Briggs, and S. Kokpol
Hybrid Quantum Mechanical/Molecular Mechanical Molecular Dynamics Simulations of HIV-1 Integrase/Inhibitor Complexes
Biophys. J., November 15, 2007; 93(10): 3613 - 3626.
[Abstract] [Full Text] [PDF]


Home page
J. Virol.