| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Center for the Study of Systems Biology, School of Biology, Georgia Institute of Technology, Atlanta, Georgia
Correspondence: Address reprint requests to Jeffrey Skolnick, E-mail: skolnick{at}gatech.edu.
| ABSTRACT |
|---|
|
|
|---|
250 amino acids in length. The average TM-scores of the best of top five models per target are 0.266, 0.336, and 0.362 by the threading algorithm SP3, original TASSER and chunk-TASSER, respectively. For a subset of 80 proteins with predicted
-helix content
50%, these averages are 0.284, 0.356, and 0.403, respectively. The percentages of proteins with the best of top five models having TM-score
0.4 (a statistically significant threshold for structural similarity) are 3.76, 20.94, and 28.94% by SP3, TASSER, and chunk-TASSER, respectively, overall, while for the subset of 80 predominantly helical proteins, these percentages are 2.50, 23.75, and 41.25%. Thus, chunk-TASSER shows a significant improvement over TASSER for modeling hard targets where no good template can be identified. We also tested chunk-TASSER on 21 medium/hard targets <200 amino-acids-long from CASP7. Chunk-TASSER is
11% (10%) better than TASSER for the total TM-score of the first (best of top five) models. Chunk-TASSER is fully automated and can be used in proteome scale protein structure prediction. | INTRODUCTION |
|---|
|
|
|---|
Ab initio protein structure prediction is not only useful for providing low-resolution structures that help the annotation of protein function (13
,15
,16
), but is also fundamentally important for understanding the mechanism of protein folding (17
). While there have been many efforts dedicated to ab initio protein structure prediction, no consistently reliable algorithm is currently available (18
–30
). In practice, ab initio methods fall into two groups: physics-based and knowledge-based. Methods in the first group fold proteins using only physical principles (30
–32
). The main obstacles that physics-based ab initio protein structure prediction methods face are the lack of accurate energy functions and the requirement of extensive computational power to find the global minimum of the energy function. Despite their conceptual appeal, this method is currently not as successful as knowledge-based approaches that make use of information from solved protein structures—in particular, knowledge-based potentials (18
).
Over the past several years, we have developed the Threading ASSEmbly Refinement (TASSER) methodology (13
,33
) for automated tertiary structure prediction that generates full-length models by rearranging the continuous fragments identified by threading. It is a kind of hybrid method that has the capacity to do template-free as well as template-based modeling. TASSER can significantly refine the initial template alignment structures provided by threading methods (33
). Furthermore, TASSER has some success in modeling template-free targets of small sizes when decoupled from threading templates (34
). In this work, we develop a variant of the TASSER methodology called chunk-TASSER that utilizes consensus contacts and distance restraints from ab initio folded protein chunks of supersecondary structure in addition to information extracted from threading templates. Modeling of protein chunks is much more efficient than full-length modeling in that the sampling space is much smaller, and thus large proteins can be handled. However, it does tend to favor global topologies of lower contact order. The method is assessed on a large set of 425 effectively hard targets
250 residues in length where the structure of the closest identified template is at best weakly related to that of the target. The results are compared to the SP3 threading method (35
,36
) and the original TASSER approach (13
,33
). Significant improvement of chunk-TASSER over both SP3 and TASSER is observed for these hard targets. We also tested chunk-TASSER on 21 CASP7 targets <200 residues in length that our threading algorithm classified as medium/hard targets.
| METHODS |
|---|
|
|
|---|
|
Another change made to SP3 that increases its sensitivity is the inclusion of profiles generated by PSIBLAST with a looser e-value cutoff of 1.0. To the target sequence, the sequence profile is replaced by the average of two profiles with e-value cutoffs 0.001 and 1.0, and to the templates, the structurally derived profile is replaced by the average of original and the PSIBLAST profile with an e-value cutoff of 1.0.
We extend the SP3 threading method to compute local sequence similarity by computing and recording the alignment score at each query sequence position aligned to each template during threading. The position-dependent score is then smoothed by averaging over a nine-residue sliding window. For each position, nine-residue-long fragments of the top 25 scored templates are selected to form the fragment library for the ab initio folding of chunks (18
).
Ab initio folding of chunks and chunk model selection
Chunks are defined as three consecutive regular secondary structure (helix or strand) segments including their enclosed two loops. The total number of chunks for a given target is Nsegment – 2, where Nsegment is the number of regular secondary structure segments. Chunk structures are predicted independently by a fragment insertion method as in Simons et al. (18
) but with our own implementation and force field. Each residue is described by its main chain atoms (N, C
, C, O), Cß atom, and side-chain center of mass. The force field contains the following terms:
coordinates (29
The description of DFIRE-based terms 1 and 2 are published elsewhere (38
,39
). The relative weight of terms 1 and 2 is set to one, since they are based on the same principle and procedure.
For the TASSER hydrogen-bond term, only main-chain hydrogen bonds are considered, and they are dependent on the C
coordinates by (29
)
![]() | (1) |
coordinates: li = ri,i+1/|ri,i+1|, ui = (li-1 – li)/|li-1 – li|, vi = ui x li/| ui x li|, where ri,i+1 is the C
-C
bond vector from residue i to i+1. The terms ui · uj and |vi · vj| impose a bias to a specific C
-C
bond vector orientation of regular H-bonds. The expression
i,j defines the conditions when residue i is hydrogen-bonded to residue j:
![]() | (2) |
H-bond formation also depends on the predicted secondary structure: H-bonds between residues in strand and helix are prohibited. Here,
is a stiffness modulation factor that is used to enhance the H-bond in the better-assigned secondary structure regions. It is set to 1.5 if a regular helix or strand structure is predicted; otherwise, it is set to 1.0. We shall optimize the relative weight wHB of this hydrogen-bond term relative to the other three terms.
The excluded volume term 4 is defined as
![]() | (3) |
Monte Carlo (MC) simulated annealing is used to sample chunk structure conformations with a fragment insertion method (18
) for ab initio chunk model prediction. Initially, a structure with random main-chain torsional angles is built. At each step in the MC procedure, a residue is randomly selected and then a nine- or three-residue-long fragment corresponding to the picked residue is randomly selected from the 25 fragments in the library obtained by the above-described chunk fragment generation procedure and inserted into the position by substituting the backbone torsional angles with those from the fragment. A new conformation is accepted or rejected according to the canonical Metropolis protocol.
The usual way of selecting models that are closer to native from a set of decoys generated by ab initio approaches is to cluster the models and select the most populated clusters. The success of a clustering method depends on the fact that lower free energy conformations are closer to native than higher ones. Since chunk models are not full protein models, this free energy condition may not be satisfied. Therefore, we developed and tested an alternative way of selecting chunk models that uses the information from the fragment library used for conformational sampling. The score Echunk for ranking chunk models is
![]() | (4) |
Full-length model assembly by TASSER and final model selection
TASSER (13
) represents a protein by a C
and side-chain center of mass representation in both off- and on-lattice space. The initial full-length model is built by connecting the continuous template-provided fragments (off-lattice building blocks) by a random walk confined to lattice bond vectors. If the specified number of unaligned residues cannot span the gap, a long C
-C
bond remains, and a springlike force draws sequential fragments together until a physically reasonable bond length is achieved. Parallel hyperbolic Monte Carlo (MC) sampling (41
) with replica exchange explores conformational space by rearranging the continuous fragments excised from the template. During assembly, the template fragments are kept rigid and off-lattice to retain their geometric accuracy; unaligned regions are modeled on a cubic lattice by an ab initio procedure and serve as linkage points for rigid body fragment rotations. Conformations are selected using an optimized force field which includes knowledge-based statistical potentials describing short-range backbone correlations, pairwise interactions, hydrogen bonding, secondary structure propensities, consensus C
and side-chain center of mass contacts, and short and long distance restraints for C
atoms. TASSER obtains these sequence-specific contact potentials and distance restraints from threading templates and/or fragments. The complete details of the TASSER force field are given in the literature (13
,29
) and references therein. Here, we give the contact potential and distance restraint terms that are relevant to this study. The contact potential between the C
atoms or side-chain centers of mass is calculated as
![]() | (5) |
(x) and
(x) are step functions defined as
![]() | (6) |
or side-chain center of mass in contact with that of the jth residue obtained from threading templates/fragments, and rij is the distance between residues i and j. The value p0 defines the minimal probability that two residues are predicted to be in contact and 6 Å is the distance cutoff for contacting residues. The first term in Eq. 5 favors pairs predicted as being in contact that are within 6 Å, whereas the secondary term penalizes predicted contact pairs that are farther apart than 6 Å when the total violation exceeds a threshold value of Ncp.
Local distance restraints for C
atoms with a less-than-six-residue sequence separation are collected from the threading templates/fragments. They are incorporated into the force field by
![]() | (7) |
ij is the root mean-square deviation of the predictions. The second term in Eq. 7 is a penalty when the cumulative normalized deviations to the predicted distance map exceeds the number of predictions Ndp.
Long-distance restraints for C
atoms are calculated in the force field as
![]() | (8) |
The weights wr1, wr2, wr3, wr4, and wr5 as well as those of other terms in TASSER not described above were optimized against a set of nonredundant decoys (29
).
Chunk-TASSER uses ab initio folded chunks as well as threading templates/fragments for consensus contact potentials and distance restraint predictions. In this study, the top 10 (ranked by SP3 Z-scores) threading templates and the top five chunk models for each chunk are included in the prediction of contact probability and distance restraints. Contacts and distance pairs from selected threading templates and chunk models are counted with equal weights in the prediction. For example, if there are n1 distance pairs from templates and n2 pairs from chunk models between residue i and j, the total distance pairs for residue i and j is (n1 + n2); and if there are n3 pairs within 6 Å (in contact), then the contact probability pij in Eq. 5 will be n3/(n1+ n2). When pij > p0 = 0.3, we consider residues i and j to be involved in a real contact in the native structure, and the contact potential is effective in Eq. 5. The threading templates/fragments also serve as starting structures in the chunk-TASSER simulation as they do in TASSER. Therefore, the only difference between chunk-TASSER and TASSER is that chunk-TASSER has contact and distance information from the selected ab initio folded chunks whereas TASSER does not. The final full-length models are selected by clustering the low energy trajectories (containing 16,000 energies) using SPICKER (42
).
Benchmark sets and parameter optimization
Benchmarking structures were randomly picked from the Protein DataBank released between May 28, 2004 and September, 2005. The threading library used structures deposited before May 28, 2004. All structures share <35% sequence identity with each other. No resolution and domain number requirements are imposed on these sets other than they are single-chain structures and that the predicted number of chunks
1. In optimizing the parameter wHB (the relative weight of the TASSER hydrogen-bond term in folding chunks) and wd (the relative weight of the DFIRE energy term used in ranking the chunk models), we used 60 structures (optimization set) that share <35% sequence identity with the 425 testing structures
250 amino-acid (AA)-long (testing set). Lists of these two sets can be found at http://cssb.biology.gatech.edu/chunk-TASSER.
To make the optimization set and testing set effectively hard targets, all structures with a TM-score (43
)
0.4 to each target are removed from the threading library (44
). We optimized the parameters to maximize the total TM-score of the top five selected chunk models on the 60-protein set. The optimization procedure is done iteratively by fixing one and changing the other. For example, we set an initial value for wHB, then let wd change within a range of 0–0.1; next, we set wd to its optimal value and let wHB change within 0–5, etc. The procedure is iterated until the optimal values of wHB and wd do not change. The resulting parameters are (wHB, wd) = (0.5, 0.01). For a realistic small-scale test, we used 21 medium/hard targets <200 AA long from CASP7 with threading library structures and sequence database released before the CASP7 season.
| RESULTS |
|---|
|
|
|---|
75 CPU hours on a dual core 2.0 GHz Opteron CPU to generate 5000 chunk models (
1 model/min). This can be easily distributed to several independent computers. To test how accurate the models are when chunks are included into TASSER, we select the top five best chunk models according to their RMSD to native in combination with the top 10 threading templates (selected according to threading Z-score) to construct contact potentials and distance restraints for chunk-TASSER to build full length models. The cumulative TM-score of chunk-TASSER on the 425 testing set proteins in this ideal scenario is shown in Table 1. The average TM-score of the best of top five SPICKER (42
0.4 to native structure.
|
760 models. The average linear correlation coefficient between Echunk and chunk C
RMSD to native is 0.266. Although this number is small, the p-value (45
3% better than scenario 2 and 1.5% better than 4 by TM-score, whereas the ideal scenario 1 is
12% better than scenario 3. The gap between the realistic scenario 3 and the ideal upper limit (scenario 1) is even larger in terms of the number of models having a TM-score
0.4 to native (123 vs. 202). The cumulative TM-score by original TASSER is also shown in Table 1. Chunk-TASSER scenario 3 is
8% better than TASSER. In what follows, we shall analyze chunk-TASSER in scenario 3 in more detail.
|
100 AA is 0.395, and for the 80 proteins with predicted
-helix content
0.5 is 0.403, whereas those of TASSER are 0.360 and 0.356, respectively. For larger proteins or less
-helix content proteins, chunk-TASSER's performance is slightly worse, but it is still better than TASSER. The percentages of proteins with models having TM-score
0.4 are 3.76, 20.94, and 28.94% by SP3, TASSER, and chunk-TASSER, respectively. These percentages are 2.50, 23.75, and 41.25% by SP3, TASSER, and chunk-TASSER, respectively, for the 80
-proteins. Fig. 3 shows the comparison of the prediction results by TASSER and chunk-TASSER on the 425 protein set with the number of proteins having models with a TM-score greater than a given threshold. Chunk-TASSER has an improvement over TASSER for all thresholds of TM-score >0.30 (the average value of the best structural alignment between a pair of randomly related structures). We compare chunk-TASSER and TASSER using the TM-score on the 80
-protein set in Fig. 4. In 60 cases, chunk-TASSER is better than TASSER, whereas in 20 cases TASSER is better than chunk-TASSER.
|
|
|
-helix content are small but very significant according to the corresponding p-values. This is due to the fact that, on average, an
-protein has fewer regular secondary structure segments for a given length and lower contact order than ß or
/ß proteins and therefore the conformational space of an
-protein is relatively smaller and easier to access. This explains why TASSER and chunk-TASSER perform better for
-proteins than for other types of proteins. The correlation between TASSER's performance and target size (as defined by the number of chunks) is marginally significant, whereas for chunk-TASSER, it is insignificant. As expected, both TASSER and chunk-TASSER have a small but significant dependence on contact order.
|
|
-protein of 68 amino acids. The best template found by SP3 for this protein is 11.0 Å to native. TASSER was able to refine it to 7.9 Å. Chunk-TASSER predicted the best model (the fourth model) to be 3.3 Å to native. 1u0sa is an 86-residue Å
-/ß-protein with three helices packed against a four-ß strand sheet. The best TASSER model has an RMSD of 7.7 Å away from native, whereas the best chunk-TASSER model (third model) is 3.7 Å to native. 1s3la is medium sized protein with 165 AA. It is a mixed
- and ß-protein. The best template has a RMSD of 11.2 Å to native and the best model given by TASSER has a RMSD of 7.6 Å and a TM-score of 0.41. With the inclusion of ab initio folded chunks, chunk-TASSER refines it to a RMSD of 6.6 Å and a TM-score of 0.51. The success of chunk-TASSER for this protein is mainly due to the ability of TASSER methodology to make use of the weak signal from the threading templates, although all templates have a RMSD > 11 Å to native. 1wixa and 1sumb are all
-proteins that contain 164 and 225 residues, respectively. The best models by TASSER have RMSDs of 13.5 Å and 9.5 Å for 1wixa and 1sumb, respectively. Chunk-TASSER significantly improves the models to 7.4 Å and 6.6 Å, respectively. 1tvca is another example that shows the ability of chunk-TASSER to utilize information from templates through the TASSER methodology. 1tvca is a two-domain protein. The templates found by SP3 all have a TM-score <0.4 to native because they cover only one of the domains. TASSER's best (ranked first) prediction for this protein is 6.1 Å and has a TM-score of 0.68. The best model (ranked first) by chunk-TASSER is almost the same in that it is 5.9 Å and has a TM-score of 0.64 to native. The relative orientation of the two domains is correctly modeled by both chunk-TASSER and TASSER. The reason for this successful modeling is due to the 3–7 overlapping residues between templates covering different domains that provide contact and distance restraints that define the domain orientation.
|
| DISCUSSION |
|---|
|
|
|---|
6% (4%) better than ROSETTA as assessed by total TM-score of the first (best) models. There are several examples in the 21 CASP7 set which show that chunk-TASSER combines the advantages of both ROSETTA and TASSER. For example, chunk-TASSER performs similarly with ROSETTA for target T0283, whereas chunk-TASSER is similar to TASSER for target T0348, T0354, T0363, and T0383 for which the SP3 template structures are also good.
Although chunk-TASSER shows significant improvement over original TASSER for hard targets, there is still much room for further improvement as indicated by the big gap between the realistic scenario 3 and the ideal scenario 1 (see Table 1). The current chunk model selection procedure is far from satisfactory. Further improvement can be achieved by more accurate selection of chunk models and generation of more near-native chunk models. As in TASSER, improvement can also be gained through more sensitive template identification from the structure library or from other modeling approaches. For example, using an approach like iterative TASSER (49
,50
), one can use the models from a first round of chunk-TASSER modeling in a second round of chunk-TASSER. This possibility of improvement is currently under investigation. Since chunk-TASSER, like TASSER, models a protein only in terms of its C
atoms and side-chain centers of mass, its accuracy is limited by the force-field resolution (
1.5 Å). Therefore, development of methods to refine chunk-TASSER models and to rebuild the finer structural details using full atomic potentials is also needed.
Recently, our laboratory has developed the TASSER-Lite algorithm (51
) that implements a limited time TASSER simulation. The algorithm was used in MetaTASSER (unpublished) that participated in CASP7 and was among one of the top performing servers. We are currently investigating a similar strategy for chunk-TASSER that will facilitate its public use.
| ACKNOWLEDGEMENTS |
|---|
|
|
|---|
This research was supported in part by grant Nos. GM-37408 and GM-48835 of the Division of General Medical Sciences of the National Institutes of Health.
| FOOTNOTES |
|---|
Submitted on March 30, 2007; accepted for publication May 9, 2007.
| REFERENCES |
|---|
|
|
|---|
2. Baker, D., and A. Sali. 2001. Protein structure prediction and structural genomics. Science. 294:93–96.
3. Pieper, U., N. Eswar, H. Braberg, M. S. Madhusudhan, F. P. Davis, A. C. Stuart, N. Mirkovic, A. Rossi, M. A. Marti-Renom, A. Fiser, B. Webb, D. Greenblatt, C. C. Huang, T. E. Ferrin, and A. Sali. 2004. MODBASE, a database of annotated comparative protein structure models, and associated resources. Nucleic Acids Res. 32:D217–D222.
4. Murzin, A. G. 2001. Progress in protein structure prediction. Nat. Struct. Biol. 8:110–112.[CrossRef][Medline]
5. Altschul, S. F., W. Gish, W. Miller, E. Myers, and D. Lipman. 1990. Basic local alignment tool. J. Mol. Biol. 215:403–410.[CrossRef][Medline]
6. Jones, D. T., W. R. Taylor, and J. M. Thornton. 1992. A new approach to protein fold recognition. Nature. 358:86–89.[CrossRef][Medline]
7. Vingron, M., and M. S. Waterman. 1994. Sequence alignment and penalty choice. Review of concepts, case studies and implications. J. Mol. Biol. 235:1–12.[CrossRef][Medline]
8. Altschul, S. F., T. L. Madden, A. A. Schäffer, J. Zhang, Z. Zhang, W. Miller, and D.-J. Lipman. 1997. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 25:3389–3402.
9. Karplus, K., C. Barrett, and R. Hughey. 1998. Hidden Markov models for detecting remote protein homologies. Bioinformatics. 14:846–856.
10. Jones, D. T. 1999. GenTHREADER: an efficient and reliable protein fold recognition method for genomic sequences. J. Mol. Biol. 287:797–815.[CrossRef][Medline]
11. David, R., M. J. Korenberg, and I. W. Hunter. 2000. 3D–1D threading methods for protein fold recognition. Pharmacogenomics. 1:445–455.[CrossRef][Medline]
12. Lundsröm, J., L. Rychlewski, J. Bunnicki, and A. Elofsson. 2001. PCONS: a neural-network-based consensus predictor that improves fold recognition. Protein Sci. 10:2354–2362.
13. Zhang, Y., and J. Skolnick. 2004. Automated structure prediction of weakly homologous proteins on genomic scale. Proc. Natl. Acad. Sci. USA. 101:7594–7599.
14. Bernstein, F. C., T. F. Koetzle, G. J. B. Williams, E. F. Meyer, M. D. B. Jr, J. R. Rodgers, O. Kennard, T. Shimanouchi, and M. Tasumi. 1977. The Protein DataBank: a computer-based archival file for macromolecular structures. J. Mol. Biol. 112:535–542.[Medline]
15. Simons, K. T., C. Strauss, and D. Baker. 2001. Prospects for ab initio protein structural genomics. J. Mol. Biol. 306:1191–1199.[CrossRef][Medline]
16. Bonneau, R., J. Tsai, I. Ruczinski, and D. Baker. 2001. Functional inferences from blind ab initio protein structure predictions. J. Struct. Biol. 134:186–190.[CrossRef][Medline]
17. Duan, Y., and P. A. Kollman. 1998. Pathways to a protein folding intermediate observed in a 1-microsecond simulation in aqueous solution. Science. 23:740–744.
18. Simons, K. T., C. Kooperberg, E. Huang, and D. Baker. 1997. Assembly of protein tertiary structures from fragments with similar local sequences using simulated annealing and Bayesian scoring functions. J. Mol. Biol. 268:209–225.[CrossRef][Medline]
19. Eyrich, V. A., D. M. Standley, A. K. Flets, and R. A. Friesner. 1999. Protein tertiary structure prediction using a branch and bound algorithm. Proteins. 35:41–57.[CrossRef][Medline]
20. Lazaridis, T., and M. Karplus. 2000. Effective energy functions for protein structure prediction. Curr. Opin. Struct. Biol. 10:139–145.[CrossRef][Medline]
21. Petrey, D., and B. Honig. 2000. Free energy determinants of tertiary structure and the evaluation of protein models. Protein Sci. 9:2181–2191.[Abstract]
22. Jaroszewski, L., L. Rychlewski, W. Li, and A. Godzik. 2000. Comparison of sequence profiles. Strategies for structural predictions using sequence information. Protein Sci. 9:232–241.[Abstract]
23. Jaroszewski, L., L. Rychlewski, and A. Godzik. 2000. Improving the quality of twilight-zone alignment. Protein Sci. 9:1487–1496.[Abstract]
24. Xia, Y., E. S. Huang, M. Levitt, and R. Samudrala. 2000. Ab initio construction of protein tertiary structures using a hierarchical approach. J. Mol. Biol. 300:171–185.[CrossRef][Medline]
25. Yue, K., and K. A. Dill. 2000. Constraint-based assembly of tertiary protein structures from secondary structure elements. Protein Sci. 9:1935–1946.[Abstract]
26. Lee, M. R., J. Tsai, D. Baker, and P. A. Kollman. 2001. Molecular dynamics in the endgame of protein structure prediction. J. Mol. Biol. 313:417–430.[CrossRef][Medline]
27. Pillardy, J., C. Czaplewski, A. Liwo, J. Lee, D. R. Ripoll, R. Kamierkiewicz, S. Odziej, W. J. Wedemeyer, K. D. Gibson, Y. A. Arnautova, J. Saunders, Y.-J. Ye, and H. A. Scheraga. 2001. Recent improvements in prediction of protein structure by global optimization of a potential energy function. Proc. Natl. Acad. Sci. USA. 98:2329–2333.
28. Zhang, C., J. Hou, and S.-H. Kim. 2002. Fold prediction of helical proteins using torsion angle dynamics and predicted restraints. Proc. Natl. Acad. Sci. USA. 99:3581–3585.
29. Zhang, Y., A. Kolinski, and J. Skolnick. 2003. TOUCHSTONE II: a new approach to ab initio protein structure prediction. Biophys. J. 85:1145–1164.
30. Chinchio, M., C. Czaplewski, S. Odziej, and H. A. Scheraga. 2006. A hierarchical multiscale approach to protein structure prediction: production of low-resolution packing arrangements of helices and refinement of the best models with a united-residue force field. Multiscale Model. Sim. 5:1175–1195.[CrossRef]
31. Odziej, S., C. Czaplewski, A. Liwo, M. Chinchio, M. Nanias, J. A. Vila, M. Khalili, Y. A. Arnautova, A. Jagielska, M. Makowski, H. D. Schafroth, R. Kamierkiewicz, D. R. Ripoll, J. Pillardy, J. A. Saunders, Y. K. Kang, K. D. Gibson, and H. A. Scheraga. 2005. Physics-based protein-structure prediction using a hierarchical protocol based on the UNRES force field: assessment in two blind tests. Proc. Natl. Acad. Sci. USA. 102:7547–7552.
32. Crivelli, S., T. Phillips, R. Byrd, E. Eskow, R. Schnabel, R. Yu, and T. Head-Gordon. 2000. A global optimization strategy for predicting protein tertiary structure:
-helical proteins. Comput. Chem. 24:489–497.[CrossRef][Medline]
33. Zhang, Y., A. Arakaki, and J. Skolnick. 2005. TASSER: an automated method for the prediction of protein tertiary structures in CASP6. Proteins. 61:91–98.[CrossRef][Medline]
34. Borreguero, J., and J. Skolnick. 2007. Benchmarking of TASSER in the ab initio limit. Proteins. In press.
35. Zhou, H., and Y. Zhou. 2005. Fold recognition by combining sequence profiles derived from evolution and from depth-dependent structural alignment of fragments. Proteins. 58:321–328.[CrossRef][Medline]
36. Zhou, H., and Y. Zhou. 2005. SPARKS 2 and SP3 servers in CASP 6. Proteins. 7:S152–S156.
37. Domingues, F. S., P. Lackner, A. Andreeva, and M. J. Sippl. 2000. Structure-based evaluation of sequence comparison and fold recognition alignment accuracy. J. Mol. Biol. 297:1003–1013.[CrossRef][Medline]
38. Zhou, H., and Y. Zhou. 2002. Distance-scaled, finite ideal-gas reference state improves structure-derived potentials of mean force for structure selection and stability prediction. Protein Sci. 11:2714–2726.
39. Zhang, C., S. Liu, H. Zhou, and Y. Zhou. 2004. An accurate residue-level pair potential of mean force for folding and binding based on the distance-scaled ideal-gas reference state. Protein Sci. 13:400–411.
40. Ramachandran, G. N., and V. Sasisekharan. 1968. Conformation of polypeptides and proteins. Adv. Protein Chem. 23:283–438.[Medline]
41. Zhang, Y., D. Kihara, and J. Skolnick. 2002. Local energy landscape flattening: parallel hyperbolic Monte Carlo sampling of protein folding. Proteins. 48:192–201.[CrossRef][Medline]
42. Zhang, Y., and J. Skolnick. 2004. SPICKER: a clustering approach to identify near-native protein fold. J. Comput. Chem. 25:865–871.[CrossRef][Medline]
43. Zhang, Y., and J. Skolnick. 2004. A scoring function for the automated assessment of protein structure template quality. Proteins. 57:702–710.[CrossRef][Medline]
44. Zhang, Y., I. Hubner, A. Arakaki, E. Shakhnovich, and J. Skolnick. 2006. On the origin and highly likely completeness of single-domain protein structures. Proc. Natl. Acad. Sci. USA. 103:2605–2610.
45. Press, W. H., B. P. Flannery, S. A. Teukolsky, and W. T. Vetterling. 1989. Numerical Recipes: The Art of Scientific Computing. Cambridge University Press, Cambridge, UK.
46. Ginalski, K., A. Elofsson, D. Fischer, and L. Rychlewski. 2003. 3D-jury: a simple approach to improve protein structure predictions. Bioinformatics. 19:1015–1018.
47. Reference deleted in proof.
48. Marti-Renom, M. A., M. S. Madhusudhan, A. Fiser, B. Rost, and A. Sali. 2002. Reliability of assessment of protein structure prediction methods. Structure. 10:435–440.[Medline]
49. Lee, S., and J. Skolnick. 2007. Development and benchmarking of TASSERiter for the iterative improvement of protein structure predictions. Proteins. In press.
50. Wu, S., J. Skolnick, and Y. Zhang. 2007. Ab initio modeling of small proteins by iterative TASSER simulations. BMC Biology 5:17.[CrossRef][Medline]
51. Pandit, S. B., Y. Zhang, and J. Skolnick. 2006. TASSER-Lite: an automated tool for protein comparative modeling. Biophys. J. 91:4180–4190.
This article has been cited by other articles:
![]() |
S. Y. Lee and J. Skolnick Benchmarking of TASSER_2.0: An Improved Protein Structure Prediction Algorithm with More Accurate Predicted Contact Restraints Biophys. J., August 15, 2008; 95(4): 1956 - 1964. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. Shen, J. Bai, and M. Vihinen Physicochemical feature-based classification of amino acid mutations Protein Eng. Des. Sel., January 1, 2008; 21(1): 37 - 44. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |