| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |


* Department of Molecular and Cellular Biology, Harvard University, Cambridge, Massachusetts;
Department of Biochemistry and Biophysics, University of North Carolina School of Medicine, Chapel Hill, North Carolina; and
Department of Chemistry and Chemical Biology, Harvard University, Cambridge, Massachusetts
Correspondence: Address reprint requests to Eugene Shakhnovich, Dept. of Chemistry and Chemical Biology, Harvard University, Cambridge, MA 02138. Tel: 617-495-4130; Fax: 617-496-5948; E-mail: eugene{at}belok.harvard.edu.
| ABSTRACT |
|---|
|
|
|---|
| INTRODUCTION |
|---|
|
|
|---|
The task of understanding protein structural evolution has relied on the analysis of structural similarities between proteins (Dokholyan et al., 2002
; Karev et al., 2002
; Koonin et al., 2002
; Qian et al., 2001
), much as the study of evolutionary relationships between genes and species has relied on studying similarities in primary sequence or other characters (Giribet, 2002
). Structural similarity has been defined at varying levels of detail, from the assignment of structures to families and folds in human-annotated databases (Karev et al., 2002
; Koonin et al., 2002
; Qian et al., 2001
) to the patterns of structural neighbors in quantitative comparisons (Dokholyan et al., 2002
). Recently, graph theoretic approaches have been utilized to represent structural similarity at these varying scales, and have been used by many to motivate and implement various models of protein structural evolution (Dokholyan et al., 2002
; Karev et al., 2002
; Koonin et al., 2002
; Qian et al., 2001
). One particular application of these approaches, called the protein domain universe graph or PDUG (Dokholyan et al., 2002
), revealed that the distribution of the number of structural neighbors k per domain follows a power law p(k)
k-µ and represents a scale-free network (Albert and Barabasi, 2002
; Barabasi and Albert, 1999
).
The main result of the recent research in protein structural evolution has been the emergence of divergent evolution as a dominant paradigm (Dokholyan et al., 2002
; Karev et al., 2002
; Koonin et al., 2002
; Qian et al., 2001
). The divergent models that have been proposed to accompany these studies, although often successfully reproducing various global patterns of structural similarity observed in real proteins, have been nonetheless relatively abstract in nature. In particular, although the existence of a protein structural space or "universe" (that is, the set of all possible protein structures) has been postulated (Koonin et al., 2002
), none of the proposed models have attempted to represent the evolution of proteins within this space (Dokholyan et al., 2002
; Karev et al., 2002
; Koonin et al., 2002
; Qian et al., 2001
). Indeed, there exists little understanding of what features (such as the underlying distribution of the number of structural neighbors) characterize this space, or how those features might influence structural evolution. Also, although the observed patterns of structural similarity have been explained in terms of evolution, it is unclear if those patterns of similarity are truly the imprint of evolutionary processes or simply a general feature of a space of compact polymer structures. A detailed characterization of the mechanisms of protein structural evolution will therefore not only require at least some level of understanding of the protein structures that have been available to organisms over the course of their evolution, but also an understanding of which patterns observed in real proteins likely contain information about their evolution and which might simply result from their identity as polymers.
An understanding of this space, however, is impeded by the fact that it is difficult to obtain a characterization of the space of real proteins. It is currently impossible to reliably predict the structure of a protein that shares no detectable sequence homology with a known structure (Bonneau et al., 2002
; Lesk et al., 2001
). Although structural genomics projects may soon provide a much greater structural description of the extant protein universe, the protein structures that have been discovered over the course of evolution may represent only a fraction of the structural possibilities available from protein structural space (Koonin et al., 2002
). Thus, a complete description of this space in terms of actual protein structures is infeasible at this time.
Model protein systems, such as lattice polymers, do not necessarily share these limitations. One can completely enumerate the structural space of 3 x 3 x 3 lattice polymers, for instance, subject only to the reasonable constraint that the polymers be maximally compact. The resulting 103,346 distinct structures represent a completely characterized structural space. Although there are many important differences between this set of structures and real proteins (such as a lack of secondary structure elements), this space nonetheless presents an interesting and simple model system for study (Li et al., 1996
; Mirny and Shakhnovich, 1996
; Shakhnovich and Gutin, 1990
). The cubic lattice polymer also represents a generalized compact polymer, and thus can provide the necessary "baseline" for what one would expect when looking at distributions of structural similarity in spaces of other compact polymers.
To understand the space of lattice polymers, we must develop a system for classifying and studying the relationships between lattice structures. Given the large number of structures, it is impractical to do so in a manner analogous to that used in the development of human-annotated databases such as SCOP and CATH (Murzin et al., 1995
; Orengo et al., 2002
). We thus utilize an automated method similar to that employed by FSSP (Holm and Sander, 1996
): we develop a method of structural comparison between lattice polymers based on the contact maps of these polymers and produce a Z-score of structural similarity analogous to that produced by DALI (Dietmann and Holm, 2001
; Holm and Sander, 1993
). Given the success of graph theoretic approaches in the representation of real protein spaces (Dokholyan et al., 2002
), we use this lattice Z-score to construct a graph in which the lattice structures are represented as nodes and structural similarity (above some Z-score cutoff) is used to define the edges between those nodes. This graph, termed the lattice structure graph or LSG, is a completely characterized structural space that can be analyzed using the same methodologies that have been applied to the limited set of real protein structures available.
Our analysis shows that the space of lattice structures exhibits a distribution of structural neighbors that is consistent with a random graph, a finding that indicates that the scale-free nature of the PDUG is not a general feature of polymer structures. We also approach and characterize a number of features of certain subspaces of the LSG. We employ a recently developed measure of stability and "designability" of lattice structures (England and Shakhnovich, 2003
) to determine the effects of physical constraints on this space. We find that these subsets of structures also represent random graphs. We then develop two modified versions of a "duplication-and-divergence" evolutionary model and find that subsets of the LSG chosen according to evolutionary rules can recapitulate the scale-free features of the PDUG.
| RESULTS |
|---|
|
|
|---|
![]() | (1) |
ij is 1 if positions i and j in the conformation are in contact and 0 otherwise, and N is the total number of contacts possible in the polymer (in the case of the 3 x 3 x 3 cubic lattice, N = 28). The distribution of Q-scores between lattice structures is well fit by a Gaussian (data not shown), and thus we define the Z-score of the comparison as
![]() | (2) |
Q
is the average value of QAB over all AB pairs and
Q is the standard deviation of Q over all AB pairs. In the case of the lattice,
Q
= 0.188 and
Q = 0.075. This process produces a Z-score that is the exact analog of the DALI Z-score (Dietmann and Holm, 2001
One can define a cutoff for structural similarity using the above framework in a manner similar to that employed for structural comparisons between real proteins. To assay the similarity between such comparisons on the lattice and in real proteins, we compared the distribution of Z-scores greater than a cutoff (Zmin) of 2 (the smallest Z-score reported by the DALI algorithm) on the lattice and in the set of protein domains that constitute the PDUG (Dokholyan et al., 2002
).
In both cases, the distribution of Z-scores exhibit long, non-Gaussian tails (see Fig. 1). Although the scales of the Z-scores differ (given that there is a maximum Z of
9 on the lattice), we observe that the distribution of structural similarity in this respect is quite similarindicating that this most likely represents a feature of compact polymers under this framework of structural comparison.
|
One of the most telling features of any graph is its degree distribution, or the distribution of the number of edges (in this case structural neighbors) per node. We calculate the degree distribution at various values of Zmin for the entire LSG, yielding the results in Fig. 2. In stark contrast to the power-law degree distribution observed in the PDUG (Dokholyan et al., 2002
), we find that p(k) for the LSG is well fit by a Gaussian. Gaussian degree distributions are a well-known feature of random graphs, or graphs in which edges are distributed randomly (Albert and Barabasi, 2002
).
|
Although the LSG does not share its degree distribution with the PDUG, this does not indicate that other features might not be observed within both systems. Another striking feature exhibited by proteins is their distribution in structural "families"certain structural classes (also known as "folds") contain many more representatives than other folds (Dokholyan et al., 2002
; Finkelstein et al., 1993
; Koonin et al., 2002
; Qian et al., 2001
). Within our graph-theoretical framework, these folds correspond to disjoint clusters of nodes (Dokholyan et al., 2002
), and the distribution of cluster sizes has been shown to follow a power law in various classification schemes (Dokholyan et al., 2002
; Koonin et al., 2002
; Qian et al., 2001
). Given that such distributions can also be observed in random graphs (Dokholyan et al., 2002
), we desired to identify corresponding "families" of lattice structures.
The space of lattice structures is much too large to cluster using current computational resources, so to explore this question we consider smaller subgraphs of the LSG. To facilitate direct comparison with the
3500 nodes in the PDUG, we construct random subgraphs of the LSG with the same number of structures. These subgraphs exhibit Gaussian degree distributions at various values of Zmin (Fig. 3). We cluster these subgraphs at various values of Zmin (Dokholyan et al., 2002
), and discover that, as observed in other random graphs (Dokholyan et al., 2002
), they undergo a transition in the size of the largest cluster (or "giant component" of the graph)similar to the percolation transition in this subspace (Albert and Barabasi, 2002
; Cohen et al., 2002
) (Fig. 4 A).
|
|
Physical constraints
Although the random-graph nature of the LSG may have implications for the nature of the real protein space, the preceding analysis entirely ignores the possible existence of physical constraints that might influence the characteristics of the space of "feasible" protein structures. Although the effects of such constraints can only be roughly represented on the lattice, it is possible to identify and model how they might affect a space of protein structures.
The two simplest constraints one can apply to real structures are that the native state should be sufficiently stable energetically (retain its fold in the presence of reasonable thermal fluctuations) and that the structures should be stable with respect to mutations (retain its fold in the presence of reasonable sequence fluctuations). Recently, a reliable measure of both of these properties was discovered: the trace of the contact matrix of a structure, which contains information about its "designability" (or the number of sequences that can adopt that structure) and its stability (England and Shakhnovich, 2003
). These authors demonstrated that the maximal eigenvalue of the contact matrix can serve as a good predictor of designability of compact 27-mer lattice model proteins. Earlier studies (Li et al., 1996
; Shakhnovich, 1998
; Wolynes, 1996
) had identified the symmetry of a protein structure and the overall number of contacts between its residues as possible determinants of protein designability. Whereas the maximal eigenvalue criterion of England and Shakhnovich is consistent with these earlier suggestions, it is especially well suited for the compact 27-mers studied here, since symmetry can be defined for such structures only intuitively, and density criterion (Wolynes, 1996
) would predict equal designability for all maximally compact lattice 27-mer structures. The maximal eigenvalue correlates well with the stability and observed sequence entropy of lattice structures (England and Shakhnovich, 2003
), and thus provides a good approximation to these physical constraints in this system.
We thus considered sets of 3500 lattice structures with the greatest, least, and median contact traces corresponding to the most, least, and median "designable" structures in the LSG. As with the random subgraphs, we cluster these graphs and identify the transition in the giant component of each (Fig. 5 A). We calculate the degree distributions and cluster size distributions for these subgraphs (Fig. 5, B and C). It is evident that the most designable structures, and to a lesser extent the least designable structures, represent a specific subset of the graph that is much more highly connected than one readily observes at random (Fig. 5 C). This behavior most likely results from the shared structural features that yield either exceptionally high or exceptionally low contact traces. These graphs, while certainly not a random subset of the space, nonetheless exhibit degree distributions well fit by Gaussian functions and are thus similar to those of random graphs.
|
Evolutionary model
The random-graph nature of the LSG, and the potentially similar nature of the space of real protein structures, raises the question of how evolution might proceed in such a structural space. To address this question, we create a version of our earlier model of evolution in an arbitrary space (Dokholyan et al., 2002
) to model the evolution of lattice proteins. To preserve the "duplication-and-divergence" paradigm of our previous model (Dokholyan et al., 2002
), we add "daughter" structures to the evolving subgraph according to their level of relatedness to a randomly chosen "parent" structure. At each duplication step, the newly added daughter node has some probability of diverging far enough to become an "orphan" (i.e., not connecting to the parent node) and some probability of being structurally related to the parent node.
The nature of the lattice space complicates the definition of "orphan" nodes, however. In our previous model the addition of an orphan is a completely local eventthe new node does not connect the parent node and is not allowed to contact any other node on the graph. On the lattice, however, nodes that do not contact a given parent node at a particular Zmin may still contact other nodes in the "evolved" subgraph. Thus the addition of "true" orphans (i.e., nodes with degree 0) requires a global understanding of the connectivity between a candidate orphan and any of the structures currently on the graph. Given the predominance of orphan nodes in the degree distributions of both the previous model and the PDUG (Dokholyan et al., 2002
), we define orphans in our new model as nodes that have a degree of 0 in the evolving subgraph, rather than nodes that simply do not connect the chosen parent node. Orphan nodes are chosen with some probability pO, analogous to the probability that a daughter will be chosen beyond the cutoff for similarity in our earlier model (Dokholyan et al., 2002
).
Connected daughter structures, chosen with probability of 1-pO, are defined as any node that connects the chosen parent node above a Zmin cutoff. Although this is similar to the previous model (Dokholyan et al., 2002
), in the first instantiation of this model we do not choose daughter nodes according to the level of similarity above the cutoff. The connectivity between non-orphan nodes that do not belong to a parent-daughter pair depends only on the underlying edges in the LSG. In this model, duplicate nodes are not allowed, resulting in an evolved subgraph consisting solely of unique lattice structures.
We find that this model is quite sensitive to two parameters: pO and the Zmin at which edges are defined. Within certain regimes the model simply cannot create subgraphs of the target size (3500 nodes). The number of "potential orphans" in the template graph decreases exponentially as the algorithm progresses (data not shown), and the rate of this decrease depends on pO (the percentage of nodes that are added as orphans) and Zmin. At high pO and low Zmin this decrease is too fast to allow for construction of a graph with 3500 nodes. Conversely, at low pO and high Zmin the nodes added to the graph do not have enough available "daughters" in the template space to fulfill the requirements of the model. We do not observe a successful run of 3500 nodes outside of a very narrow range of the parameter space, and simulations that could be completed using any value of pO were only readily observed at a Zmin of 4. The small size of this parameter regime may simply result from the restricted nature of the lattice space (the relatively small number of available nodes, for instance); nonetheless this behavior clearly illustrates the effects an underlying space may exert on structural evolution.
We cluster the resulting subgraphs at varying Zmin values, and find that the ZC of the transition in the giant component depends strongly on the pO of the simulation (Fig. 6 A). The transitions at lower values of pO not only occur at higher Zmin values but are also more gradual in nature. We analyze the degree distributions of graphs produced by this model at both a Zmin of 4 (the Zmin at which edges are defined during the evolutionary simulations) and at the ZC for each particular graph (Fig. 6, C and D) At a Zmin of 4 the degree distributions of the resulting graphs are well fit by stretched exponential functions (Fig. 6 C), whereas at ZC one cannot make a distinction between stretched exponential and power-law fits due to the lack of data at higher values of k. Although not necessarily consistent with the degree distributions of random graphs, these evolved graphs do not exhibit the power-law degree distributions that characterize both the PDUG and the evolutionary graphs produced by our previous model (Dokholyan et al., 2002
).
|
10.8 (resulting from a Q-score of 1, see Eq. 2), representing the highly identical structures possible in both the PDUG and models developed for continuous, arbitrary spaces. In the second model, orphan nodes are chosen in exactly the same manner as before, and thus the algorithm cannot proceed below a Zmin of 4. The choice of connected daughters, however, proceeds quite differently. The new algorithm chooses a Z-score above the cutoff with equal probability from all Z-score bins (including Z > 10). A random node is then chosen from that class of similarity, regardless of whether or not it has already been placed in the subgraph. Complete duplication events occur when a daughter node is chosen from the Z > 10 bin, or when a node that already exists on the graph is chosen from a particular 4 < Z < 10 bin. Not only does this model more faithfully reproduce the fact that extremely similar structures exist on the PDUG, it also allows for the choice of a greater number of nodes that are similar (if not identical) to a parent node (given the Z-score distribution in Fig. 1 A, less similar nodes are entropically favored in the first lattice evolution model proposed).
The transition in the giant component exhibited by the graphs produced under the second algorithm follows a similar pattern to the first, with ZC for the graphs at different values of pO similar between the two models (Figs. 6 A and 7 A). The degree distributions of graphs produced by this model at a Zmin of 4 are also well fit by stretched exponential functions, although the "perfect" duplication events result in nodes of much higher connectivity in these graphs than in those produced by the previous model (compare Figs. 6 C and 7 C). At ZC for each respective pO we observe degree distributions that are well fit by power-law functions (Fig. 7 D), indicating that subgraphs with strong scale-free character can be observed at the critical value of Zmin in these graphs. The power-law exponent of the fit of the degree distribution for the pO = 0.1 graph is -1.6, a value close to that observed for the PDUG and the graphs produced (in certain parameter regimes) by our original model (Dokholyan et al., 2002
).
|
| CONCLUSIONS |
|---|
|
|
|---|
A striking feature of all of the subgraphs we analyze is the distribution of cluster sizes at the critical point. In every case, this distribution follows a power law, and the distributions are similar in terms of power-law exponent regardless of the rules used to generate each subgraph (compare Figs. 47, B). This behavior seems to thus be a universal feature of these polymer subgraphs, and thus cannot be used to distinguish those that have been chosen according to evolutionary rules (Figs. 6 B and 7 B) or physical rules (Fig. 5 B) from those that have been chosen at random (Fig. 4 B). Given that similar distributions have been reported for folds in real proteins (Qian et al., 2001
), our lattice findings highlight the caution that must be employed when interpreting power-law distributions of cluster or fold family sizes in terms of evolutionary processes.
In contrast to this nearly universal behavior, we observe subgraphs with power-law degree distributions only as a result of a very specific evolutionary sampling procedure. This not only demonstrates that scale-free graphs may be derived from such spaces but also that the rules underlying divergent graph evolution models are sufficient to produce this behavior. Although the polymers we employ in this study are only rough approximations to proteins, the evolutionary graphs we produce consist of real structures and not of arbitrary nodes and edges. This data is thus not only useful as a proof of evolutionary principle but may also be seen as a source of structures with a more "realistic" distribution of structural similarity for other studies based on the 3 x 3 x 3 cubic lattice (Mirny and Shakhnovich, 1996
).
One remaining challenge with respect to evolutionary models involves understanding the discovery of orphan structures in real biological systems. In our model we are capable of calculating the degree of candidate orphan nodes and can thus ensure that nodes of degree 0 will be added to the graph. This calculation is most likely not performed in similar kind by evolving organisms, and the question remains as to how a given fraction of orphan nodes will be discovered over the course of protein evolution. Orphans may be a simple consequence of sequence dynamics, or they may result from a need for completely new structures to fulfill functional pressures. Further understanding of this phenomenon will require the development of more detailed models of structural evolution.
It is important to note that the conclusions we draw based on this model system, although quite rigorous due to the completeness of our description of this structural space, are not necessarily transferable to other systems. Although the features of the lattice space do provide a necessary control for structural comparisons involving general compact polymers, one must undertake further studies to assess the extent to which structural spaces that cannot be so fully understood exhibit similar behavior. In the case of polypeptides, it may be possible to use proteinlike decoy structures (Bonneau et al., 2002
; McConkey et al., 2003
) to gain a glimpse of the features of the real protein space beyond those structures that have been crystallized and beyond the set of structures that have been discovered over the course of evolution. Nonetheless, it is clear that the lattice will continue to provide insights into protein structural evolution, and the implications that this evolution carries for the study of protein folding and protein function.
| ACKNOWLEDGEMENTS |
|---|
|
|
|---|
We thank the Howard Hughes Medical Institute and the National Institutes of Health for financial support. N.V.D. acknowledges support from The University of North Carolina at Chapel Hill Research Council grant.
Submitted on June 2, 2003; accepted for publication July 28, 2003.
| REFERENCES |
|---|
|
|
|---|
Barabasi, A. L., and R. Albert. 1999. Emergence of scaling in random networks. Science. 286:509512.
Bonneau, R., C. E. Strauss, C. A. Rohl, D. Chivian, P. Bradley, L. Malmstrom, T. Robertson, and D. Baker. 2002. De novo prediction of three-dimensional structures for major protein families. J. Mol. Biol. 322:6578.[Medline]
Cohen, R., D. Ben-Avraham, and S. Havlin. 2002. Percolation critical exponents in scale-free networks. Phys. Rev. E Stat. Nonlin. Soft Matter Phys. 66:036113-1036113-4.
Dietmann, S., and L. Holm. 2001. Identification of homology in protein structure classification. Nat. Struct. Biol. 8:953957.[Medline]
Dokholyan, N. V., B. Shakhnovich, and E. I. Shakhnovich. 2002. Expanding protein universe and its origin from the biological Big Bang. Proc. Natl. Acad. Sci. USA. 99:1413214136.
England, J. L., and E. I. Shakhnovich. 2003. Structural determinant of protein designability. Phys. Rev. Lett. 90:218101.[Medline]
Finkelstein, A. V., A. M. Gutun, and A. Badretdinov. 1993. Why are the same protein folds used to perform different functions? FEBS Lett. 325:2328.[Medline]
Giribet, G. 2002. Current advances in the phylogenetic reconstruction of metazoan evolution. A new paradigm for the Cambrian explosion? Mol. Phylogenet. Evol. 24:345357.[Medline]
Holm, L., and C. Sander. 1993. Protein structure comparison by alignment of distance matrices. J. Mol. Biol. 233:123138.[Medline]
Holm, L., and C. Sander. 1996. The FSSP database: fold classification based on structure-structure alignment of proteins. Nucleic Acids Res. 24:206209.
Karev, G. P., Y. I. Wolf, A. Y. Rzhetsky, F. S. Berezovskaya, and E. V. Koonin. 2002. Birth and death of protein domains: a simple model of evolution explains power law behavior. BMC Evol. Biol. 2:18.126.[Medline]
Koonin, E. V., Y. I. Wolf, and G. P. Karev. 2002. The structure of the protein universe and genome evolution. Nature. 420:218223.[Medline]
Lesk, A. M., L. Lo Conte, and T. J. Hubbard. 2001. Assessment of novel fold targets in CASP4: predictions of three-dimensional structures, secondary structures, and interresidue contacts. Prot. Suppl. 5:98118.
Li, H., R. Helling, C. Tang, and N. Wingreen. 1996. Emergence of preferred structures in a simple model of protein folding. Science. 273:666669.[Abstract]
McConkey, B. J., V. Sobolev, and M. Edelman. 2003. Discrimination of native protein structures using atom-atom contact scoring. Proc. Natl. Acad. Sci. USA. 100:32153220.
Mirny, L. A., and E. I. Shakhnovich. 1996. How to derive a protein folding potential? A new approach to an old problem. J. Mol. Biol. 264:11641179.[Medline]
Murzin, A. G., S. E. Brenner, T. Hubbard, and C. Chothia. 1995. SCOP: a structural classification of proteins database for the investigation of sequences and structures. J. Mol. Biol. 247:536540.[Medline]
Orengo, C. A., J. E. Bray, D. W. Buchan, A. Harrison, D. Lee, F. M. Pearl, I. Sillitoe, A. E. Todd, and J. M. Thornton. 2002. The CATH protein family database: a resource for structural and functional annotation of genomes. Proteomics. 2:1121.[Medline]
Ponting, C. P., and R. R. Russell. 2002. The natural history of protein domains. Annu. Rev. Biophys. Biomol. Struct. 31:4571.[Medline]
Qian, J., N. M. Luscombe, and M. Gerstein. 2001. Protein family and fold occurrence in genomes: power-law behaviour and evolutionary model. J. Mol. Biol. 313:673681.[Medline]
Shakhnovich, E. I. 1998. Protein design: a perspective from simple tractable models. Fold. Des. 3:R45R58.[Medline]
Shakhnovich, E. I., and A. Gutin. 1990. Enumeration of all compact conformations of copolymers with random sequence links. J. Chem. Phys. 93:59675971.
Tatusov, R. L., D. A. Natale, I. V. Garkavtsev, T. A. Tatusova, U. T. Shankavaram, B. S. Rao, B. Kiryutin, M. Y. Galperin, N. D. Fedorova, and E. V. Koonin. 2001. The COG database: new developments in phylogenetic classification of proteins from complete genomes. Nucleic Acids Res. 29:2228.
Wolynes, P. G. 1996. Symmetry and the energy landscapes of biomolecules. Proc. Natl. Acad. Sci. USA. 93:1424914255.
This article has been cited by other articles:
![]() |
C. B. Roland and E. I. Shakhnovich Divergent Evolution of a Structural Proteome: Phenomenological Models Biophys. J., February 1, 2007; 92(3): 701 - 716. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. J. Deeds and E. I. Shakhnovich The Emergence of Scaling in Sequence-Based Physical Models of Protein Evolution Biophys. J., June 1, 2005; 88(6): 3905 - 3911. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. E. Shakhnovich, E. Deeds, C. Delisi, and E. Shakhnovich Protein structure and evolutionary history determine sequence space topology Genome Res., March 1, 2005; 15(3): 385 - 392. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. J. Deeds, H. Hennessey, and E. I. Shakhnovich Prokaryotic phylogenies inferred from protein structural domains Genome Res., March 1, 2005; 15(3): 393 - 402. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |