| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Molecular Biophysics Unit, Indian Institute of Science, Bangalore 560012, India
Correspondence: Address reprint requests to Saraswathi Vishveshwara, Molecular Biophysics Unit, Indian Institute of Science, Bangalore 560012, India. Tel.: 91-80-22932611; Fax: 91-80-23600535; E-mail: sv{at}mbu.iisc.ernet.in.
| ABSTRACT |
|---|
|
|
|---|
| INTRODUCTION |
|---|
|
|
|---|
Protein structure networks have earlier been constructed with varying definitions of nodes and edges (10
16
). These investigations have focused on elucidating the network properties such as the shortest path length, clustering coefficient, and other small-world properties. The folding behavior of proteins has also been investigated in some of these studies using the structure of the transition state known in some proteins (10
12
). Although this study also considers the protein structures as networks, the method of construction and the analysis of these networks are different from previous studies. Here, the protein structure graphs (PSGs) are constructed by defining the amino acids in the polypeptide chain as the nodes and the noncovalent interactions among them as links. It has been established that such graphs are useful in the identification of clusters of amino acid residues that stabilize the protein structure and protein-protein interfaces (7
,17
20
). An important feature of such a graph is the definition of edges based on the normalized strength of interaction between the amino acid residues in proteins. Interestingly, we find that the network topology of such PSGs depends on the cutoff of the interaction strength between amino acid residues used in the graph construction.
Apart from analyzing the topological properties of the PSG, two other major findings emerge from the definition of edge-weighted PSG in this work. First, at a critical cutoff of interaction strength, we find a transition as probed by the size of the largest cluster. Interestingly, we find that this critical interaction cutoff, which we have evaluated for more than 200 proteins, falls within a narrow range, emphasizing the fact that this transition is a universal behavior of globular proteins. Second, we are able to identify the amino acid residues, which are highly connected and are crucial for the stability of the protein structure network. In the network terminology, these are the equivalent of "hubs". In many real-world cases, the networks are known to be less sensitive to random attacks on nodes but much more susceptible to targeted attacks on hubs (21
). A similar situation may exist in PSGs, where an inappropriate mutation of the hub residues can destabilize the protein structure. We have also analyzed the role of these hubs in bringing together the different secondary structure elements in the protein tertiary structure. Finally, we have demonstrated that the network parameters are able to account for the additional stability of thermophilic proteins. In a broad sense, this analysis yields novel insights into protein structure and stability by elucidating the role of the amino acid side chains in maintaining the unique topology of protein structures. Thus, we believe that this study will be able to motivate new experiments in protein folding, stability, and design.
| MATERIALS AND METHODS |
|---|
|
|
|---|
|
Definition of nodes and edges
Each protein in the data set is represented as a graph consisting of a set of nodes and edges. Each amino acid in the protein structure is represented as a node, and these nodes (amino acids) are connected by edges based on the strength of noncovalent interaction between the side chains of the two amino acid residues. The strength of interaction between two amino acid side chains is evaluated as a percentage given by:
![]() | (1) |
|
Iij is thus evaluated for all the ij pairs in the protein structure. We then choose a cutoff value, Imin and any ij residue pair with Iij > Imin is connected by an edge in the PSG, which has N nodes, where N is the number of amino acid residues in the protein structure. This cutoff (Imin) is varied from 0% (>0% is denoted as 0%) to 10% (very few nodes interact with a value >10%), and the PSG is constructed for all the proteins in the data set at these varying cutoffs. As the interaction cutoff is increased from 0% to 10%, the number of edges in the PSGs decreases because, at higher cutoff, the number of nodes making the high level of interaction will be less. Thus, we are able to quantify the interactions among the side chains of the residues and thus construct amino acid-based PSGs at varying strengths of interaction using this method. Our definition of amino acid interaction is based purely on the number of distance-based contacts between two amino acid residues. (This could further be refined by other factors such as hydrogen bonds and electrostatic interactions, where the energy of interaction can be directly taken into account). The PSGs of all the proteins in the data set, constructed at different Imin values, have been analyzed using various parameters given below.
Analysis of PSGs
Network properties
The networks are analyzed for the distribution of nodes with k links. For each PSG, the number of nodes n with k edges (links), n(k), is evaluated at various Imin values. The cumulative value (ntot(k)) over all proteins in the data set is taken, and then ntot(k) versus k is plotted at different Imin values. Further, we also evaluate the total number of edges or links in a PSG at a given Imin, referred to as ktotal and the ratio of the total number of edges to the total number of nodes in the PSG at a particular Imin, given by ktotal/N (where N is total number of residues or nodes in the protein structure). Both these parameters (ktotal and ktotal/N) are used in understanding the stability of thermophilic proteins.
Size of the largest cluster
The PSG is represented as an adjacency matrix (A), where
j and i and j are connected according to the Imin criterion.
j and i and j are not connected.
Contact number versus interaction strength
It is important to understand the difference between the two parameters, namely, the contact number and the interaction strength, both of which are used in the analysis of the PSGs in this study. The interaction strength is a parameter evaluated between two residues using the number of atom-atom contacts between them as given in either Eq. 1 or 2 (given below). However, the contact number of a residue i is defined as the total number of interactions which it makes with all other residues at a particular cutoff of the interaction strength (Imin). Although the interaction strength is evaluated between a pair of residues i and j and is based on the number of atom-atom contacts between them, the contact number works at a higher level and includes the number of residue-residue contacts made by a residue i at a particular cutoff of the interaction strength. Fig. 1, a and b, elucidates the difference between contact number and interaction strength, where examples of high interaction strengths and high contact number are shown clearly. We obtain the contact number (number of links or edges) of all the residues at varying Imins to analyze the PSGs of all the proteins in the data set. Specifically, we look at the high contact number residues (those which interact with more than four residues in the protein structure), referred to as "hubs" henceforth, at both high and low Imins. As explained earlier, the evaluation of interaction between two residues in a protein structure involves the normalization values of both the residue types. However, for the identification of hubs in a protein structure, it would be accurate to use the normalization value of the hub-forming residue alone. Hence, the interaction equation given in Eq. 1 reduces to the following for hub identification.
![]() | (2) |
Edge distribution profile of amino acids
The contact numbers of each of the 20 amino acid types in all the proteins in the data set (cumulative) were calculated at different Imin values. The number of amino acids of type i with contact numbers varying from 0 (orphans), 1 to 2, 3 to 4, and >4 (hubs) have been obtained using the definition given in Eq. 2 for all the proteins in the data set. The cumulative values have been obtained using all the proteins at desired Imin values for the 20 amino acid types, and the frequency distribution is plotted. This is referred to as the edge distribution profile of amino acids.
The plots presented in this work were obtained using MATLAB (The MathWorks, Natick, MA) and the protein structure figures were generated using VMD (25
).
| RESULTS |
|---|
|
|
|---|
Distribution of the nodes with k links as a function of the interaction criterion
The plot of the number of nodes (ntot(k)) with k links (cumulative value over all proteins in the data set), as a function of the number of links (k) at various interaction cutoffs is shown in Fig. 2. This plot gives us an idea of the number of orphans (k = 0) and the number of hubs (k > 4) in the PSGs at various interaction cutoffs (Imin). As the interaction cutoff is increased, ntot(k) decreases in general for most of the k values. However, at lower Imin values (04%), the number of nodes with less than two links is small, thereby giving rise to a bell-shaped curve. At Imin values
4.56% a sigmoidal curve is obtained, and at Imin >6% the curves show a steep decay behavior. At Imin = 4.5, the number of orphans in the PSGs exceeds the number of nodes with any k connections with k > 0 and this number keeps increasing when Imin is further increased. Since the nature of the distribution shown in Fig. 2 varies from bell shaped to sigmoidal to decay with increasing Imin, the PSGs certainly show a complex behavior. However, it is a consistent one, seen for a large number of proteins of various sizes and folds. It can be noted that the maximum number of edges made by any node in the PSGs in the complete range of Imin values is 12, and the maximum size of the PSGs is only
1500 nodes (this may be higher in the case of multimers). Hence, the PSGs are small networks when compared to most of the real-world networks analyzed (21
). The results presented in Fig. 2, represent a cumulative value over all the proteins in the data set. Nevertheless, an examination of the behavior of n(k) versus k for individual proteins qualitatively shows the same behavior of network topology, irrespective of the protein size.
|
ranging from 1.2 to 2.3. In essence, the PSGs seem to behave in a complex manner with varied network topologies at different interaction cutoffs.
Size of the largest cluster as a function of the interaction cutoff
The size of the largest cluster (or the giant component) is often used to understand the nature and properties of graphs (21
) and to assess whether there is a phase transition from the percolation point of view (26
). Here, we have monitored the variations in the size of the largest cluster with Imin values in all the proteins in the data set. The normalized size of the largest cluster (in terms of the number of nodes) is plotted as a function of Imin for a set of 200 proteins, belonging to various sizes and folds (Fig. 3). It is evident from Fig. 3 that irrespective of the protein size or fold, the size of the largest cluster in each of the proteins undergoes a transition at a particular Imin value. This Imin value at which the size of the largest cluster decreases dramatically (i.e., the midpoint of the transition) is termed Icritical. The plots in Fig. 3 are similar to the phase transition curves described by percolation theory and observed in physical systems (26
). Surprisingly, these plots show that Icritical, where this transition occurs, is within a narrow range for proteins of all sizes and folds. The standard deviation of Icritical is 0.9 around a mean of
3.9. We find that >85% of the proteins have an Icritical varying between 3.0 and 5.0, which is a significantly narrow range. However, Icritical is a function of the size of the protein and is generally higher for bigger proteins as indicated by the spread of the plots in Fig. 3. Thus, mean Icritical is
3.25% in proteins with 100200 residues, 3.75% in those with 200300 residues, 4.25% in those with 300400 residues, and >4.25% in those with 4001300 residues. When the proteins are segregated into bins of varying sizes, the standard deviation of the Icritical varies from 0.60.7, which further confirms the point that Icritical is dependent on protein size to a small extent. The critical Imin values varying from 3.0% to 5.0% are close to the Imin values discussed earlier (4.5%), where the number of orphans in the PSGs exceeds the number of nodes with any k connections with k > 0. In physical terms, a transition from one giant cluster to small disjoint clusters occurs around Imin = Icritical. This transition reveals that there are large numbers of residue pairs in the protein structures, which have an interaction strength value (Iij) around the region of 4%, which is the critical Imin value. Hence, an interaction cutoff (Imin) of 4% or above makes a large number of residues lose a lot of these contacts, thus causing a sudden drop in the size of the largest cluster and leading to the transition seen in Fig. 3. This transition is indicative of the fact that the PSG exists as a completely connected giant cluster at Imin values lower than Icritical (
4.5%), and these separate into smaller disjoint clusters at higher Imin values.
|
|
The edge distribution profile (Fig. 4) shows the significant loss of weak interactions when Imin is increased from 0% to 4%, which leads to the transition shown in Fig. 3. A pictorial representation of the hubs and clusters determined in barnase (1RNB) at Imin = 0% and Imin = 6% are shown in the supplementary figure (Fig. S1) to elucidate this aspect. The significance of weak connections in a network has been earlier demonstrated by Granovetter during his quest for understanding social networks (27
). Similarly, from the PSGs obtained at lower Imin values, we find that the weak interactions play an important role in maintaining the integrity of the PSGs, whereas the strong interactions are undoubtedly essential for the stability of protein structures.
The role of hubs in integrating secondary structures
We have analyzed the secondary structure preferences of the hubs as well as that of the residues with which the hubs interact. This provides information on the role of hubs in bringing together different secondary structural elements within the protein structure. The secondary structures of the amino acid residues in the protein structures have been obtained using the DSSP program (28
). The hubs and the residues with which they interact are classified as belonging to helices (
, 310,
), extended regions, turns (including bends), or unassigned regions (mainly loops). We find that most of the hubs belong to the regular secondary structural regions of helices and sheets though the loops, turns, and the unassigned regions are not excluded at any Imin (data not shown).
The distribution of the secondary structures of the residues interacting with these hubs at any Imin showed that the hubs interact with residues from both regular and nonregular secondary structural elements. We also find that these structural hub-forming residues form many inter- and intrasecondary structural contacts, thereby integrating different regions of the protein tertiary structure. Fig. 5 shows an example of a hub along with its interacting residues in a protein structure. It can be seen from the figure that the hub-forming phenylalanine residue, which belongs to a helix, interacts with residues belonging to different secondary structures, including a strand, another helix, and some loop regions. Hence, there is a clear indication of the stitching together of different secondary structures through the side-chain interactions of the hubs. Therefore, these hubs play a significant role in intersecondary structural interactions in the folded tertiary structure of the protein.
|
There are a few exceptions where the mesophilic protein performs better than the thermophilic one as indicated in Table 1. For example, in neutral protease, the number of hubs at 4% and the size of the largest cluster at 2% show a discrepancy, with the mesophilic protein having a higher value than the thermophilic protein. However, in this case, the total number of edges and the edge/node ratio show a better profile for the thermophilic protein than the mesophilic protein at all Imins. Further, the size of the largest cluster at 4% is significantly higher in the thermophilic protein than the mesophilic protein, thus compensating for the other losses by making many stronger interactions. Similarly, in the case of phopshoglycerate kinase, the number of hubs and the total number of edges in mesophilic protein are higher than that in the thermophilic one at Imin = 4%, though the edge/node ratio and the size of the largest cluster are not. However, in this case, all four parameters at 0% and 2% show a much higher percentage in the thermophilic protein than the mesophilic one. This may indicate that the lack of strong interactions at a higher Imin in the thermophile is made up significantly of a very large number of weak interactions at lower Imin. Phosphofructo kinase and TATA box-binding protein also exhibit some deviations from the trend in some parameters; however, the thermophilic counterparts of these proteins score better with some other parameters. In all the other proteins shown in Table 1, the trend observed in the number of hubs, total number of edges, the edge/node ratio, and the size of the largest cluster are quite straightforward, with the numbers being higher for the thermophilic counterpart than the mesophilic protein at all Imins. Thus, in general, there is very good correlation between the network parameters evaluated here and the additional stability of thermophilic proteins, with reasonably valid explanation for the few cases of exception. This analysis clearly shows that the network representation of protein structures presented in this work and the hubs identified are extremely useful in understanding protein stability.
A cartoon representation of the differences in the hubs (at Imin = 4%) of the thermophilic and the mesophilic carboxy peptidase is depicted in Fig. 6, which clearly shows more hubs in the thermophile than the mesophile. It should be noted that the common hubs in the thermophilic and mesophilic proteins are limited and the additional ones in the two proteins are not present in structurally identical positions. Further, the figure also shows that the backbone topologies of both the thermophilic and mesophilic proteins are very similar and hence it is the interactions involving the side chains that impart additional stability to the thermophilic proteins, which is what has been considered in the PSG representation presented in this work. Hubs, which are conserved in sequence, are likely to be more important from the biological perspective, and hence, this aspect is analyzed in the following subsection.
|
On mapping the strong hubs obtained at Imin = 4% (82 in total) onto the multiple sequence alignments of the thermophiles and mesophiles in each of the 10 families, we find that these hubs fall into four distinct categories according to their conservation. These include the common hubs, the exclusive hubs, the nonexclusive hubs, and the nonconserved hubs. The definitions and features of these four types of hubs are described below, and the relevant results are summarized in Table 2.
30% of the total hubs obtained at Imin = 4%. The only family without any exclusive hub is the neutral protease, whereas all others have at least one exclusive hub, which is specific to the thermophile or the mesophile. The common and exclusive hubs together are referred to as conserved hubs. We find that the aromatic and charged residues are preferred in these conserved hubs in both thermophilic and mesophilic proteins (Table 2).
|
| DISCUSSIONS |
|---|
|
|
|---|
12, which is very low when compared to the other real-world networks, where there are no restraints with respect to the number of connections acquired by a single node.
The PSGs also differ from many other complex networks in regard to the network growth. Most of the real-world networks are known to grow with time, i.e., the number of nodes in the network generally increases with time (21
). In case of the PSGs, the sizes of the proteins selected by nature range from
50 to 1500 amino acids. This range is fairly constant and has been stabilized during the course of evolution. Though the size of proteins range from
50 to 1500 amino acid residues, the bigger proteins form multiple structural modules (called domains) of similar size of
150200 amino acids. As a result, the larger proteins are made up of modules of individual domains. Thus, the protein domain networks have attained their size limits, and therefore the network growth aspect in the PSG is no longer a relevant factor. Apart from the analysis of the network topology of PSGs, this study has also provided insights into the role of amino acid hubs as sources of robustness and stability in protein structure as discussed in the following section.
PSGs and stability of thermophilic proteins
Various theoretical (from analysis of protein sequences and structures) and experimental (using protein engineering methods) studies have attributed the thermal stability of thermophilic proteins to different factors like higher salt bridges, hydrogen bonds, hydrophobic interactions, aromatic interactions, and better internal packing (1
7
). One of the conclusions from all these studies has been that the additional stability of different thermophilic proteins is not a consequence of a single factor. Instead it is a combined effect of various subtle interactions characteristic of each protein. Hence, we thought it appropriate to combine all these factors under a single umbrella and then study the thermophilic proteins from a broader perspective. This we achieve using a network representation of protein structures presented in this work, which considers all kinds of interactions in the protein structure without any discrimination and also takes into account the global topology of the protein structure. Although the strengths of individual interactions are not considered, a crude estimate of the interaction strength is incorporated on the basis of the number of atom-atom contacts between the interacting side chains. We then evaluate different well-known network parameters like the size of the largest cluster, total number of hubs, edge/node ratio, and the total number of edges in a set of 10 thermophilic proteins and their mesophilic counterparts. The analysis of these network parameters showed that in general, the thermophilic proteins have a higher magnitude of these network parameters than the mesophilic proteins. Even in cases where the mesophilic proteins performed better than the thermophilic proteins, we find that the losses in the thermophilic proteins are compensated in various ways, as discussed in the Results section. Though the analysis of the thermophilic proteins from an overall network perspective has given a better picture of the factors involved in their stability and though we find that the network parameters correlate well with the stability of these proteins, we also find that there is no single parameter that can be used as a measure to predict their stability. Some thermophilic proteins make more weak interactions, whereas some make more numbers of stronger interactions. Some of these proteins spread these interactions across the protein structure, giving rise to large interconnected clusters with many weak hubs, whereas some others concentrate their interactions in a particular location of the structure, thereby giving rise to smaller and stronger clusters with more numbers of stronger hubs. It only seems to emphasize the fact that each protein has its own way of achieving the additional stability, and hence a combination of all the network parameters presented here gives a better knowledge of the factors responsible for the stability of these proteins. Hence, the network representation of protein structures and the analysis of the network parameters have significantly improved the understanding of the principles involved in stabilizing the folded three-dimensional structure of proteins.
Hubs in protein structures
From the network perspective, it is known that the role of hubs in a network is to provide robustness to the network against random attacks (21
). Moreover, protein structures are made up of a significant number of strongly and weakly interacting amino acid hubs, which integrate different regions of the polypeptide chain, thereby stabilizing the tertiary structure of the protein. These hubs possibly provide robustness to the protein structures against random mutations. Hence, in protein structures, mutation of a single residue chosen randomly may not affect the protein structure or stability unless it is a very crucial hub. Therefore, it is important to carry out mutations of multiple residues (specifically the hub-forming amino acids) simultaneously to significantly destabilize the amino acid networks involved in stabilizing the protein structures. Our study offers a rational method for choosing these important residues in the protein structure by identifying the hubs. Further, this study also shows how the hubs aid in stabilizing the thermophilic proteins in comparison to their mesophilic counterparts.
| CONCLUSIONS |
|---|
|
|
|---|
| SUPPLEMENTARY MATERIAL |
|---|
|
|
|---|
| ACKNOWLEDGEMENTS |
|---|
|
|
|---|
We acknowledge the Computational Genomics Initiative at the Indian Institute of Science, funded by the Department of Biotechnology, India, for support. K.V.B. thanks the Council of Scientific and Industrial Research, India, for the award of a fellowship.
Submitted on April 11, 2005; accepted for publication August 9, 2005.
| REFERENCES |
|---|
|
|
|---|
2. Ladenstein, R., and G. Antranikian. 1998. Proteins from hyperthermophiles: stability and enzymatic catalysis close to the boiling point of water. Adv. Biochem. Eng. Biotechnol. 61:3782.[Medline]
3. Nicholson, H., W. J. Becktel, and B. J. Matthews. 1988. Enhanced protein thermostability from designed mutations that interact with
-helix dipoles. Nature. 336:651656.[CrossRef][Medline]
4. Serrano, L., A. G. Day, and A. R. Fersht. 1993. Step-wise mutation of barnase to binase. A procedure for engineering increased stability of proteins and an experimental analysis of the evolution of protein stability. J. Mol. Biol. 233:305312.[CrossRef][Medline]
5. Querol, E., J. A. Perez-Pons, and A. Mozo-Villarias. 1996. Analysis of protein conformational characteristics related to thermostability. Protein Eng. 9:265271.
6. Szilagyi, A., and P. Zavodszky. 2000. Structural differences between mesophilic, moderately thermophilic and extremely thermophilic protein sub-units: results of a comprehensive survey. Structure. 8:493504.[Medline]
7. Kannan, N., and S. Vishveshwara. 2000. Aromatic clusters: a determinant of thermal stability of thermophilic proteins. Protein Eng. 13:753761.
8. Onuchic, J. N., and P. G. Wolynes. 2004. Theory of protein folding. Curr. Opin. Struct. Biol. 14:7075.[CrossRef][Medline]
9. Fersht, A. R., and V. Daggett. 2002. Protein folding and unfolding at atomic resolution. Cell. 108:120.[CrossRef][Medline]
10. Vendruscolo, M., N. V. Dokholyan, E. Paci, and M. Karplus. 2002. Small-world view of the amino acids that play a key role in protein folding. Phys. Rev. E. 65:061910.[CrossRef]
11. Vendruscolo, M., E. Paci, C. M. Dobson, and M. Karplus. 2001. Three key residues form a critical contact network in a protein folding transition state. Nature. 409:641645.[CrossRef][Medline]
12. Dokholyan, N. V., L. Li, F. Ding, and E. I. Shakhnovich. 2002. Topological determinants of protein folding. Proc. Natl. Acad. Sci. USA. 99:86378641.
13. Amitai, G., A. Shemesh, E. Sitbon, M. Shklar, D. Netanely, I. Venger, and S. Pietrokovski. 2004. Network analysis of protein structures identifies functional residues. J. Mol. Biol. 344:11351146.[CrossRef][Medline]
14. Atilgan, A. R., P. Akan, and C. Baysal. 2004. Small-world communication of residues and significance for protein dynamics. Biophys. J. 86:8591.
15. Greene, L. H., and V. A. Higman. 2003. Uncovering network systems within protein structures. J. Mol. Biol. 334:781791.[CrossRef][Medline]
16. Bagler, G., and S. Sinha. 2005. Network properties of protein structures. Physica A. 346:2733.[CrossRef]
17. Kannan, N., and S. Vishveshwara. 1999. Identification of side-chain clusters in protein structures by a graph spectral method. J. Mol. Biol. 292:441464.[CrossRef][Medline]
18. Kannan, N., P. Chander, P. Ghosh, S. Vishveshwara, and D. Chatterji. 2001. Stabilizing interactions in the dimer interface of alpha-subunit in Escherichia coli RNA polymerase: a graph spectral and point mutation study. Protein Sci. 10:4654.
19. Brinda, K. V., N. Kannan, and S. Vishveshwara. 2002. Analysis of homodimeric protein interfaces by graph-spectral methods. Protein Eng. 4:265277.
20. Vishveshwara, S., Brinda K. V., and N. Kannan. 2002. Protein structure: insights from graph theory. J. Theor. Comput. Chem. 1:187211.[CrossRef]
21. Barabasi, A. L. 2002. Linked: The New Science of Networks. Persues Publishing, Cambridge, MA.
22. Berman, H. M., J. Westbrook, Z. Feng, G. Gilliland, T. N. Bhat, H. Weissig, I. N. Shindyalov, and P. E. Bourne. 2000. The protein data bank. Nucleic Acids Res. 28:235242.
23. Henringa, J., and P. Argos. 1991. Side-chain clusters in protein structures and their role in protein folding. J. Mol. Biol. 220:151171.[CrossRef][Medline]
24. West, D. B. 2000. Introduction to Graph Theory. Prentice-Hall of India Private Limited, New Delhi, India.
25. Humphrey, W., A. Dalke, and K. Schulten. 1996. VMD: visual molecular dynamics. J. Mol. Graph. 14:2728, 3338.
26. Stauffer, D. 1985. Introduction to Percolation Theory. Taylor and Francis, London.
27. Granovetter, M. S. 1973. The strength of weak ties. AJS. 78:13601380.[CrossRef]
28. Kabsch, W., and C. Sander. 1983. Dictionary of protein secondary structure: pattern recognition of hydrogen-bonded and geometrical features. Biopolymers. 22:25772637.[CrossRef][Medline]
29. Mizuguchi, K., C. M. Deane, T. L. Blundell, and J. P. Overington. 1998. HOMSTRAD: a database of protein structure alignments for homologous families. Protein Sci. 7:24692471.[Abstract]
30. Amaral, L. A. N., A. Scala, M. Barthélémy, and H. E. Stanley. 2000. Classes of small-world networks. Proc. Natl. Acad. Sci. USA. 97:1114911152.
31. Hoang, T. X., A. Trovato, S. Flavio, J. R. Banavar, and A. Maritan. 2004. Geometry and symmetry presculpt the free-energy landscape of proteins. Proc. Natl. Acad. Sci. USA. 101:79607964.
32. Cohen, G. H. 1997. ALIGN: a program to superimpose protein coordinates, accounting for insertions and deletions. J. Appl. Crystallogr. 30:11601161.[CrossRef]
This article has been cited by other articles:
![]() |
G. Bagler and S. Sinha Assortative mixing in Protein Contact Networks and protein folding kinetics Bioinformatics, July 15, 2007; 23(14): 1760 - 1767. [Abstract] [Full Text] [PDF] |
||||
![]() |
Md. Aftabuddin and S. Kundu Hydrophobic, Hydrophilic, and Charged Amino Acid Networks within Protein Biophys. J., July 1, 2007; 93(1): 225 - 231. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Ghosh, K. V. Brinda, and S. Vishveshwara Dynamics of Lysozyme Structure Network: Probing the Process of Unfolding Biophys. J., April 1, 2007; 92(7): 2523 - 2535. [Abstract] [Full Text] [PDF] |
||||
| |||||||||||||||||||||||||||