| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||

* Niels Bohr Institute, Copenhagen, Denmark; and
Risø National Laboratory, Roskilde, Denmark
Correspondence: Address reprint requests to Karin B. Stibius, E-mail: karin_stibius{at}hotmail.com.
| ABSTRACT |
|---|
|
|
|---|
prey). We have investigated two possible scenarios for the asymmetry in the directed network by developing a biochemical model for the protein-DNA and protein-protein bindings inside the living yeast. One scenario assumes a background activity of bait proteins acting even without the prey, the other scenario explores the asymmetry in the chemistry associated with the bait being automatically located in the right position on the DNA. We conclude that the latter model gives the best description of the observed asymmetry. | INTRODUCTION |
|---|
|
|
|---|
Constructing a model for the two-hybrid detector could be of great importance if it is to be used to examine protein-protein interactions on a large scale. A model of the system can help to improve the experimental setup and help to screen the data for the most reliable interactions. The work presented here is meant to give an idea of some features that are of importance for the two-hybrid experiment.
| RESULTS AND DISCUSSION |
|---|
|
|
|---|
prey). Further, the asymmetry of the data can be quantified in terms of a systematic tendency with proteins acting as bait having larger connectivity than the same proteins acting as prey.
|
|
In our investigation we deal with a total of five networks: a), the real network of protein interactions in the cell, henceforth called the "real network"; b), the two-hybrid experiment gives us the "observed network"; c), we create a "simulated network", and on this simulated network we perform the two different simulated two-hybrid experiments just mentioned, d), model I (random firing); and e), model II (sequestering).
The situation is like reconstructing collision events in nuclear or high energy physics on basis of the incomplete data obtained from the detectors. Thus we want to analyze a situation where
![]() | (1) |
The results from the two models (see Method section and the Supplementary Material) with different values of the parameters used can be seen in Figs. 3 and 4. In Fig. 3 we have investigated the models where we for simplicity have assumed all proteins in the cell to have the same total concentration. For model I (A) we do not see a clear difference between bait and prey, whereas for model II (B) we see clearly different trends for the two. To see whether this effect would still be present if the protein concentrations of bait and prey were systematically higher than that of other individual proteins, Fig. 4 shows the effects of assuming the total concentrations of the bait and prey to be 10 times that of other proteins. This is relevant because bait and prey are typically expressed on multicopy plasmids. Fig. 4 demonstrate that the systematic differences between bait and prey persists at this more realistic setup, although the effects are less pronounced than for the parameters used in Fig. 3.
|
|
1. Overall, such variations tend to decrease the difference between bait and prey connectivity. However, for larger values of the detection threshold, T, the findings from above are nicely reproduced. With varying protein concentrations, the random firing model I always fails to obtain noticeable difference between bait and prey connectivity. In contrast the sequestering model II predicts substantial bait-prey asymmetry for any distribution of protein concentrations, provided a sufficiently large threshold is used. In the Supplementary Material we show figures that substantiate the robustness of our conclusions with respect to initial protein concentrations and choice of threshold values. Of the two hypotheses for the asymmetry we conclude that model II provides the best explanation for the observed features. For example model II is completely consistent with the fact that more proteins act as prey than as bait. We also find that the high connectivities are mostly seen for proteins functioning as bait, an effect not nearly as pronounced in model I. The fact that proteins with prey connectivity kprey = 0 has surprisingly high values of bait connectivity kbait, is also better explained by the sequestering model II. One explanation of this effect could be that if the connectivity of a protein is very large, the free concentration of the protein will be small; see Fig. 2. When a bait protein has a small concentration, we can imagine that because of the very strong binding to the DNA operator site it will be located close to the DNA at all times. For the prey protein, however, this effect does not exist. Here the binding to DNA is only a result of the binding to the bait protein, and this is expected to be a weaker interaction. Proteins with a small free concentration may therefore be seen in the experiment when acting as bait, whereas it will be very difficult for them to be seen binding as prey.
Our approach also opens for analysis of to what extent various network motifs (14
) may survive given the bait-prey asymmetry. In particular we find that for triangles of three interacting proteins it is particularly difficult to survive this asymmetry, and thus a triangle in itself should be taken as an indication of a more reliable/stronger interaction.
To the extent that model II describes the data, our analysis suggests that one should believe prey data for prey proteins with low connectivity, and bait data for proteins acting as bait with high connectivity. This conclusion would be softened if model I is also contributing. That is, if baits acting as autoactivators contribute to some additional interactions and thereby make some weak interactions detectable for bait proteins that have autoactivating that in itself was below the detection threshold. In any case prey data missed many links associated to high connectivity proteins.
| CONCLUSION |
|---|
|
|
|---|
| METHOD |
|---|
|
|
|---|
To examine the models we create a "simulated network" (17
) consisting of N nodes by assuming that their connectivity, k, is power distributed. That is, we assign each node a connectivity drawn from a probability distribution
![]() | (2) |
For each node, i, one chooses a random number
[0;1], and selects its connectivity ki = k such that F(k – 1)
F(k).
After each node, i, is given a certain connectivity, ki, the nodes are sorted by descending connectivities and subsequently connected into a network as described by Trusina et al. (17
). Finally the network is randomized by link swapping as described by Maslov and Sneppen (15
) to generate a truly random connected network with connectivity distribution represented by Eq. 2.
In the models the interest lies in the probability of binding the bait-prey complex to the DNA operator site, because this complex alone is able to activate transcription. The probability of having an operator site, O, with any molecule, X, bound to it can be calculated by Eq. 4, where
is the binding constant of the molecule X to the operator site O on the DNA. Thus [XO] is found by
![]() | (3) |
![]() | (4) |
Model I
In this, the random firing model, we assume that the only molecules that bind are the ones with a bait protein, i.e., free bait, bait-prey complexes, and bait in complex with other molecules, Y. This means that the total concentration of X will be:
![]() | (5) |
![]() | (6) |
In the two-hybrid experiment some baits always activate transcription, and these proteins were not used in the final experiment. Thus in model I we make the hypothesis that the bait proteins will have some binding to the RNA-polymerase and thereby activate transcription, but only at a level insufficient for the survival of the cell. This implies that the threshold for seeing a particular bait will depend on the level of activation that bait protein has. In terms of our model this means that the promoter activity associated to the bait-prey complex Pbait-prey is supplemented with an additional activity associated to the bait itself.
We have simulated the extra bait firing by giving each bait a random value, bait – firing (rb
[0, 0.1]), which is selected from a flat distribution. In the figures in the main text we typically chose threshold T = 0.1, just above the maximal random firing, whereas we in the supplement investigate larger T values. The total activity associated with a given bait-prey complex is then calculated from:
![]() | (7) |
Model II
In this, the sequestering model, we assume that baits bound to other molecules than prey are unable to bind to the DNA operator site. It could, e.g., be that, Y, bound to the bait is a membrane protein, and the complex therefore is located at the membrane. Thus we will have the molecules that are able to bind to the DNA operator site to be:
![]() | (8) |
![]() | (9) |
Calculations
To calculate these probabilities for the two models we need an estimate of the free protein concentrations. These can be found by Eq. 11
![]() | (10) |
![]() | (11) |
In the models we assign binding constants, Kij, between all proteins in the network. Two proteins, i and j, that have a connection in the network are given the same binding constant, Kij = Kbinding. In the model this is given the value, Kbinding = e–5
10–2 (the smaller the value the stronger the binding). This binding strength should be seen in the perspective that typical concentrations of individual proteins are of the order of one, thus representing fairly strong interactions.
Proteins, pi and pk, with no connection in the network are given a binding constant, Kik = Kno binding = 108, which is so large that we effectively disregard all bindings not present in the assumed network (no false positives are possible).
Finally we select the binding constant KD = 10–5 reflecting a very strong binding of the GAL4 binding region to the operator site. Other values of the binding constants have been investigated (see Supplementary Material) without altering the conclusions given in the main article.
In the first iteration we have used a value of [pi]free = 0.1 x [pi]total for all free protein concentrations, but the final free protein concentration is independent of which value is used to begin the numerical simulation. The iteration was continued until the value of the free concentration did not vary more than 10–10.
Now the model networks can be formed by calculating the probability of two proteins binding for each combination of proteins thereby creating a N x N matrix of probabilities. This matrix can be converted to a network by applying a threshold, T, where node i as bait is connected to node j as prey if Pij
T, where we have investigated values of T from 0.05 to 0.9; see also Supplementary Material.
| SUPPLEMENTARY MATERIAL |
|---|
|
|
|---|
| ACKNOWLEDGEMENTS |
|---|
|
|
|---|
| FOOTNOTES |
|---|
Submitted on September 27, 2006; accepted for publication May 30, 2007.
| REFERENCES |
|---|
|
|
|---|
2. Kumar, A., and M. Snyder. 2002. Protein complexes take the bait. Nature. 415:123–124.[CrossRef][Medline]
3. Ho, Y., A. Gruhler, A. Heilbut, G. D. Bader, L. Moore, S. L. Adams, A. Millar, P. Taylor, K. Bennett, K. Boutilier, L. Y. Yang, C. Wolting, et al. 2002. Systematic identification of protein complexes in Saccharomyces cerevisiae by mass spectrometry. Nature. 415:180–183.[CrossRef][Medline]
4. Fields, S., and O. Song. 1989. A novel genetic system to detect protein-protein interactions. Nature. 340:245–246.[CrossRef][Medline]
5. Chien, C., P. Bartel, R. Sternglanz, and S. Fields. 1991. The two-hybrid system: a method to identify and clone genes for proteins that interact with a protein of interest. Proc. Natl. Acad. Sci. USA. 88:9578–9582.
6. Ito, T., T. Chiba, R. Ozawa, M. Yoshida, M. Hattori, and Y. Sakaki. 2001. A comprehensive two-hybrid analysis to explore the yeast protein interactome. Proc. Natl. Acad. Sci. USA. 98:4569–4574.
7. Ito, T., K. Tashiro, S. Muta, R. Ozawa, T. Chiba, M. Nishizawa, K. Yamamoto, S. Kuhara, and Y. Sakaki. 2000. Toward a protein-protein interaction map of the budding yeast: a comprehensive system to examine two-hybrid interactions in all possible combinations between the yeast proteins. Proc. Natl. Acad. Sci. USA. 97:1143–1147.
8. Giot, L., J. S. Bader, C. Brouwer, A. Chaudhuri, B. Kuang, Y. Li, Y. L. Hao, C. E. Ooi, B. Godwin, E. Vitols, G. Vijayadamodar, P. Pochart, et al. 2003. A protein interaction map of Drosophila melanogaster. Science. 302:1727–1736.
9. Uetz, P., and M. Pankratz. 2004. Protein interaction maps on the fly. Nature. 22:43–44.[CrossRef]
10. Stelzl, U., U. Worm, M. Lalowski, C. Haenig, F. H. Brembeck, H. Goehler, M. Stroedicke, M. Zenkner, A. Schoenherr, S. Koeppen, J. Timm, S. Mintzlaff, et al. 2005. A human protein-protein interaction network: a resource for annotating the proteome. Cell. 122:957–968.[CrossRef][Medline]
11. Aloy, P., and R. Russell. 2002. Potential artefacts in protein-interaction networks. FEBS Lett. 530:253–254.[CrossRef][Medline]
12. Mrowka, R., A. Patzak, and H. Herzel. 2001. Is there a bias in proteome research? Genome Res. 11:1971–1973.
13. Ghaemmaghami, S., W. Huh, K. Bower, R. W. Howson, A. Belle, N. Dephour, E. K. O'Shea, and J. S. Weissman. 2003. Global analysis of protein expression in yeast. Nature. 425:737–741.[CrossRef][Medline]
14. Shen-Orr, S. S., R. Milo, S. Mangan, and U. Alon. 2002. Network motifs in the transcriptional regulation of Escherichia coli. Nat. Genet. 31:64–68.[CrossRef][Medline]
15. Maslov, S., and K. Sneppen. 2002. Specificity and stability in topology of protein networks. Science. 296:910–913.
16. Stibius, K. B. 2004. Analysis and modelling of protein interaction networks—a study of the two-hybrid experiment. Master's thesis. Niels Bohr Institute, University of Copenhagen, Copenhagen, Denmark. http://cmol.nbi.dk/thesis/Karin.pdf. [Online].
17. Trusina, A., S. Maslov, P. Minnhagen, and K. Sneppen. 2004. Hierarchy and anti-hierarchy in real and scale free networks. Phys Rev Lett. 92:178702.[CrossRef][Medline]
18. Shea, M. A., and G. K. Ackers. 1985. The OR control system of bacteriophage lambda—a physical-chemical model for gene regulation. J. Mol. Biol. 181:211–230.[CrossRef][Medline]
This article has been cited by other articles:
![]() |
I. Feldman, A. Rzhetsky, and D. Vitkup Network properties of genes harboring inherited disease mutations PNAS, March 18, 2008; 105(11): 4323 - 4328. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |